AI Trends

Learning without Pointillistic Labels using Data Programming – La Biblia de la IA – The Bible of AI™ Journal


R0:088528eaae518c6f09835c249f9a8635-The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming -

🔘 Paper page: arxiv.org/abs/2108.10921

Abstract

Most advanced supervised Machine Learning (ML) models rely on vast amounts of point-by-point labelled training examples. Hand-labelling vast amounts of data may be tedious, expensive, and error-prone. Recently, some studies have explored the use of diverse sources of weak supervision to produce competitive end model classifiers. In this paper, we survey recent work on weak supervision, and in particular, we investigate the Data Programming (DP) framework. Taking a set of potentially noisy heuristics as input, DP assigns denoised probabilistic labels to each data point in a dataset using a probabilistic graphical model of heuristics. We analyze the math fundamentals behind DP and demonstrate the power of it by applying it on two real-world text classification tasks. Furthermore, we compare DP with pointillistic active and semi-supervised learning techniques traditionally applied in data-sparse settings.


Authors

Chufan Gao, Mononito Goswami


Liked this post? Follow this blog to get more. 

#Learning #Pointillistic #Labels #Data #Programming #Biblia #Bible #Journal

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button

Adblocker Detected

Please Turn off Ad blocker