4.6 Article

Learning from crowdsourced labeled data: a survey

期刊

ARTIFICIAL INTELLIGENCE REVIEW
卷 46, 期 4, 页码 543-576

出版社

SPRINGER
DOI: 10.1007/s10462-016-9491-9

关键词

Crowdsourcing; Learning from crowds; Multiple noisy labeling; Label quality; Learning model quality; Ground truth inference

资金

  1. China Postdoctoral Science Foundation [2016M590457]
  2. Program for Changjiang Scholars and Innovative Research Team in University (PCSIRT) of the Ministry of Education, China [IRT13059]
  3. National 973 Program of China [2013CB329604]
  4. US National Science Foundation [IIS-1115417]
  5. Direct For Computer & Info Scie & Enginr
  6. Div Of Information & Intelligent Systems [1115417] Funding Source: National Science Foundation

向作者/读者索取更多资源

With the rapid growing of crowdsourcing systems, quite a few applications based on a supervised learning paradigm can easily obtain massive labeled data at a relatively low cost. However, due to the variable uncertainty of crowdsourced labelers, learning procedures face great challenges. Thus, improving the qualities of labels and learning models plays a key role in learning from the crowdsourced labeled data. In this survey, we first introduce the basic concepts of the qualities of labels and learning models. Then, by reviewing recently proposed models and algorithms on ground truth inference and learning models, we analyze connections and distinctions among these techniques as well as clarify the level of the progress of related researches. In order to facilitate the studies in this field, we also introduce open accessible real-world data sets collected from crowdsourcing systems and open source libraries and tools. Finally, some potential issues for future studies are discussed.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据