Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model.
While crowdsourcing has become an important means to label data, crowdworkers are not always experts---sometimes they can even be adversarial. Therefore, there is great interest in estimating the ground truth from unreliable labels produced by crowdworkers. The Dawid and Skene (DS) model is one of the most well-known models in the study of crowdsourcing. Despite its practical popularity, theoretical error analysis for the DS model has been conducted only under restrictive assumptions on, e.g., class priors, confusion matrices, and the number of labels each worker provides. In this paper, we derive a minimax error rate under more practical setting for a broader class of crowdsourcing models that includes the DS model as a special case. We further propose the worker clustering model, which is more practical than the DS model under real crowdsourcing settings. Note that the wide applicability of our theoretical analysis allows us to immediately investigate the behavior of this proposed model. Experimental results showed that there is a strong similarity between the lower bound of the minimax error rate derived by our theoretical analysis and the empirical error of the estimated value.
Publisher URL: http://arxiv.org/abs/1802.04551
DOI: arXiv:1802.04551v1
Keeping up-to-date with research can feel impossible, with papers being published faster than you'll ever be able to read them. That's where Researcher comes in: we're simplifying discovery and making important discussions happen. With over 19,000 sources, including peer-reviewed journals, preprints, blogs, universities, podcasts and Live events across 10 research areas, you'll never miss what's important to you. It's like social media, but better. Oh, and we should mention - it's free.
Researcher displays publicly available abstracts and doesn’t host any full article content. If the content is open access, we will direct clicks from the abstracts to the publisher website and display the PDF copy on our platform. Clicks to view the full text will be directed to the publisher website, where only users with subscriptions or access through their institution are able to view the full article.