Please cite the following papers if you use any part of this dataset: @article{DBLP:journals/pvldb/MozafariSFJM14, author = {Barzan Mozafari and Purnamrita Sarkar and Michael J. Franklin and Michael I. Jordan and Samuel Madden}, title = {Scaling Up Crowd-Sourcing to Very Large Datasets: {A} Case for Active Learning}, journal = {{PVLDB}}, volume = {8}, number = {2}, pages = {125--136}, year = {2014}, } @article{DBLP:journals/corr/abs-1209-3686, author = {Barzan Mozafari and Purnamrita Sarkar and Michael J. Franklin and Michael I. Jordan and Samuel Madden}, title = {Active Learning for Crowd-Sourced Databases}, journal = {CoRR}, volume = {abs/1209.3686}, year = {2012}, } For all inquiries please contact mozafari AT umich.edu ------------------ The format of each dataset is provided in the corresponding .desc file. The binary files need to be loaded with Matlab. For ease of use I have the MATLAB file that loads the datasets available, named CrowdManager.m