Please cite the following papers if you use any part of this dataset:
@article{DBLP:journals/pvldb/MozafariSFJM14,
author = {Barzan Mozafari and
Purnamrita Sarkar and
Michael J. Franklin and
Michael I. Jordan and
Samuel Madden},
title = {Scaling Up Crowd-Sourcing to Very Large Datasets: {A} Case for Active
Learning},
journal = {{PVLDB}},
volume = {8},
number = {2},
pages = {125--136},
year = {2014},
}
@article{DBLP:journals/corr/abs-1209-3686,
author = {Barzan Mozafari and
Purnamrita Sarkar and
Michael J. Franklin and
Michael I. Jordan and
Samuel Madden},
title = {Active Learning for Crowd-Sourced Databases},
journal = {CoRR},
volume = {abs/1209.3686},
year = {2012},
}
Questions
For all inquiries please contact mozafari AT umich.edu
The format of each dataset is provided in the corresponding .desc file. The binary files need to be loaded with Matlab. For ease of use I have the MATLAB file that loads the datasets available, named CrowdManager.m
CrowdManager.m