Crowd-Calibrator: Can Annotator Disagreement Inform Calibration in...
Subjective tasks in NLP have been mostly relegated to objective standards, where the gold label is decided by taking the majority vote. This obfuscates annotator disagreement and the inherent...
arxiv.org