Unveiling the multi-annotation process: examining the influence of annotation quantity and instance difficulty on model performance

dc.contributor.author	Kadasi, Pritam
dc.contributor.author	Singh, Mayank
dc.coverage.spatial	United States of America
dc.date.accessioned	2023-11-09T11:13:00Z
dc.date.available	2023-11-09T11:13:00Z
dc.date.issued	2023-10
dc.identifier.citation	Kadasi, Pritam and Singh, Mayank, "Unveiling the multi-annotation process: examining the influence of annotation quantity and instance difficulty on model performance", arXiv, Cornell University Library, DOI: arXiv:2310.14572, Oct. 2023.
dc.identifier.issn	2331-8422
dc.identifier.uri	https://doi.org/10.48550/arXiv.2310.14572
dc.identifier.uri	https://repository.iitgn.ac.in/handle/123456789/9439
dc.description.abstract	The NLP community has long advocated for the construction of multi-annotator datasets to better capture the nuances of language interpretation, subjectivity, and ambiguity. This paper conducts a retrospective study to show how performance scores can vary when a dataset expands from a single annotation per instance to multiple annotations. We propose a novel multi-annotator simulation process to generate datasets with varying annotation budgets. We show that similar datasets with the same annotation budget can lead to varying performance gains. Our findings challenge the popular belief that models trained on multi-annotation examples always lead to better performance than models trained on single or few-annotation examples.
dc.description.statementofresponsibility	by Pritam Kadasi and Mayank Singh
dc.language.iso	en_US
dc.publisher	Cornell University Library
dc.title	Unveiling the multi-annotation process: examining the influence of annotation quantity and instance difficulty on model performance
dc.type	Article
dc.relation.journal	arXiv

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

E-print Articles [183]

Show simple item record

Search Digital Repository

Browse

All of DSpace
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Degree
- By Department

Unveiling the multi-annotation process: examining the influence of annotation quantity and instance difficulty on model performance

Files in this item

This item appears in the following Collection(s)

Search Digital Repository

Browse

All of DSpace

This Collection

My Account