The inefficiency of language models in scholarly retrieval: an experimental walk-through

Show simple item record

dc.contributor.author Singh, Shruti
dc.contributor.author Singh, Mayank
dc.date.accessioned 2022-04-06T05:31:53Z
dc.date.available 2022-04-06T05:31:53Z
dc.date.issued 2022-03
dc.identifier.citation Singh, Shruti and Singh, Mayank, "The inefficiency of language models in scholarly retrieval: an experimental walk-through", arXiv, Cornell University Library, DOI: arXiv:2203.15364, Mar. 2022. en_US
dc.identifier.issn
dc.identifier.uri http://arxiv.org/abs/2203.15364
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/7644
dc.description.abstract Language models are increasingly becoming popular in AI-powered scientific IR systems. This paper evaluates popular scientific language models in handling (i) short-query texts and (ii) textual neighbors. Our experiments showcase the inability to retrieve relevant documents for a short-query text even under the most relaxed conditions. Additionally, we leverage textual neighbors, generated by small perturbations to the original text, to demonstrate that not all perturbations lead to close neighbors in the embedding space. Further, an exhaustive categorization yields several classes of orthographically and semantically related, partially related, and completely unrelated neighbors. Retrieval performance turns out to be more influenced by the surface form rather than the semantics of the text.
dc.description.statementofresponsibility by Shruti Singh and Mayank Singh
dc.language.iso en_US en_US
dc.publisher Cornell University Library en_US
dc.subject Language models en_US
dc.subject IR systems en_US
dc.subject Short-query texts en_US
dc.subject Retrieval performance en_US
dc.subject Textual neighbors en_US
dc.title The inefficiency of language models in scholarly retrieval: an experimental walk-through en_US
dc.type Pre-Print en_US
dc.relation.journal arXiv


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account