Remember this event that year? assessing temporal information and reasoning in large language models

Show simple item record

dc.contributor.author Beniwal, Himanshu
dc.contributor.author Nandagopan D., Kowsik
dc.contributor.author Singh, Mayank
dc.contributor.other arXiv
dc.coverage.spatial United States of America
dc.date.accessioned 2024-02-28T10:27:37Z
dc.date.available 2024-02-28T10:27:37Z
dc.date.issued 2024-02
dc.identifier.citation Beniwal, Himanshu; Nandagopan D., Kowsik and Singh, Mayank, "Remember this event that year? assessing temporal information and reasoning in large language models", arXiv, Cornell University Library, DOI: arXiv:2402.11997, Feb. 2024.
dc.identifier.issn 2331-8422
dc.identifier.uri https://doi.org/10.48550/arXiv.2402.11997
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/9812
dc.description.abstract Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited. This hinders their application in real-world scenarios where understanding the sequential nature of events is crucial. This paper experiments with state-of-the-art models on a novel, large-scale temporal dataset, \textbf{TempUN}, to reveal significant limitations in temporal retention and reasoning abilities. Interestingly, closed-source models indicate knowledge gaps more frequently, potentially suggesting a trade-off between uncertainty awareness and incorrect responses. Further, exploring various fine-tuning approaches yielded no major performance improvements. The associated dataset and code are available at the following URL (this https URL).
dc.description.statementofresponsibility by Himanshu Beniwal, Kowsik Nandagopan D. and Mayank Singh
dc.language.iso en_US
dc.publisher Cornell University Library
dc.title Remember this event that year? assessing temporal information and reasoning in large language models
dc.type Article


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account