Remember this event that year? assessing temporal information and reasoning in large language models

dc.contributor.author	Beniwal, Himanshu
dc.contributor.author	Nandagopan D., Kowsik
dc.contributor.author	Singh, Mayank
dc.contributor.other	arXiv
dc.coverage.spatial	United States of America
dc.date.accessioned	2024-02-28T10:27:37Z
dc.date.available	2024-02-28T10:27:37Z
dc.date.issued	2024-02
dc.identifier.citation	Beniwal, Himanshu; Nandagopan D., Kowsik and Singh, Mayank, "Remember this event that year? assessing temporal information and reasoning in large language models", arXiv, Cornell University Library, DOI: arXiv:2402.11997, Feb. 2024.
dc.identifier.issn	2331-8422
dc.identifier.uri	https://doi.org/10.48550/arXiv.2402.11997
dc.identifier.uri	https://repository.iitgn.ac.in/handle/123456789/9812
dc.description.abstract	Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited. This hinders their application in real-world scenarios where understanding the sequential nature of events is crucial. This paper experiments with state-of-the-art models on a novel, large-scale temporal dataset, \textbf{TempUN}, to reveal significant limitations in temporal retention and reasoning abilities. Interestingly, closed-source models indicate knowledge gaps more frequently, potentially suggesting a trade-off between uncertainty awareness and incorrect responses. Further, exploring various fine-tuning approaches yielded no major performance improvements. The associated dataset and code are available at the following URL (this https URL).
dc.description.statementofresponsibility	by Himanshu Beniwal, Kowsik Nandagopan D. and Mayank Singh
dc.language.iso	en_US
dc.publisher	Cornell University Library
dc.title	Remember this event that year? assessing temporal information and reasoning in large language models
dc.type	Article

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

E-print Articles [183]

Show simple item record

Search Digital Repository

Browse

All of DSpace
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Degree
- By Department

Remember this event that year? assessing temporal information and reasoning in large language models

Files in this item

This item appears in the following Collection(s)

Search Digital Repository

Browse

All of DSpace

This Collection

My Account