dc.contributor.author |
Beniwal, Himanshu |
|
dc.contributor.author |
Nandagopan D., Kowsik |
|
dc.contributor.author |
Singh, Mayank |
|
dc.contributor.other |
arXiv |
|
dc.coverage.spatial |
United States of America |
|
dc.date.accessioned |
2024-02-28T10:27:37Z |
|
dc.date.available |
2024-02-28T10:27:37Z |
|
dc.date.issued |
2024-02 |
|
dc.identifier.citation |
Beniwal, Himanshu; Nandagopan D., Kowsik and Singh, Mayank, "Remember this event that year? assessing temporal information and reasoning in large language models", arXiv, Cornell University Library, DOI: arXiv:2402.11997, Feb. 2024. |
|
dc.identifier.issn |
2331-8422 |
|
dc.identifier.uri |
https://doi.org/10.48550/arXiv.2402.11997 |
|
dc.identifier.uri |
https://repository.iitgn.ac.in/handle/123456789/9812 |
|
dc.description.abstract |
Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited. This hinders their application in real-world scenarios where understanding the sequential nature of events is crucial. This paper experiments with state-of-the-art models on a novel, large-scale temporal dataset, \textbf{TempUN}, to reveal significant limitations in temporal retention and reasoning abilities. Interestingly, closed-source models indicate knowledge gaps more frequently, potentially suggesting a trade-off between uncertainty awareness and incorrect responses. Further, exploring various fine-tuning approaches yielded no major performance improvements. The associated dataset and code are available at the following URL (this https URL). |
|
dc.description.statementofresponsibility |
by Himanshu Beniwal, Kowsik Nandagopan D. and Mayank Singh |
|
dc.language.iso |
en_US |
|
dc.publisher |
Cornell University Library |
|
dc.title |
Remember this event that year? assessing temporal information and reasoning in large language models |
|
dc.type |
Article |
|