How robust are the tabular QA models for scientific tables? a study using customized dataset

Show simple item record

dc.contributor.author Ghosh, Akash
dc.contributor.author Sahith, B. Venkata
dc.contributor.author Ganguly, Niloy
dc.contributor.author Goyal, Pawan
dc.contributor.author Singh, Mayank
dc.contributor.other International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
dc.coverage.spatial Italy
dc.date.accessioned 2024-05-21T14:45:03Z
dc.date.available 2024-05-21T14:45:03Z
dc.date.issued 2024-05-20
dc.identifier.citation Ghosh, Akash; Sahith, B. Venkata; Ganguly, Niloy; Goyal, Pawan and Singh, Mayank, "How robust are the tabular QA models for scientific tables? a study using customized dataset", in the International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, IT, May 20-25, 2024.
dc.identifier.uri https://aclanthology.org/2024.lrec-main.724.pdf
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/10067
dc.description.abstract Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning. In recent years, while tabular QA has seen rapid progress, understanding their robustness on scientific information is lacking due to absence of any benchmark dataset. To investigate the robustness of the existing state-of-the-art QA models on scientific hybrid tabular data, we propose a new dataset, “SciTabQA”, consisting of 822 question-answer pairs from scientific tables and their descriptions. With the help of this dataset, we assess the state-of-the-art Tabular QA models based on their ability (i) to use heterogeneous information requiring both structured data (table) and unstructured data (text) and (ii) to perform complex scientific reasoning tasks. In essence, we check the capability of the models to interpret scientific tables and text. Our experiments show that “SciTabQA” is an innovative dataset to study question-answering over scientific heterogeneous data. We benchmark three state-of-the-art Tabular QA models, and find that the best F1 score is only 0.462.
dc.description.statementofresponsibility by Akash Ghosh, B. Venkata Sahith, Niloy Ganguly, Pawan Goyal and Mayank Singh
dc.language.iso en_US
dc.title How robust are the tabular QA models for scientific tables? a study using customized dataset
dc.type Conference Paper


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account