How robust are the tabular QA models for scientific tables? a study using customized dataset

dc.contributor.author	Ghosh, Akash
dc.contributor.author	Sahith, B. Venkata
dc.contributor.author	Ganguly, Niloy
dc.contributor.author	Goyal, Pawan
dc.contributor.author	Singh, Mayank
dc.contributor.other	International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
dc.coverage.spatial	Italy
dc.date.accessioned	2024-05-21T14:45:03Z
dc.date.available	2024-05-21T14:45:03Z
dc.date.issued	2024-05-20
dc.identifier.citation	Ghosh, Akash; Sahith, B. Venkata; Ganguly, Niloy; Goyal, Pawan and Singh, Mayank, "How robust are the tabular QA models for scientific tables? a study using customized dataset", in the International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, IT, May 20-25, 2024.
dc.identifier.uri	https://aclanthology.org/2024.lrec-main.724.pdf
dc.identifier.uri	https://repository.iitgn.ac.in/handle/123456789/10067
dc.description.abstract	Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning. In recent years, while tabular QA has seen rapid progress, understanding their robustness on scientific information is lacking due to absence of any benchmark dataset. To investigate the robustness of the existing state-of-the-art QA models on scientific hybrid tabular data, we propose a new dataset, “SciTabQA”, consisting of 822 question-answer pairs from scientific tables and their descriptions. With the help of this dataset, we assess the state-of-the-art Tabular QA models based on their ability (i) to use heterogeneous information requiring both structured data (table) and unstructured data (text) and (ii) to perform complex scientific reasoning tasks. In essence, we check the capability of the models to interpret scientific tables and text. Our experiments show that “SciTabQA” is an innovative dataset to study question-answering over scientific heterogeneous data. We benchmark three state-of-the-art Tabular QA models, and find that the best F1 score is only 0.462.
dc.description.statementofresponsibility	by Akash Ghosh, B. Venkata Sahith, Niloy Ganguly, Pawan Goyal and Mayank Singh
dc.language.iso	en_US
dc.title	How robust are the tabular QA models for scientific tables? a study using customized dataset
dc.type	Conference Paper

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Conference Papers [286]

Show simple item record

Search Digital Repository

Browse

All of DSpace
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Degree
- By Department

How robust are the tabular QA models for scientific tables? a study using customized dataset

Files in this item

This item appears in the following Collection(s)

Search Digital Repository

Browse

All of DSpace

This Collection

My Account