Unlocking model insights: a dataset for automated model card generation

dc.contributor.author	Singh, Shruti
dc.contributor.author	Lodwal, Hitesh
dc.contributor.author	Malwat, Husain
dc.contributor.author	Thakur, Rakesh
dc.contributor.author	Singh, Mayank
dc.coverage.spatial	United States of America
dc.date.accessioned	2023-10-07T13:21:08Z
dc.date.available	2023-10-07T13:21:08Z
dc.date.issued	2023-09
dc.identifier.citation	Singh, Shruti; Lodwal, Hitesh; Malwat, Husain; Thakur, Rakesh and Singh, Mayank, "Unlocking model insights: a dataset for automated model card generation", arXiv, Cornell University Library, DOI: arXiv:2309.12616, Sep. 2023.
dc.identifier.issn	2331-8422
dc.identifier.uri	https://doi.org/10.48550/arXiv.2309.12616
dc.identifier.uri	https://repository.iitgn.ac.in/handle/123456789/9344
dc.description.abstract	Language models (LMs) are no longer restricted to the ML community, and instruction-tuned LMs have led to a rise in autonomous AI agents. As the accessibility of LMs grows, it is imperative that an understanding of their capabilities, intended usage, and development cycle also improves. Model cards are a popular practice for documenting detailed information about an ML model. To automate model card generation, we introduce a dataset of 500 question-answer pairs for 25 ML models that cover crucial aspects of each model, such as its training configurations, datasets, biases, architecture details, and training resources. We employ annotators to extract the answers from the original paper. Further, we explore the capabilities of LMs in generating model cards by answering questions. Our initial experiments with ChatGPT-3.5, LLaMa, and Galactica reveal a significant gap in these LMs' understanding of research papers and in their ability to generate factual textual responses. We posit that our dataset can be used to train models to automate the generation of model cards from paper text and reduce human effort in the model card curation process.
dc.description.statementofresponsibility	by Shruti Singh, Hitesh Lodwal, Husain Malwat, Rakesh Thakur and Mayank Singh
dc.language.iso	en_US
dc.publisher	Cornell University Library
dc.title	Unlocking model insights: a dataset for automated model card generation
dc.type	Article
dc.relation.journal	arXiv

Files in this item

There are no files associated with this item.

This item appears in the following Collection(s)

E-print Articles [183]
