Unlocking model insights: a dataset for automated model card generation

dc.contributor.author Singh, Shruti
dc.contributor.author Lodwal, Hitesh
dc.contributor.author Malwat, Husain
dc.contributor.author Thakur, Rakesh
dc.contributor.author Singh, Mayank
dc.coverage.spatial United States of America
dc.date.accessioned 2023-10-07T13:21:08Z
dc.date.available 2023-10-07T13:21:08Z
dc.date.issued 2023-09
dc.identifier.citation Singh, Shruti; Lodwal, Hitesh; Malwat, Husain; Thakur, Rakesh and Singh, Mayank, "Unlocking model insights: a dataset for automated model card generation", arXiv, Cornell University Library, DOI: arXiv:2309.12616, Sep. 2023.
dc.identifier.issn 2331-8422
dc.identifier.uri https://doi.org/10.48550/arXiv.2309.12616
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/9344
dc.description.abstract Language models (LMs) are no longer restricted to the ML community, and instruction-tuned LMs have led to a rise in autonomous AI agents. As the accessibility of LMs grows, it is imperative that an understanding of their capabilities, intended usage, and development cycle also improves. Model cards are a popular practice for documenting detailed information about an ML model. To automate model card generation, we introduce a dataset of 500 question-answer pairs for 25 ML models that cover crucial aspects of each model, such as its training configurations, datasets, biases, architecture details, and training resources. We employ annotators to extract the answers from the original paper. Further, we explore the capabilities of LMs in generating model cards by answering questions. Our initial experiments with ChatGPT-3.5, LLaMa, and Galactica showcase a significant gap in these LMs' understanding of research papers as well as in their ability to generate factual textual responses. We posit that our dataset can be used to train models to automate the generation of model cards from paper text and reduce human effort in the model card curation process.
dc.description.statementofresponsibility by Shruti Singh, Hitesh Lodwal, Husain Malwat, Rakesh Thakur and Mayank Singh
dc.language.iso en_US
dc.publisher Cornell University Library
dc.title Unlocking model insights: a dataset for automated model card generation
dc.type Article
dc.relation.journal arXiv