dc.contributor.author |
Beniwal, Himanshu |
|
dc.contributor.author |
Panda, Sailesh |
|
dc.contributor.author |
Singh, Mayank |
|
dc.coverage.spatial |
United States of America |
|
dc.date.accessioned |
2025-03-06T09:37:55Z |
|
dc.date.available |
2025-03-06T09:37:55Z |
|
dc.date.issued |
2025-02 |
|
dc.identifier.citation |
Beniwal, Himanshu; Panda, Sailesh and Singh, Mayank, "Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs", arXiv, Cornell University Library, DOI: arXiv:2502.16901, Feb. 2025. |
|
dc.identifier.uri |
http://arxiv.org/abs/2502.16901 |
|
dc.identifier.uri |
https://repository.iitgn.ac.in/handle/123456789/11087 |
|
dc.description.abstract |
We explore Cross-lingual Backdoor ATtacks (X-BAT) in multilingual Large Language Models (mLLMs), revealing how backdoors inserted in one language can automatically transfer to others through shared embedding spaces. Using toxicity classification as a case study, we demonstrate that attackers can compromise multilingual systems by poisoning data in a single language, with rare tokens serving as specific effective triggers. Our findings expose a critical vulnerability in the fundamental architecture that enables cross-lingual transfer in these models. Our code and data are publicly available at this https URL. |
|
dc.description.statementofresponsibility |
by Himanshu Beniwal, Sailesh Panda and Mayank Singh |
|
dc.language.iso |
en_US |
|
dc.publisher |
Cornell University Library |
|
dc.title |
Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs |
|
dc.type |
Article |
|
dc.relation.journal |
arXiv |
|