Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs

Show simple item record

dc.contributor.author Beniwal, Himanshu
dc.contributor.author Panda, Sailesh
dc.contributor.author Singh, Mayank
dc.coverage.spatial United States of America
dc.date.accessioned 2025-03-06T09:37:55Z
dc.date.available 2025-03-06T09:37:55Z
dc.date.issued 2025-02
dc.identifier.citation Beniwal, Himanshu; Panda, Sailesh and Singh, Mayank, "Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs", arXiv, Cornell University Library, DOI: arXiv:2502.16901, Feb. 2025.
dc.identifier.uri http://arxiv.org/abs/2502.16901
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/11087
dc.description.abstract We explore Cross-lingual Backdoor ATtacks (X-BAT) in multilingual Large Language Models (mLLMs), revealing how backdoors inserted in one language can automatically transfer to others through shared embedding spaces. Using toxicity classification as a case study, we demonstrate that attackers can compromise multilingual systems by poisoning data in a single language, with rare tokens serving as specific effective triggers. Our findings expose a critical vulnerability in the fundamental architecture that enables cross-lingual transfer in these models. Our code and data are publicly available at this https URL.
dc.description.statementofresponsibility by Himanshu Beniwal, Sailesh Panda and Mayank Singh
dc.language.iso en_US
dc.publisher Cornell University Library
dc.title Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs
dc.type Article
dc.relation.journal arXiv


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account