Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs

dc.contributor.author	Beniwal, Himanshu
dc.contributor.author	Panda, Sailesh
dc.contributor.author	Singh, Mayank
dc.coverage.spatial	United States of America
dc.date.accessioned	2025-03-06T09:37:55Z
dc.date.available	2025-03-06T09:37:55Z
dc.date.issued	2025-02
dc.identifier.citation	Beniwal, Himanshu; Panda, Sailesh and Singh, Mayank, "Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs", arXiv, Cornell University Library, DOI: arXiv:2502.16901, Feb. 2025.
dc.identifier.uri	http://arxiv.org/abs/2502.16901
dc.identifier.uri	https://repository.iitgn.ac.in/handle/123456789/11087
dc.description.abstract	We explore Cross-lingual Backdoor ATtacks (X-BAT) in multilingual Large Language Models (mLLMs), revealing how backdoors inserted in one language can automatically transfer to others through shared embedding spaces. Using toxicity classification as a case study, we demonstrate that attackers can compromise multilingual systems by poisoning data in a single language, with rare tokens serving as specific effective triggers. Our findings expose a critical vulnerability in the fundamental architecture that enables cross-lingual transfer in these models. Our code and data are publicly available at this https URL.
dc.description.statementofresponsibility	by Himanshu Beniwal, Sailesh Panda and Mayank Singh
dc.language.iso	en_US
dc.publisher	Cornell University Library
dc.title	Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs
dc.type	Article
dc.relation.journal	arXiv

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

E-print Articles [183]

Show simple item record

Search Digital Repository

Browse

All of DSpace
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Degree
- By Department

Char-mander use mbackdoor! a study of cross-lingual backdoor attacks in multilingual LLMs

Files in this item

This item appears in the following Collection(s)

Search Digital Repository

Browse

All of DSpace

This Collection

My Account