Abstract:
Dysregulated expression of long non-coding RNAs (lncRNAs) in cancer contributes to various hallmarks of the disease, presenting novel opportunities for diagnosis and therapy. G-quadruplexes (G4s) within lncRNAs have gained attention recently; however, their systematic evaluation in cancer biology is yet to be performed. In this work, we have formulated a comprehensive dataset integrating experimentally-validated associations between lncRNAs and cancer, and detailed predictions of their G4-forming potential. The dataset categorizes predicted G4-motifs into anticipated G4 types (2 G, 3 G, and 4 G) and provides information about the subcellular localization of the corresponding lncRNAs. It describes lncRNA-RNA and lncRNA-protein interactions, together with the RNA G4-binding capabilities of these proteins. The dataset facilitates the investigation of G4-mediated lncRNA functions in diverse human cancers and provides distinctive leads about G4-mediated lncRNA-protein interactions.