Home  | Publications | FSK+26

RIBEX: Predicting and Explaining RNA Binding Across Structured and Intrinsically Disordered Regions (IDR)-Rich Proteins

MCML Authors

Abstract

Motivation: RNA-binding proteins (RBPs) regulate post-transcriptional processes, yet many remain undiscovered because RNA-binding activity often occurs outside canonical RNA-binding domains (RBDs), including within intrinsically disordered regions (IDRs) or through protein complexes. Computational methods can help identify novel RBPs, but approaches relying solely on sequence-derived features or ignoring the cellular interaction context are limited in capturing the complexity of RNA-binding behavior. To date, no framework rigorously integrates both sequence information and protein interaction context for RBP prediction.<br>Results: We introduce RIBEX, a multimodal framework that combines protein language model (pLM) embeddings with protein interactome topology to improve RBP prediction and interpretation. Specifically, we integrate sequence representations with graph-derived positional encodings (PE) from the human STRING protein–protein interaction (PPI) network. PE are computed using Personalized PageRank, reduced with principal component analysis, and fused with pooled sequence embeddings through FiLM conditioning, while Low-Rank Adaptation (LoRA) enables parameter-efficient task adaptation. Across both an annotation-based benchmark and experimental RNA Interactome Capture (RIC) dataset, PE consistently improves predictive performance, indicating that interactome topology provides complementary information beyond sequence features. LoRA adaptation of ESM2-650M further yields larger gains than simply scaling frozen backbone size. RIBEX outperforms state-of-the-art methods such as RBP-TSTL and HydRA, particularly on challenging subsets including proteins lacking canonical RBDs and those enriched in IDRs. For interpretability, we combine sequence-level computational alanine scanning with network-level positional-encoding ablation and inverse-PCA mapping, recovering known RNA-binding domains, IDR-associated contributions, and functional interactome communities linked to RBP predictions.

misc FSK+26


Preprint

Mar. 2026

Authors

S. Firmani • F. SteinbauerG. Kasneci • A. Marsico • M. Horlacher

Links

DOI

Research Area

 A1 | Statistical Foundations & Explainability

BibTeXKey: FSK+26

Back to Top