Capturing Natural Evolution in Function-guided RNA Design via Genomic Foundation Models
1. This study introduces RILLIE, a zero-shot RNA design framework that combines large language models (LLMs) and inverse folding models (IFMs) to simulate natural evolution and optimize RNA sequences for in vivo function—without any task-specific training.
2. RILLIE integrates AIDO.RNA, a 1.6B-parameter RNA LLM capturing evolutionary plausibility, with RhoDesign, an inverse folding model capturing structural compatibility, forming a “product of experts” model that jointly optimizes for sequence and structure fitness.
3. Unlike traditional SELEX or task-specific ML pipelines, RILLIE operates in a zero-shot setting, rapidly generating RNA variants that maintain natural sequence grammar and structural integrity while enhancing experimental performance.
4. Benchmarking across six diverse DMS datasets (aptamers, tRNAs, ribozymes), RILLIE demonstrates high correlation between model scores and experimental RNA fitness, significantly outperforming both RNA and DNA LLMs alone.
5. Applied to the Broccoli aptamer, RILLIE generated 20 variants in a single round—over half showed increased fluorescence, with B2 achieving a 55% boost in intensity and a 2x improvement in binding affinity, verified via FACS in living HEK cells.
6. For the Pepper aptamer, a two-round directed evolution strategy yielded 40 novel variants, with fluorescence boosts up to 2.6-fold and 3x binding affinity improvement. Over 40% of sequences outperformed wild type, including high-mutation variants with up to 75% sequence divergence.
7. Mutation preference analysis revealed that RILLIE avoids deleterious substitutions (e.g., C5G, U19A) and favors beneficial mutations in variable regions, showing strong alignment with natural selection patterns and high-fitness precision.
8. Importantly, sequences designed with RILLIE retained performance in vivo, demonstrating improved folding and function in HEK cells—a major challenge for aptamers designed solely via in vitro methods like SELEX.
9. RILLIE’s framework can generalize to other RNA classes beyond aptamers. The model was shown to perform well in predicting mutational effects across ribozymes and tRNAs, opening pathways for universal RNA engineering.
10. This work provides the first large-scale evidence that integrating structural and sequence models allows for scalable, evolution-guided, task-agnostic RNA design—enabling a paradigm shift in synthetic biology and RNA therapeutics.
💻Code:
github.com/GENTEL-lab/RILLIE
📜Paper:
biorxiv.org/content/10.1101/…
#RNAEngineering #SyntheticBiology #RNAaptamers #ZeroShotLearning #LanguageModels #InverseFolding #Bioinformatics #AptamerDesign #RNAtherapeutics #LLM #RILLIE #DirectedEvolution