ppIRIS: deep learning for proteome-wide prediction of bacterial protein-protein interactions
1. The article introduces ppIRIS, a lightweight deep learning model designed to predict bacterial protein-protein interactions (PPIs) directly from sequence. This model integrates evolutionary and structural embeddings, achieving state-of-the-art accuracy while enabling rapid proteome-wide screening.
2. Traditional experimental approaches to mapping bacterial interactomes are incomplete due to technical limitations. Computational methods often lack generalizability or are too resource-intensive. ppIRIS addresses these challenges by combining ESM-C and ProstT53Di embeddings within a Siamese network architecture, offering a balance between precision and recall.
3. When applied to Group A Streptococcus (GAS), ppIRIS revealed functional clusters linked to virulence pathways, including nutrient transport, stress response, and metal scavenging. This demonstrates the model’s potential for uncovering key interactions in bacterial pathogens.
4. For host-pathogen predictions, ppIRIS recovered 56.2% of known GAS-human plasma interactions, with enrichment in complement, coagulation, and protease inhibition pathways. Experimental validation confirmed several novel predictions, highlighting the model’s applicability for systematic discovery of cross-species PPIs.
5. The model’s architecture includes a dual-embedding fusion approach and a feature interaction module that captures both individual protein characteristics and relationship patterns between proteins. This design emphasizes practical deployability, allowing for rapid scans on widely available GPUs.
6. The authors also explored the interaction landscape of GAS M1 virulence factors with human proteins, revealing coherent functional modules targeted by multiple virulence factors. This analysis provides valuable insights into the host-pathogen interface and potential targets for therapeutic intervention.
7. The ppIRIS codebase, including dataset preprocessing, embedding extraction, training, and inference pipelines, is available on GitHub, making it accessible for further research and development in the field of bacterial PPI prediction.
📜Paper:
biorxiv.org/content/10.1101/…
#ProteinInteraction #DeepLearning #BacterialPathogens #HostPathogenInteractions #Bioinformatics #Proteomics