AlphaFold 3 accurately models natural variants of Helicobacter pylori catalase KatA
1.This study rigorously evaluates AlphaFold 3's ability to model natural protein variants using a high-resolution crystal structure of the H. pylori catalase KatA from strain SS1. These variants differ from the canonical 26695 strain at key residues: 234, 237, 255, and 421.
2.The most striking result: AlphaFold 3 accurately predicted both the global architecture and fine structural features of KatASS1, including conservative substitutions and solvent-exposed or interface residues—without access to the variant structure during training.
3.Variant residues modeled by AlphaFold 3 closely matched those resolved in the crystal structure (1.87 Å resolution). For Val234 and Phe255, the predictions reached experimental quality. For the more challenging Glu421 and Tyr237, performance varied by input.
4.Incorrect oligomeric state input (e.g., monomer instead of tetramer) reduced accuracy, particularly at interface-exposed residues like Glu421, which lost native hydrogen bonding seen in the crystal structure. Models with incorrect input can show misleadingly high pLDDT scores.
5.Minor input perturbations like single-residue substitutions or terminal Trp insertions had negligible impact on structure prediction, reinforcing the model's robustness—so long as the correct oligomeric state is used.
6.Among natural KatA variants from 1,931 H. pylori genomes, the four studied positions are common divergence points. Their accurate modeling suggests AlphaFold 3 may be broadly useful in evaluating natural variation in pathogen proteins with experimental backbones.
7.The authors highlight that AlphaFold 3’s accessibility is a double-edged sword—non-expert users may unintentionally degrade prediction quality by providing incorrect inputs. Even confident predictions (e.g., high pLDDT) can be structurally misleading.
8.This case study underscores AlphaFold 3's strong potential for modeling natural variants, especially when paired with known oligomeric context and conservative substitutions—but also highlights the need for caution and biological context in interpretation.
9.The experimental crystal structure of KatASS1, solved here for the first time, provides a new reference for benchmarking modeling of natural variants and has been deposited under PDB ID 9nh3.
📜Paper:
biorxiv.org/content/10.1101/…
#AlphaFold3 #ProteinStructure #StructuralBiology #HelicobacterPylori #ComputationalBiology #ProteinVariants #PathogenEvolution