The Align Foundation

The Align Foundation

The Align Foundation

@Align_Bio

Mar 31

📢 Data Release Tuesday: Align TEV Protease Dataset 📊 We’re expanding the The Align Foundation data ecosystem again with ~30,000 high-quality GROQ-seq data points capturing TEV protease sequence–function relationships at scale. To our knowledge, this is the largest mutational dataset on TEV protease to date. Notably, no comprehensive deep mutational scanning study across the full protein has been reported in over three decades, leaving key aspects of its functional landscape unexplored. TEV protease is a cornerstone tool in biotechnology, known for its high substrate specificity, and this dataset provides a rich resource for enzyme engineering and ML-driven protein design. This release was made possible by a strong cross-team effort, with key contributions from Erika Alden DeBenedictis, @Anjali Chadha, @Dave Ross’s team at National Institute of Standards and Technology (NIST) and the DAMP Lab at Boston University 🔗 Access the dataset: hubs.la/Q048Z3890 #OpenScience #SyntheticBiology #ProteinEngineering #BioAI #MachineLearning #AlignData #GROQSEQ #Protease #TEV

1,402

The Align Foundation

The Align Foundation

@Align_Bio

Mar 24

📢 Public Data Release: Align T7 RNA Polymerase Dataset. 📊The data keeps coming at @Align_Bio! We’re excited to release our T7 RNA polymerase dataset, adding ~35,000 unique GROQ-seq data points to the growing Align data ecosystem, capturing sequence–function relationships across variants at scale. To our knowledge, this is the largest mutational dataset on T7 RNA polymerase to date! 🔗 Access the dataset on the Align Data Portal: hubs.la/Q04802KK0 #OpenScience #SyntheticBiology #ProteinEngineering #BioAI #MachineLearning #AlignData #GROQSEQ #RNApolymerase

1,299