MP2D: Constrained Monte Carlo Tree-Guided Diffusion for Multi-Objective Protein Sequence Design
1 MP2D frames diffusion denoising as a constrained sequential decision-making problem and uses Monte Carlo Tree Search (MCTS) to explore many alternative denoising trajectories, rather than committing to a single diffusion path when objectives conflict (e.g., efficacy vs toxicity).
2 The key multi-objective ingredient is Pareto-guided search: each candidate is evaluated by a vector of property predictors, and MCTS selection/expansion is restricted to Pareto non-dominated children, so the search explicitly targets balanced trade-offs instead of relying on fragile scalar weights.
3 To keep Pareto optimization scalable when optimizing 4–5 objectives, MP2D introduces a dynamic Pareto constraint: candidate updates are filtered by alignment to predefined optimization directions (Das–Dennis simplex lattice) using an angular threshold that is adaptively tuned to maintain a stable rejection/acceptance rate, preventing Pareto-front “bloat” and property collapse.
4 MP2D is training-free at optimization time: it plugs in pretrained property evaluators and a pretrained conditional discrete diffusion model, and can swap objectives/predictors without retraining the generative backbone—positioned as a practical workflow for rapidly changing design specifications.
5 The generative backbone is CMDLM, a classifier-free, label-guided conditional masked diffusion language model for peptides (built on an ESM-style transformer). It is pretrained on 2.6M UniProt peptides (length 2–50) and then LoRA fine-tuned for antimicrobial peptides (AMPs) and protein binders (PBs), aiming to narrow the search space to task-relevant sequence regions.
6 CMDLM is evaluated on plausibility and realism using ESM-2 pseudoperplexity, structural foldability via OmegaFold pLDDT, and distributional similarity via Frechet ProtT5 Distance (FPD). Across peptide, AMP, and PB settings, CMDLM is generally more plausible/foldable and closer to target distributions than ProteinGAN, ProtGPT2, and EvoDiff under the paper’s benchmark.
7 The optimization engine (CMCTD) modifies UCB in MCTS by adding a diffusion-posterior–guided exploration term, encouraging exploration that stays close to valid diffusion transitions while still pursuing multi-property reward improvements.
8 MP2D adds a global iterative refinement loop: starting from W seed sequences, it repeatedly partially remasks sequences at random noise levels and reruns constrained MCTS denoising. This is designed to (a) correct early irreversible token decisions typical of single-pass denoising, and (b) increase diversity of optimization routes under noisy global property predictors.
9 On protein binder optimization (5 objectives: hemolysis, non-fouling, solubility, half-life, affinity) for targets 1B8Q and PPP5, MP2D outperforms classical multi-objective optimizers (NSGA-III, SMS-EMOA, SPEA2, MOPSO) and a recent generative baseline (MOG-DFM), achieving lower hemolysis, higher non-fouling/solubility/half-life, and competitive affinity.
10 On AMP optimization (4 objectives: antimicrobial probability, MIC, hemolysis, toxicity), MP2D outperforms AMP-focused multi-objective generative baselines (Multi-CGAN, MPOGAN, HMAMP, MoFormer), improving potency (higher Pamp, lower MIC) while simultaneously reducing safety risks (lower hemolysis and toxicity), addressing the common failure mode where optimizing one property degrades another.
📜Paper:
arxiv.org/abs/2605.05829
#ProteinDesign #ComputationalBiology #DiffusionModels #MonteCarloTreeSearch #MultiObjectiveOptimization #AntimicrobialPeptides #PeptideDesign #ProteinEngineering #GenerativeAI #ParetoOptimization