SE(3) STOCHASTIC FLOW MATCHING FOR PROTEIN BACKBONE GENERATION

Avishek Bose*, Tara Akhound-Sadegh*, Guillaume Huguet, Kilian Fatras, Jarrid Rector-Brooks, Cheng Hao Liu, Andrei Cristian Nica, Maksym Korablyov, Michael Bronstein, Alexander Tong*

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

1 Scopus citations

Abstract

The computational design of novel protein structures has the potential to impact numerous scientific disciplines greatly. Toward this goal, we introduce FOLDFLOW a series of novel generative models of increasing modeling power based on the flow-matching paradigm over 3D rigid motions-i.e. the group SE(3)-enabling accurate modeling of protein backbones. We first introduce FOLDFLOW-BASE a simulation-free approach to learning deterministic continuous-time dynamics and matching invariant target distributions on SE(3). We next accelerate training by incorporating Riemannian optimal transport to create FOLDFLOW-OT leading to the construction of both more simple and stable flows. Finally, we design FOLDFLOW-SFM coupling both Riemannian OT and simulation-free training to learn stochastic continuous-time dynamics over SE(3). Our family of FOLDFLOW generative models offers several key advantages over previous approaches to the generative modeling of proteins: they are more stable and faster to train than diffusion-based approaches, and our models enjoy the ability to map any invariant source distribution to any invariant target distribution over SE(3). Empirically, we validate FOLDFLOW on protein backbone generation of up to 300 amino acids leading to high-quality designable, diverse, and novel samples.

Original languageEnglish
StatePublished - 2024
Externally publishedYes
Event12th International Conference on Learning Representations, ICLR 2024 - Hybrid, Vienna, Austria
Duration: 7 May 202411 May 2024

Conference

Conference12th International Conference on Learning Representations, ICLR 2024
Country/TerritoryAustria
CityHybrid, Vienna
Period7/05/2411/05/24

Funding

FundersFunder number
Anyscale and Google GCP
Samsung
CIFAR AI Chairs
Natural Sciences and Engineering Research Council of CanadaRGPIN-2019-06512

    Fingerprint

    Dive into the research topics of 'SE(3) STOCHASTIC FLOW MATCHING FOR PROTEIN BACKBONE GENERATION'. Together they form a unique fingerprint.

    Cite this