Mutational paths in protein-sequence landscapes: from sampling to low-dimensional characterization - ENS - École normale supérieure Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2022

Mutational paths in protein-sequence landscapes: from sampling to low-dimensional characterization

Résumé

Understanding how protein functionalities vary along mutational paths is an important issue in evolutionary biology and in bioengineering. We here propose an algorithm to sample mutational paths in the sequence space, realizing a trade-off between protein optimality and path stiffness. The algorithm is benchmarked on exactly solvable models of proteins in silico, and applied to data-driven models of natural proteins learned from sequence data. Using mean-field theory, we monitor the projections of the sequence on relevant modes along the path, allowing for an interpretation of the protein sequence trajectory. Qualitative changes observed in paths as their lengths are varied can be explained by the existence of a phase transition in infinitely-long strings of strongly coupled Hopfield models.
Fichier principal
Vignette du fichier
main.pdf (2.04 Mo) Télécharger le fichier
support.pdf (1.61 Mo) Télécharger le fichier

Dates et versions

hal-03645394 , version 1 (21-04-2022)
hal-03645394 , version 2 (21-10-2022)
hal-03645394 , version 3 (27-10-2022)
hal-03645394 , version 4 (06-02-2023)

Identifiants

Citer

Eugenio Mauri, Simona Cocco, Rémi Monasson. Mutational paths in protein-sequence landscapes: from sampling to low-dimensional characterization. 2022. ⟨hal-03645394v1⟩
111 Consultations
130 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More