Mutational paths with sequence-based models of proteins: from sampling to mean-field characterisation - ENS - École normale supérieure
Preprint, Working Paper. Year: 2022

Mutational paths with sequence-based models of proteins: from sampling to mean-field characterisation

Eugenio Mauri
  • Role: Author
  • PersonId : 1074770
Simona Cocco
Rémi Monasson

Abstract

Identifying and characterizing mutational paths is an important issue in evolutionary biology and in bioengineering. We here introduce a generic description of mutational paths in terms of the goodness of sequences and of the mutational dynamics (how sequences change) along the path. We first propose an algorithm to sample mutational paths, which we benchmark on exactly solvable models of proteins in silico, and apply to data-driven models of natural proteins learned from sequence data with Restricted Boltzmann Machines. We then use mean-field theory to characterize the properties of mutational paths for different mutational dynamics of interest, and show how it can be used to extend Kimura's estimate of evolutionary distances to sequence-based epistatic models of selection.
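As a rough illustration of what sampling a mutational path can look like, the sketch below runs a Metropolis sampler over the intermediate sequences of a path with fixed endpoints. It is not the authors' algorithm: the site-independent `fields` model stands in for the Restricted Boltzmann Machine, and the path weight (log-fitness of each sequence minus `lam` times the Hamming distance between consecutive sequences), as well as the names `log_score`, `sample_path`, `lam` and `n_sweeps`, are assumptions made for this example only.

```python
import numpy as np

def log_score(path, fields, lam):
    """Unnormalised log-weight of a path (toy assumption, not the paper's model).

    path:   (n_steps + 1, length) integer array, one sequence per row
    fields: (length, q) array, fields[i, a] = log-fitness of symbol a at site i
    lam:    penalty per mutation between consecutive sequences
    """
    sites = np.arange(path.shape[1])
    fitness = fields[sites, path].sum()        # sum of fields[i, path[t, i]] over t and i
    jumps = (path[1:] != path[:-1]).sum()      # total Hamming distance along the path
    return fitness - lam * jumps

def sample_path(start, end, n_steps, fields, lam=2.0, n_sweeps=200, seed=0):
    """Metropolis sampling of the intermediate sequences; endpoints stay fixed."""
    rng = np.random.default_rng(seed)
    length, q = fields.shape
    # Initial path: first half copies the start sequence, second half the end sequence.
    path = np.empty((n_steps + 1, length), dtype=int)
    half = (n_steps + 1) // 2
    path[:half] = start
    path[half:] = end
    current = log_score(path, fields, lam)
    for _ in range(n_sweeps * (n_steps - 1) * length):
        t = rng.integers(1, n_steps)           # intermediate rows 1 .. n_steps - 1 only
        i = rng.integers(length)
        old = path[t, i]
        path[t, i] = rng.integers(q)           # symmetric single-site proposal
        proposed = log_score(path, fields, lam)
        if np.log(rng.random()) < proposed - current:
            current = proposed                 # accept the mutation
        else:
            path[t, i] = old                   # reject and restore
    return path
```

In this toy setting a larger `lam` favours paths whose consecutive sequences differ at few sites, while a smaller `lam` lets intermediates drift toward high-fitness sequences. The full score is recomputed at every proposal for readability; a practical sampler would update it incrementally.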
Main file
main.pdf (3.21 MB)
AppendixA.pdf (1.26 MB)
AppendixB.pdf (2.43 MB)
support.pdf (1.2 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03645394 , version 1 (21-04-2022)
hal-03645394 , version 2 (21-10-2022)
hal-03645394 , version 3 (27-10-2022)
hal-03645394 , version 4 (06-02-2023)

Cite

Eugenio Mauri, Simona Cocco, Rémi Monasson. Mutational paths with sequence-based models of proteins: from sampling to mean-field characterisation. 2022. ⟨hal-03645394v2⟩