Sequence Determination from Overlapping Fragments: A Simple Model of Whole-Genome Shotgun Sequencing
Abstract
Assembling fragments randomly sampled along a sequence is the basis of whole-genome shotgun sequencing, a technique used to sequence the human and other genomes. We calculate the probability that a random sequence can be recovered from a collection of overlapping fragments. We provide an exact solution for an infinite alphabet and in the case of constant overlaps. For the general problem, we apply two assembly strategies and give the probability that the assembly puzzle can be solved in the limit of infinitely many fragments.
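To make the constant-overlap setting concrete, the following is a minimal Monte Carlo sketch of a toy version of the problem. It is not the paper's calculation or either of its assembly strategies: the function names (`make_fragments`, `assemble`, `recovery_probability`), the greedy extension rule, and the assumption that the first fragment is known are all illustrative choices made here. A random sequence over a finite alphabet is cut into equal-length fragments with a fixed overlap, the fragments are shuffled, and reassembly counts as a failure whenever the next fragment is not uniquely determined by the overlap.

```python
import random


def make_fragments(seq, frag_len, overlap):
    """Cut seq into fragments of length frag_len that overlap by `overlap` symbols."""
    step = frag_len - overlap
    return [seq[i:i + frag_len] for i in range(0, len(seq) - frag_len + 1, step)]


def assemble(fragments, overlap):
    """Greedy reassembly starting from fragments[0] (assumed known to be first).

    Requires overlap >= 1. Returns the reconstructed sequence, or None if at
    some step the next fragment is missing or not uniquely determined.
    """
    current = fragments[0]
    unused = list(fragments[1:])
    result = list(current)
    while unused:
        suffix = current[-overlap:]
        # Indices of unused fragments whose prefix matches the current suffix.
        candidates = [i for i, f in enumerate(unused) if f[:overlap] == suffix]
        if len(candidates) != 1:
            return None  # stuck or ambiguous: treat as a failed assembly
        current = unused.pop(candidates[0])
        result.extend(current[overlap:])
    return result


def recovery_probability(alphabet_size, n_fragments, frag_len, overlap, trials, seed=0):
    """Estimate the fraction of random sequences that the greedy rule recovers."""
    rng = random.Random(seed)
    step = frag_len - overlap
    seq_len = frag_len + step * (n_fragments - 1)
    successes = 0
    for _ in range(trials):
        seq = [rng.randrange(alphabet_size) for _ in range(seq_len)]
        frags = make_fragments(seq, frag_len, overlap)
        first, rest = frags[0], frags[1:]
        rng.shuffle(rest)  # the assembler does not know the original order
        if assemble([first] + rest, overlap) == seq:
            successes += 1
    return successes / trials
```

In this toy model, a one-letter alphabet with three or more fragments always fails, because every fragment matches every overlap, while a very large alphabet makes accidental overlap matches vanishingly rare. Calling `recovery_probability` with intermediate alphabet sizes gives a rough feel for how the recovery probability depends on alphabet size, overlap length, and number of fragments.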
Domains
Physics [physics]
Origin: Explicit agreement for this deposit