Biology 2581B Lecture Notes - Lecture 19: Arms Race, Contig, Ribosomal RNA
Document Summary
Lecture 19: Bioinformatics & the future of genetics. Two main topics in bioinformatics: assembling and searching. Currently, it's very easy to generate sequence data, but hard to put the DNA pieces from a sequencing machine back together. Look for overlap between different reads, then assemble them into contigs. Here, the contig is a total of 275 nucleotides long. The chance that one random site is identical to another is 25% (4 bases). The chance that two sites in a row are identical is 0.25 x 0.25 = 0.0625. The chance that 25 sites in a row are identical is 0.25^25 (about 8.9 x 10^-16), so a long overlap is very unlikely to arise by chance. Problem: in these genomes, it's very common to have repeat elements. Imagine you have an identical repeat in this read, and it's present 4 times: you could overlap it in many places and still think it's identical. If you're lucky, there will be another sequence that covers the entire repeat sequence.
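The overlap arithmetic and the repeat problem above can be tried out in a few lines of Python. This is only a minimal sketch, not how a real assembler works: the function names (`chance_identical`, `overlap_length`, `merge_reads`, `all_overlaps`), the `min_overlap` cutoff, and the short example reads are all made up for illustration, and the probability assumes the four bases are equally likely and sites are independent.

```python
def chance_identical(n_sites, p_match=0.25):
    """Chance that n_sites random sites in a row all match.

    Assumes 4 equally likely bases, so each site matches with p = 0.25.
    """
    return p_match ** n_sites


def overlap_length(left, right, min_overlap=3):
    """Length of the longest suffix of `left` that equals a prefix of `right`.

    Returns 0 if no overlap of at least `min_overlap` bases exists.
    """
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return k
    return 0


def merge_reads(left, right, min_overlap=3):
    """Join two overlapping reads into one contig, or return None."""
    k = overlap_length(left, right, min_overlap)
    return left + right[k:] if k else None


def all_overlaps(left, right, min_overlap=3):
    """Every overlap length at which `left`'s end matches `right`'s start."""
    longest = min(len(left), len(right))
    return [k for k in range(min_overlap, longest + 1)
            if left.endswith(right[:k])]


if __name__ == "__main__":
    # Two sites in a row, then 25 sites in a row.
    print(chance_identical(2))
    print(chance_identical(25))

    # Hypothetical reads that share the 4-base overlap "TACG".
    print(merge_reads("ATGCGTACG", "TACGGATTC"))

    # Hypothetical reads ending/starting in a CA repeat: more than one
    # overlap is consistent with the data, so the join is ambiguous.
    print(all_overlaps("GGCTCACACACA", "CACACATTG"))
```

With the repeat-containing reads, `all_overlaps` reports more than one valid overlap length, which is the ambiguity described in the notes: the reads alone cannot say how many copies of the repeat sit between them, and only a read spanning the whole repeat would settle it.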