Computational Genomics
January 27, 2004
Genomes
Genome sizes
Triturus cristatus (crested newt) ≈ 18 Gbp
Corn ≈ 5 Gbp
Human ≈ 3.2 Gbp
Frog (X. laevis) ≈ 3 Gbp
Fruit fly (D. melanogaster) ≈ 137 Mbp
Nematode (C. elegans) ≈ 80 Mbp
Yeast (S. cerevisiae) ≈ 13 Mbp
Bacteria (M. tuberculosis) ≈ 4.4 Mbp
HIV = 9,181 bp (RNA)
Human Genome
3.2 Gbp (3,200 Mbp)
46 chromosomes (haploid N = 23)
22 autosome pairs + sex chromosomes (XX or XY)
25,000 – 100,000 genes
≈ 10^14 cells
Genome Packaging
Many levels of packaging
Autosomes vs “Sex” chromosomes
D. McDonald, Seattle Laboratory of Pathology
Chromosome Anatomy
Structures
Centromeres
Telomeres
Euchromatin
Gene-dense
Loosely packed
Histones are acetylated
Heterochromatin
Gene-poor
Tightly packed
Histones are methylated
Chromosomes have genes
The density of genes on a chromosome can be quite variable
Chromosome 19:
23 genes per million bp
Chromosome 13:
5 genes per million bp
The average human gene (spliced) is around 3,000 bp
"non-functional" regions
Only 1-3% of the human genome may actually code for a protein
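The 1-3% figure can be sanity-checked against the other numbers on these slides. The sketch below is a rough estimate, not a measurement: it assumes every gene has the quoted average spliced size of 3,000 bp, and the function name is illustrative. Spliced transcripts also include untranslated regions, so the result is an upper bound on the protein-coding share.

```python
# Back-of-the-envelope estimate using figures quoted in these slides.
GENOME_SIZE_BP = 3.2e9        # human genome, about 3.2 billion bp
AVG_SPLICED_GENE_BP = 3000    # average spliced gene size (assumed uniform)


def spliced_fraction(gene_count: int) -> float:
    """Fraction of the genome covered by spliced gene sequence."""
    return gene_count * AVG_SPLICED_GENE_BP / GENOME_SIZE_BP


low = spliced_fraction(25_000)    # lower gene-count estimate
high = spliced_fraction(100_000)  # upper gene-count estimate
print(f"{low:.1%} to {high:.1%}")  # 2.3% to 9.4%
```

With the lower gene count the estimate is about 2.3%, which falls inside the quoted 1-3% range. The upper gene count would imply a noticeably larger share.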
DNA/RNA
4 letters
Amino acids
20 letters
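A small counting sketch shows why a 4-letter alphabet has to be read three letters at a time to cover 20 amino acids. The names below are illustrative only.

```python
from itertools import product

NUCLEOTIDES = "ACGU"     # 4-letter RNA alphabet (DNA uses T in place of U)
AMINO_ACID_COUNT = 20    # size of the amino acid alphabet


def words(length: int) -> list[str]:
    """All nucleotide words of the given length."""
    return ["".join(p) for p in product(NUCLEOTIDES, repeat=length)]


for length in (1, 2, 3):
    count = len(words(length))
    # Print word length, number of words, and whether 20 amino acids fit.
    print(length, count, count >= AMINO_ACID_COUNT)
```

Words of length 1 and 2 give only 4 and 16 combinations. Length 3 gives 64, enough for 20 amino acids with room for redundancy.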
Modes:
1) recruits or enhances polymerase
2) makes DNA more available
Context
Transcription: the conversion of DNA information to RNA information.
Happens before splicing.
Carried out by RNA polymerase.
Followed by translation: the conversion of RNA information to an amino acid sequence.
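The two conversions can be sketched in a few lines of Python. This is a minimal illustration, assuming the input is the coding strand of an already spliced sequence (splicing is skipped here) and using the standard genetic code. The example sequence and function names are made up for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in U, C, A, G order.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def transcribe(coding_dna: str) -> str:
    """DNA -> RNA: copy the coding strand, replacing T with U."""
    return coding_dna.upper().replace("T", "U")


def translate(mrna: str) -> str:
    """RNA -> protein: read codons from the first AUG until a stop codon."""
    start = mrna.find("AUG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":   # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


dna = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"  # hypothetical example
mrna = transcribe(dna)
print(mrna)             # AUGGCCAUUGUAAUGGGCCGCUGAAAGGGUGCCCGAUAG
print(translate(mrna))  # MAIVMGR
```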
RNA polymerase
>10 subunits in the “holoenzyme”
Several types: I-III
Promoters vs Enhancers
Characteristics of each
Promoters: <200 bp from the start
Enhancers: up to several kb away
May end up closer due to DNA bending
Typically more global regulatory elements, tissue or time specific.
Modes of operation:
Could change the density of supercoiling.
Provide an entry site for the polymerase.
Anchor the DNA at a place within the cell for access.
Mutation
Common themes in mutation
point mutation
recombination
transposition
deletion
Severity of mutation
No effect
Point mutations in particular can be silent (no change to the amino acid)
Minimal effect
not critical to protein function
Major effect
Insertions which inactivate a protein
Frame-shifts
Deletions
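The range of severities can be illustrated on a toy sequence. The sketch below assumes the standard genetic code and a made-up reference sequence. It compares a silent point mutation, a one-base insertion that shifts the reading frame, and an in-frame deletion of one codon.

```python
from itertools import product

# Standard genetic code on the DNA alphabet, codons in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from position 0 until a stop codon or the end."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


reference = "ATGGCCATTGTAATGGGCCGCTGA"              # hypothetical gene
silent = reference[:5] + "T" + reference[6:]        # GCC -> GCT, both alanine
frameshift = reference[:6] + "A" + reference[6:]    # one inserted base
deletion = reference[:3] + reference[6:]            # one whole codon removed

print(translate(reference))   # MAIVMGR
print(translate(silent))      # MAIVMGR  (no effect)
print(translate(frameshift))  # MANCNGPL (everything downstream changes)
print(translate(deletion))    # MIVMGR   (one amino acid lost)
```

The frameshift also destroys the original stop codon, so translation runs on past the intended end of the protein.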
mRNA
EST
Protein sequences
Finding your way on molecules
5’ end is the start of DNA
3’ end is the end of DNA
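Because sequences are written from the 5’ end to the 3’ end, the opposite strand of the same molecule reads as the reverse complement. A minimal sketch, with an illustrative function name and example sequence:

```python
# Map each base to its pairing partner (A-T, C-G).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the opposite strand, also written 5' -> 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


top_strand = "ATGGCCATT"               # hypothetical sequence, 5' -> 3'
print(reverse_complement(top_strand))  # AATGGCCAT
```

Applying the function twice returns the original sequence, which is a quick check that both strands are being read in the 5’ to 3’ direction.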
3’ EST
5’ EST
END
(go to UCSC website intro)