DNA
DNA (deoxyribonucleic acid) is the molecule that carries the genetic instructions for the development, functioning, growth, and reproduction of all known organisms and many viruses. Often called the "blueprint of life," DNA contains the information needed to build and maintain an organism and is passed from parents to offspring.
Structure
The Double Helix
DNA's structure was discovered by James Watson and Francis Crick in 1953, building on Rosalind Franklin's X-ray crystallography work:
- Two polynucleotide strands wound around each other
- Right-handed helix (B-form DNA)
- Diameter: approximately 2 nanometers
- One complete turn: 3.4 nanometers (10 base pairs)
- Strands are antiparallel (run in opposite directions)
Nucleotides
DNA is composed of four nucleotides, each containing:
- Phosphate group: Provides backbone negative charge
- Deoxyribose sugar: Five-carbon sugar
- Nitrogenous base: One of four types
The Four Bases
| Base | Type | Pairs With | Rings |
|---|---|---|---|
| Adenine (A) | Purine | Thymine | 2 |
| Guanine (G) | Purine | Cytosine | 2 |
| Thymine (T) | Pyrimidine | Adenine | 1 |
| Cytosine (C) | Pyrimidine | Guanine | 1 |
Base Pairing
Complementary base pairing (Chargaff's rules):
- A-T: Two hydrogen bonds
- G-C: Three hydrogen bonds
- This pairing ensures accurate replication
- Amount of A always equals T; G always equals C
Major and Minor Grooves
The double helix has two grooves:
- Major groove: Wider, more accessible
- Minor groove: Narrower
- Proteins bind to grooves to read DNA sequence
Organization
In Prokaryotes
Bacterial DNA organization:
- Single circular chromosome
- Located in nucleoid region (no membrane)
- Plasmids: Additional small circular DNA
- Supercoiled to fit in cell
In Eukaryotes
Complex packaging hierarchy:
- Nucleosomes: DNA wrapped around histone proteins
- Chromatin fiber: String of nucleosomes
- Looped domains: 30 nm fiber loops
- Chromosomes: Condensed during cell division
Chromosomes
Human genome organization:
- 23 pairs of chromosomes (46 total)
- ~3 billion base pairs
- ~20,000 protein-coding genes
- 98% non-coding DNA
Telomeres and Centromeres
Telomeres:
- Protective caps at chromosome ends
- TTAGGG repeats in humans
- Shorten with each cell division
- Associated with aging
Centromeres:
- Attachment point for spindle fibers
- Essential for chromosome separation
- Highly repetitive sequences
DNA Replication
Overview
DNA replication is semi-conservative:
- Each strand serves as template
- Produces two identical DNA molecules
- Each new molecule has one old and one new strand
The Replication Process
Initiation
- Origins of replication identified
- Helicase unwinds the double helix
- Single-strand binding proteins stabilize
- Topoisomerase relieves tension
Elongation
Key enzymes:
| Enzyme | Function |
|---|---|
| Primase | Synthesizes RNA primer |
| DNA Polymerase III | Main replication enzyme |
| DNA Polymerase I | Removes primers, fills gaps |
| Ligase | Joins Okazaki fragments |
Leading strand vs. lagging strand:
- Leading strand: Continuous synthesis (5' to 3')
- Lagging strand: Discontinuous (Okazaki fragments)
- Both strands synthesized simultaneously
Termination
- Replication forks meet
- RNA primers removed
- Gaps filled and ligated
- Proofreading and repair
Accuracy
DNA replication is highly accurate:
- Error rate: ~1 per 10⁹-10¹⁰ base pairs
- 3' to 5' exonuclease proofreading
- Mismatch repair systems
- Essential for genetic stability
Gene Expression
The Central Dogma
Information flow in biology:
DNA → RNA → Protein
Transcription
DNA to RNA:
- Initiation: RNA polymerase binds promoter
- Elongation: RNA synthesized 5' to 3'
- Termination: Polymerase releases
In eukaryotes, additional processing:
- 5' cap added
- Introns spliced out
- 3' poly-A tail added
Translation
RNA to protein:
- mRNA carries genetic code
- tRNA brings amino acids
- Ribosomes catalyze protein synthesis
- Genetic code: 64 codons for 20 amino acids
Regulation
Gene expression control:
- Transcription factors: Activate/repress genes
- Enhancers/silencers: Regulatory DNA sequences
- Epigenetics: DNA methylation, histone modifications
- Post-transcriptional: RNA processing, stability
Mutations and Repair
Types of Mutations
Point Mutations
| Type | Effect | Example |
|---|---|---|
| Silent | No amino acid change | AAA→AAG (both = Lys) |
| Missense | Different amino acid | GAG→GTG (Glu→Val) |
| Nonsense | Premature stop codon | TAT→TAA (Tyr→Stop) |
Larger Mutations
- Insertions: Addition of nucleotides
- Deletions: Removal of nucleotides
- Frameshift: Insertions/deletions not divisible by 3
- Duplications: Segment repeated
- Inversions: Segment reversed
- Translocations: Movement between chromosomes
Causes of Mutations
- Spontaneous: Replication errors, tautomeric shifts
- Induced: Radiation, chemicals, viruses
- Environmental: UV light, carcinogens
DNA Repair Mechanisms
| Mechanism | Damage Type | Process |
|---|---|---|
| Direct repair | UV damage | Photolyase reverses |
| Base excision repair | Single base damage | Remove and replace |
| Nucleotide excision repair | Bulky lesions | Remove segment |
| Mismatch repair | Replication errors | Correct mismatches |
| Double-strand break repair | Breaks | NHEJ or homologous recombination |
Genetic Variation
Sources of Variation
- Mutation: Ultimate source of new alleles
- Recombination: New combinations during meiosis
- Sexual reproduction: Combines parental genomes
Single Nucleotide Polymorphisms (SNPs)
- Single base differences between individuals
- ~4-5 million SNPs per person
- Used in genetic studies
- Some associated with disease risk
Structural Variation
- Copy number variants (CNVs)
- Insertions and deletions
- Inversions
- Translocations
DNA Technology
Polymerase Chain Reaction (PCR)
Amplifies specific DNA sequences:
- Denaturation: Heat separates strands (95°C)
- Annealing: Primers bind (50-65°C)
- Extension: Polymerase synthesizes (72°C)
- Repeat 25-35 cycles
Applications:
- Forensics
- Diagnosis
- Cloning
- Research
DNA Sequencing
Sanger Sequencing
- Chain termination method
- Uses dideoxynucleotides
- Gold standard for accuracy
- Limited throughput
Next-Generation Sequencing
| Platform | Read Length | Output | Application |
|---|---|---|---|
| Illumina | 150-300 bp | High | General purpose |
| PacBio | 10-25 kb | Medium | Long reads |
| Oxford Nanopore | Unlimited | Variable | Portable, real-time |
CRISPR-Cas9
Revolutionary gene editing:
- Cas9 protein cuts DNA at specific location
- Guide RNA directs to target
- Enables precise editing
- Applications: Disease treatment, research, agriculture
DNA Fingerprinting
Individual identification:
- Analyzes variable regions (STRs)
- Unique pattern for each person
- Used in forensics, paternity testing
- Probability of random match: ~1 in billions
The Human Genome
Human Genome Project (1990-2003)
Landmark achievement:
- Sequenced entire human genome
- International collaboration
- Cost: ~$3 billion
- Revealed ~20,000 genes
Genome Composition
| Component | Percentage | Description |
|---|---|---|
| Protein-coding | ~1.5% | Genes |
| Regulatory | ~8% | Controls gene expression |
| Introns | ~25% | Non-coding gene portions |
| Repetitive | ~45% | Transposons, satellites |
| Other non-coding | ~20% | Various functions |
Personal Genomics
Modern DNA analysis:
- Direct-to-consumer testing
- Ancestry information
- Health risk assessment
- Pharmacogenomics
- Ethical considerations
Evolution and DNA
Molecular Evolution
DNA provides evolutionary evidence:
- Sequence comparison reveals relationships
- Molecular clock estimates divergence times
- Common genetic code supports common ancestry
Comparative Genomics
| Organism | Genes Shared with Humans |
|---|---|
| Chimpanzee | ~99% |
| Mouse | ~85% |
| Fruit fly | ~60% |
| Yeast | ~30% |
| Bacteria | Core metabolic genes |
See Also
References
- Watson, J.D. (1968). The Double Helix. Atheneum.
- Alberts, B., et al. (2015). Molecular Biology of the Cell (6th ed.). Garland Science.
- Mukherjee, S. (2016). The Gene: An Intimate History. Scribner.