From RNA to protein

Overview of transcription and translation in prokaryotes vs. eukaryotes

Tracking a trait from gene to phenotype: The BAD2 gene determines fragrance in rice

 

RNA

A. Ribonucleic acid (RNA) is a key nucleic acid in transcription and translation. RNA is like DNA except that:

  1. it is usually single rather than double stranded
  2. it contains ribose rather than deoxyribose
  3. it contains the base uracil rather than thymine

     A comprehensive comparison of deoxyribonucleotides and ribonucleotides

B. General classes of RNA: informational, functional, and regulatory

  1. Informational (messenger); mRNA _______________________
  2. Functional (transfer, ribosomal, and the "small" nuclear and cytoplasmic RNA)

  3. a. tRNA ______________________
    b. rRNA ______________________
    c. snRNA ______________________
  4. Regulatory: (RNAi)
  5. ______________________

Transcription (animation)

A. Overview: The process of transcription uses an intermediate - the messenger RNA (mRNA) - which can transmit the information in the DNA to the next step: translation. There are three steps in transcription: initiation, elongation, and termination2/20/09: Note: The figure accessed by the "Either strand" hyperlink in the next sentence is NOT CORRECT - The dark blue strand should be 5' left and 3' right and the complementary strand 3' left, 5' right. The mRNA transcripts are correct as shown in the Figure and the the concept is right ...Either strand of the DNA may be the template strand for RNA synthesis for a given gene.  For any given gene, the template strand is also referred to as the antisense (or non-coding)  strand and the non-template strand as the sense (or coding) strand.  The same DNA strand is not necessarily transcribed throughout the entire length of the chromosome or throughout the life of the organism.  

B. RNA polymerases: ______________________

 

C. The three steps in transcription

  1. Initiation: Transcription is initiated at the promoter. The promoter is a key feature for control of gene expression. Promoters have defined attributes, in terms of their sequence organization. Example: the HvBM5A promoter region.

 

Elongation:  ______________________

 

Termination: ______________________

 

5' and 3' UTRs: ______________________

D. Processing the mRNA transcript.  In prokaryotes the mRNA is sent on to the ribosome for translation. In eukaryotes, in contrast, the primary RNA transcript is processed into a mature mRNA before it is exported to the cytoplasm for translation (Figure). There are three components to processing:

E. Changes in intron sequence splicing can affect what the gene encodes

More reading on alternative splicing in plants: Chen et al. 2007. Plant Phys. 143: 1086-1095.

F: Expressed Sequence Tags (ESTs)

 

 

The genetic code

The sequence of a coding (sense, non-template) strand of DNA, read 5’ – 3’, specifies a sequence of amino acids (read N-terminus to C-terminus) via a triplet code. Each triplet is called a codon.  

A. Reading the DNA code: There are 64 codons; 61 represent amino acid codes and 3 cause the termination of protein synthesis (stop codons). 

 

 

B. Degeneracy: __________________________

 

Translation

A. Overview: The process of translation takes the information that has been transcribed from the DNA to the mRNA and, via some more intermediates (ribosomes and transfer RNA), give the sequence of amino acids that determine the polypeptide.

  1. Ribosomes: overall view of structure and subunits
  2. Transfer RNA (tRNA): (animation).

B. Steps in Translation

The translation process (animation - a; animation - b) involves initiation, elongation, and termination.

  1. Initiation: In addition to the mRNA, ribosomes, and tRNAs, initiation factors are required to start translation. The AUG codon specifies initiation, in the correct sequence context. It also specifies methionine (MET).
  2. Elongation: Much as initiation factors were important in the first step, now elongation factors come into play. The reactions also require additional components and enzymes.
  3. Termination: There are three "stop" codons.

VI. From DNA to amino acid sequence

The following data from GenBank (accession No.AY785841 ) illustrate several points made in the preceding sections on transcription, the DNA code, and translation.

1:  AY785841Hordeum vulgare s...[gi:60547428]

LOCUS       AY785841                 772 bp    DNA     linear   PLN 01-DEC-2005
DEFINITION  Hordeum vulgare subsp. vulgare cultivar Dicktoo HvCBF2A (CBF2A)
            gene, complete cds.
ACCESSION   AY785841
VERSION     AY785841.1  GI:60547428
KEYWORDS    .
SOURCE      Hordeum vulgare subsp. vulgare
  ORGANISM  Hordeum vulgare subsp. vulgare
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BEP
            clade; Pooideae; Triticeae; Hordeum.
REFERENCE   1  (bases 1 to 772)
  AUTHORS   Skinner,J.S., von Zitzewitz,J., Szucs,P., Marquez-Cedillo,L.,
            Filichkin,T., Amundsen,K., Stockinger,E.J., Thomashow,M.F.,
            Chen,T.H. and Hayes,P.M.
  TITLE     Structural, Functional, and Phylogenetic Characterization of a
            Large CBF Gene Family in Barley
  JOURNAL   Plant Mol. Biol. 59 (4), 533-551 (2005)
   PUBMED   16244905
REFERENCE   2  (bases 1 to 772)
  AUTHORS   Skinner,J.S., Szucs,P., von Zitzewitz,J., Marquez-Cedillo,L.,
            Filichkin,T., Stockinger,E.J., Thomashow,M.F., Chen,T.H. and
            Hayes,P.M.
  TITLE     Mapping of barley homologs to genes that regulate low temperature
            tolerance in Arabidopsis
  JOURNAL   Theor. Appl. Genet. (2006) In press
REFERENCE   3  (bases 1 to 772)
  AUTHORS   Skinner,J.S., Chen,T.H.H. and Hayes,P.M.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-OCT-2004) Horticulture, Oregon State University, 4017
            Ag. Life Sci. Bldg., Corvallis, OR 97331, USA
FEATURES             Location/Qualifiers
     source          1..772
                     /organism="Hordeum vulgare subsp. vulgare"
                     /mol_type="genomic DNA"
                     /cultivar="Dicktoo"
                     /sub_species="vulgare"
                     /db_xref="taxon:112509"
                     /chromosome="5"
                     /map="5H-L; Bin 10"
     gene            <1..>772
                     /gene="CBF2A"
     mRNA            <1..>772
                     /gene="CBF2A"
                     /product="HvCBF2A"
     5'UTR           <1..12
                     /gene="CBF2A"
     CDS             13..678
                     /gene="CBF2A"
                     /note="HvCBF2A-Dt; AP2 domain CBF protein; putative CRT
                     binding factor; monocot HvCBF4-subgroup member"
                     /codon_start=1
                     /product="HvCBF2A"
                     /protein_id="AAX23688.1"
                     /db_xref="GI:60547429"
                     /translation="MDTVAAWPQFEEQDYMTVWPEEQEYRTVWSEPPKRRAGRIKLQE
                     TRHPVYRGVRRRGKVGQWVCELRVPVSRGYSRLWLGTFANPEMAARAHDSAALALSGH
                     DACLNFADSAWRMMPVHATGSFRLAPAQEIKDAVAVALEVFQGQHPADACTAEESTTP
                     ITSSDLSGLDDEHWIGGMDAGSYYASLAQGMLMEPPAAGGWREDDGEHDDGFNTSASL
                     WSY"
     3'UTR           679..>772
                     /gene="CBF2A"
ORIGIN      
        1 tagctgcgag ccatggacac agttgccgcc tggccgcagt ttgaggagca agactacatg
       61 acggtgtggc cggaggagca ggagtaccgg acggtttggt cggagccgcc gaagcggcgg
      121 gccggccgga tcaagttgca ggagacgcgc cacccggtgt accgcggcgt gcgacgccgt
      181 ggcaaggtcg ggcagtgggt gtgcgagctg cgcgtccccg taagccgggg ttactccagg
      241 ctctggctcg gcaccttcgc caaccccgag atggcggcgc gcgcgcacga ctccgccgcg
      301 ctcgccctct ccggccatga tgcgtgcctc aacttcgccg actccgcctg gcggatgatg
      361 cccgtccacg cgactgggtc gttcaggctc gcccccgcgc aagagatcaa ggacgccgtc
      421 gccgtcgccc tcgaggtgtt ccaggggcag cacccagccg acgcgtgcac ggccgaggag
      481 agcacgaccc ccatcacctc aagcgaccta tcggggctgg acgacgagca ctggatcggc
      541 ggcatggacg ccgggtccta ctacgcgagc ttggcgcagg ggatgctcat ggagccgccg
      601 gccgccggag ggtggcggga ggacgacggc gaacacgacg acggcttcaa cacgtccgcg
      661 tcgctgtgga gctactagtt cgactgatca agcagtgtaa attattagag ttgtagtatc
      721 agtagctagt actactagct gtgttcttcc accaggcgtc aggcctggca ag
//
  1.   This sequence of 772 nucleotides encodes the gene HvCBF2A is from gDNA (genomic DNA) from the barley cultivar Dicktoo. Start reading the codons at nucleotide 1; the coding sequence starts at nucleotide 13 (codon = AUG = Met) and ends with nucleotide 678 (codon UAG = Stop). 
  2.    When DNA base sequences are cited, by convention it is the sequence of the non-template (sense, coding) strand that is given, even though the RNA is transcribed from the template strand.  The following Table shows highlighted sequences from the HvCBF2A gene and their interpretation.

Sequence

Type

5' atg gac aca.........tag 3’

Non-template DNA (decode replacing T with U )

3' tac ctg  tgt.........atc 5'

Template DNA

5'aug gac aca........uag3'

RNA (decode)

M     D     T           Stop

Amino acid code (See Table)

Methionine, Aspartic acid, Threonine

Amino acid code (See Table)


Relating transcription and translation to phenotypes 

A. Allelic variation at the DNA sequence level: the fragrance in rice example

 

B. Allelic variation in terms protein structure and function

From gene to polypeptide: There are 20 common amino acids and these are abbreviated with three-letter and one-letter codes.

 

Levels of protein structure: The primary , secondary, tertiary, and quaternary structures of protein.

 

Classification based on function: Proteins can be classified as active and structural. 

Enzymes (biological catalysts) have active sites

Structural proteins can have tremendous economic and cultural value, e.g. wheat endosperm storage proteins. The same proteins can casue intense suffering in certain individuals - e.g. celiac disease

Protein function and non-function: Changes in DNA coding sequence (mutations) can lead to changes in protein structure and function.

Proteomics:   “If the genome represents the words in the dictionary, the proteome provides the definitions of those words”.  For a look at a comprehensive protein: protein interaction study in Drosophila, see Giot et al. 2003. Science 302:1727-1735.

 

Text Readings: Chapters 8.  

 

Useful Links:

Another (North Dakota) perspective on this material  (NDSU PLSC731)