Lesson 3 of 13
From gene to protein
Explain the central dogma in plain terms — a gene is copied into RNA, and the RNA is read to assemble a protein — and that proteins are the machines that actually do the work (enzymes, structure, signals), so 'a gene for X' really means 'a recipe for a protein that affects X'.
01 · Learn · the idea
You now know two things: a cell is a factory, and its instructions are written in DNA as a four-letter text. But a recipe sitting in a book doesn’t feed anyone. Something has to read it and do the cooking. How does a stretch of A, C, G, and T — locked away in the nucleus, never allowed to leave — turn into a working part of you? The answer is one of the most important sentences in all of biology, and it’s short enough to hold in your head: DNA is copied to RNA, and RNA is read to build a protein. That sentence is called the central dogma, and once you have it, you have the engine room of life.
Why a copy is made first
The master DNA stays in the nucleus — too precious to risk on the factory floor. So the cell doesn’t use the original. It makes a working copy of just the one recipe it needs, and sends the copy out. That copy is RNA.
RNA is almost the same four-letter text as DNA, with one cosmetic difference: wherever DNA has a T, RNA uses a U instead. Same information, disposable format. Making this copy is called transcription — like transcribing a page from a reference book you can’t take home. The DNA stays put; the RNA goes to work.
Reading three letters at a time
Out on the floor, a ribosome reads the RNA. But it doesn’t read letter by letter. It reads in groups of three. Each triplet — called a codon — names one amino acid, and amino acids are the beads that, strung together, make a protein.
There are twenty kinds of amino acid. The ribosome moves along the RNA, three letters at a time, looking up each triplet and adding the matching bead to a growing chain: read three, add a bead, read three, add a bead. This is translation — turning the language of letters into the language of proteins. One triplet, AUG, also means “start here.” A few triplets — UAA, UAG, UGA — mean “stop”: the chain is finished and released.
When the chain is done and folds into its shape, you have a protein. And proteins, remember, are the machines that do nearly everything — the enzymes that run chemistry, the structures that hold cells together, the signals like insulin. So the full path is:
DNA → (transcribe) → RNA → (translate) → protein → the work gets done.
That’s the central dogma. A gene is just the stretch of DNA that holds one such recipe. “A gene for X” really means “the recipe for a protein that affects X.”
A worked example, letter by letter
Take a short gene that spells out a tiny protein. Its DNA reads ATG GTT CAT GGA ACG TAA (the spaces are just to show the triplets).
Transcribe it — every T becomes U — and the RNA is AUG GUU CAU GGA ACG UAA.
Now translate, triplet by triplet:
AUG→ Met (this is also the “start”)GUU→ ValCAU→ HisGGA→ GlyACG→ ThrUAA→ Stop
The protein is the chain Met–Val–His–Gly–Thr. Five beads, in that exact order, set entirely by the gene.
Now watch what one typo does. Suppose a single letter in the gene gets miscopied — say the G that starts GGA becomes a T. That triplet is now TGA, which transcribes to UGA — a stop signal. The ribosome reaches it and quits early. Instead of the full five-bead protein, you get a stub: Met–Val–His, then nothing. One wrong letter, and the factory ships a broken, half-finished part.
Other typos are gentler. Change a different letter and you might swap just one bead — Val becomes Asp, say — leaving a protein that’s the right length but subtly wrong. And some typos change nothing at all, because the code has spare room: more than one triplet can name the same amino acid. You’ll get to make these typos yourself in a moment and watch each kind play out.
On the whole
This is the quiet machine running inside every cell, billions of times a second, right now in you. A line of text is copied; the copy is read three letters at a time; beads are strung into a machine; the machine does a job. It is astonishingly simple and astonishingly literal — life is a reading-and-building process, and most of what goes wrong in biology is a problem somewhere along that line.
It also sets up the honest frame for everything ahead. A “genetic disease” is usually a typo in one recipe that the cell faithfully keeps following. A drug often works by helping or blocking one of the protein machines this line produces. And a single misread letter — the difference between a working protein and a broken one — is, in real diseases like sickle-cell, exactly that small and exactly that consequential. Hold the central dogma, and the rest of this course is variations on one theme: reading the text, fixing the typos, or working with the machines the text builds — while never forgetting how long the road is from a line of letters to a healthy person.
02 · Try · the lab
03 · Check · quick quiz
1. State the central dogma in the right order.
- Protein is copied to RNA, which builds DNA
- DNA is copied to RNA, and RNA is read to build a protein
- RNA is copied to DNA, which builds a protein directly
- DNA builds a protein directly, with no copy step
Answer
DNA is copied to RNA, and RNA is read to build a protein — The master DNA stays in the nucleus; a working RNA copy is made (transcription) and carried out to be read into a protein (translation). DNA → RNA → protein.
2. Why does the cell make an RNA copy instead of using the DNA directly?
- RNA can hold more information than DNA
- The DNA is too precious to leave the nucleus, so a disposable working copy is sent out
- DNA cannot be read by ribosomes under any circumstances
- RNA letters spell different words than DNA letters
Answer
The DNA is too precious to leave the nucleus, so a disposable working copy is sent out — The master DNA stays safe in the nucleus. RNA is the same information in a disposable format (T becomes U) that can be sent to the ribosome and discarded after use.
3. A ribosome reads RNA in groups of three letters. What does each triplet specify?
- One whole protein
- One of the four DNA letters
- One amino acid — a bead in the protein chain
- Whether the gene is switched on or off
Answer
One amino acid — a bead in the protein chain — Each triplet (codon) names one amino acid. The ribosome reads three letters, adds that bead, and moves on — stringing the protein together until it hits a 'stop' triplet.
4. A single letter in a gene is miscopied, turning a normal triplet into a 'stop' signal partway through. What happens to the protein?
- Nothing — one letter never matters
- The whole protein is built but folds backwards
- The build halts early, producing a truncated, broken protein
- The cell deletes the gene and starts over
Answer
The build halts early, producing a truncated, broken protein — A 'stop' triplet ends translation. If a typo creates one early, the ribosome quits there, shipping a half-finished protein. One wrong letter can break the part — which is how some genetic diseases work.