Daylila
How games actually work

Lesson 4 of 13

The reward schedule

Explain why unpredictable rewards compel more — the slot-machine mechanism in games.

01 · Learn · the idea

You open a chest and it’s empty. So you open another. Empty. A third — and out spills a rare reward, gold light everywhere. Your heart jumps. You open a fourth straight away. If every chest had simply handed you the same small coin, you’d have stopped opening long ago. The emptiness is doing the work. That’s not an accident. It’s the oldest trick in the building, and it has a name.

Two ways to hand out a reward

Imagine a machine with a button. Every time you press it, sometimes a coin drops. There are two honest ways to set the timing.

The first is fixed: a coin every fifth press, like clockwork. Press, press, press, press — coin. Predictable. You can feel the rhythm.

The second is variable: a coin on average every fifth press, but you never know which press pays. It might come on the second. It might not come for fifteen. Over a long run, both machines pay out exactly the same — one coin per five presses. Same average. Same generosity.

But they do not feel the same. And they do not grip you the same. The variable one will hold you for hours after the fixed one has bored you stiff.

Why the unpredictable one wins

Here is the strange part. The variable machine pays no more than the fixed one. It just hides when.

That hiding is the hook. On the fixed machine, you know the coin is four presses away, so the three presses before it are dead time — chores you do to reach the payout. On the variable machine, every press might be the one. There is no dead time. Each press carries a live spark of “maybe now.” Your brain treats the maybe as a tiny reward all by itself, before any coin appears.

This is variable-ratio reinforcement — rewarding an action after an unpredictable number of repeats. Psychologists found long ago that it produces the most relentless, hardest-to-stop behaviour of any reward schedule they could build. Not the most generous. The most relentless.

The slot machine, in plain sight

You already know the purest example. A slot machine is a variable-ratio device and nothing else. Pull, sometimes win, pull again. The casino tunes how often it pays so that, on average, you slowly lose — yet you keep pulling, because the next pull might be the one. The lights and the cherries are decoration. The schedule is the engine.

A game’s loot chest, card pack, or random drop is the same machine wearing a friendlier coat. You do the action, the reward comes on an unpredictable count, and the “one more turn” pull is the slot machine’s pull by another name.

The dry-streak test

Now the part that makes variable rewards genuinely hard to escape. It’s about what happens when the rewards quietly stop — what’s called extinction, the rewards switching off while you keep acting.

Walk it through. Two players, two machines, and we secretly cut all payouts to zero.

The fixed-machine player expected a coin on the fifth press. Fifth press — nothing. Sixth — nothing. Within a few presses the broken pattern screams at her. This is busted. She quits, fast. The predictability that bored her is exactly what lets her notice it’s dead.

The variable-machine player has no pattern to break. He’s lived through dry runs before — eleven empty presses, then a jackpot. So when the real switch-off comes, the first twelve empty presses look identical to ordinary bad luck. He keeps going. Twenty empties. Thirty. He can’t tell “it’s over” apart from “I’m just cold right now,” because on this machine those two things have always looked the same. He pulls long after the well has run dry.

That’s the whole, uncomfortable point. The variable schedule doesn’t just compel harder while rewards flow — it keeps you acting after they stop, because a dead game and an unlucky streak feel exactly alike from the inside.

The same lever, two intentions

None of this is evil by itself. A roguelike that scatters surprising loot makes exploration thrilling precisely because you can’t predict the next find — the unpredictability is the delight. The same schedule that powers a slot machine powers a genuinely joyful treasure hunt.

What separates them is intent. Is the unpredictability there to make discovery feel alive — or to keep you pressing past the point where you’re enjoying it, toward a wallet or a clock? The mechanism on the workbench is identical. Only the designer’s aim differs, and from the player’s seat the two can be nearly impossible to tell apart in the moment.

On the whole

Variable rewards aren’t a gaming gimmick. They run under slot floors, social feeds you refresh, inboxes you check, the phone you pick up “just to see.” Anywhere an action sometimes pays and you can’t predict when, the same engine is turning — pulling you back not because each pull is good, but because the next one might be.

Seeing the schedule doesn’t switch off the pull; the people who design these systems can feel it too. But it changes what a dry streak means. The next time you can’t stop opening, refreshing, pulling, the honest question isn’t “am I unlucky” — it’s “is there anything left in here, or am I just on the variable machine, doing the one thing it was built to make me keep doing?” We’re not standing outside the casino looking in. We are the hand on the lever, and the lever was shaped to fit it.

02 · Try · the lab

03 · Check · quick quiz

1. Two loot chests are tuned to give the same rare item once every ten opens on average. Chest A drops it on exactly every 10th open; Chest B drops it at random, so any open might be the one. Both are equally generous. Which keeps a player opening longer, and why?

  • Chest A, because the player can plan around the guaranteed 10th open
  • Neither — identical average payout means identical pull
  • Chest B, because every open carries a live 'maybe now', so no open feels like dead time
  • Chest A, because predictable rewards always feel more satisfying
Answer

Chest B, because every open carries a live 'maybe now', so no open feels like dead time — Same average, very different pull. On Chest A the nine opens before the payout are dead time. On Chest B every open might pay, so each one carries a spark of 'maybe now' — that unpredictability is variable-ratio reinforcement, the most relentless schedule there is.

2. A game quietly stops dropping any loot at all (the rewards are switched off). The lesson says a player on a variable, random-drop schedule keeps grinding far longer than one on a fixed, every-Nth-kill schedule. Why does the variable player keep going?

  • On a random schedule, a dry streak looks exactly like ordinary bad luck — so 'it's over' is indistinguishable from 'I'm just cold right now'
  • The variable player is simply more patient by nature
  • Random drops are secretly more frequent than fixed ones
  • The fixed player has already collected everything there is to get
Answer

On a random schedule, a dry streak looks exactly like ordinary bad luck — so 'it's over' is indistinguishable from 'I'm just cold right now' — This is extinction — rewards switching off. The fixed player notices the broken pattern almost at once and quits. The variable player has lived through dry runs before, so the silence looks identical to bad luck and he can't tell the game is dead. The predictability the fixed player found dull is exactly what lets her stop.

3. A designer adds randomised treasure to a treasure-hunt game so that finding it stays surprising and thrilling. The same variable-reward mechanism also runs a slot machine. What actually separates the two?

  • The slot machine uses real money, which changes the underlying psychology of the schedule
  • The treasure-hunt version uses a different, gentler reward schedule
  • Nothing — both are equally harmful because the mechanism is identical
  • The intent: whether the unpredictability is there to make discovery feel alive, or to keep you acting past the point of enjoyment toward a wallet or a clock
Answer

The intent: whether the unpredictability is there to make discovery feel alive, or to keep you acting past the point of enjoyment toward a wallet or a clock — The mechanism on the workbench is the same variable-ratio schedule in both. What differs is the designer's aim — delight versus keeping you pulling past enjoyment. The schedule is neutral; intent is the line between honest fun and engineered compulsion, and from the player's seat the two can be hard to tell apart in the moment.