First-time user guide

Your 5-minute walk-through of the
World Cup 2026 Prediction Engine

Everything you need to read the predictions, run your own scenarios, and understand why the numbers are honest โ€” no stats background required.

โฑ๏ธ ~5 min read ๐Ÿงญ 6 guided steps ๐ŸŽฏ No setup needed

The 60-second tour

  1. It already ran. Open the engine and 10,000 tournament simulations have already played out โ€” results are on screen instantly.
  2. Scroll to "Who wins it all?" to see each team's chance of lifting the trophy.
  3. Click any group match to reveal its three most likely scorelines and a plain-English reason.
  4. Play in the Match Lab โ€” pit any two teams head-to-head, anywhere.
  5. One golden rule: these are probabilities, not predictions set in stone. A 60% favourite still loses 4 times in 10.
1

Get oriented (and read the one rule)

When the page loads, the engine has already simulated the whole tournament for you โ€” there's nothing to install or press to begin. The very first thing you'll see is an honesty banner. Read it once. It explains the single most important idea in the whole tool:

Football is gloriously unpredictable. The best forecasters on earth get the result right only about 50โ€“55% of the time, and the most likely exact score lands only ~10โ€“12% of the time. So this engine gives you well-calibrated probabilities, never false certainty.

Engine header and honesty panel showing factors used and excluded
Top of the engine. The green/red panel shows exactly what the model uses (team strength, home advantage, scoreline maths) and what it deliberately ignores (coach "mind-reading", social-media sentiment, anything claiming 99.99%).
Why this matters: every number you see already accounts for uncertainty. When the model says a team has a 70% chance, it genuinely expects them to slip up about 3 times in 10.
2

See who's likely to win it all

Scroll to "Who wins it all?" Each row is a team, with two bars:

  • Solid blue bar + the % on the right โ€” chance of winning the trophy.
  • Faint bar underneath โ€” chance of reaching the final.
What the title-odds rows look like
1
๐Ÿ‡ช๐Ÿ‡ธ Spain
21.3%
2
๐Ÿ‡ฆ๐Ÿ‡ท Argentina
14.3%

Notice how even the favourite sits around 20%, and the chances taper off gently rather than dropping off a cliff. That flat top is the real shape of World Cup uncertainty โ€” the best team in the world is still more likely not to win than to win.

โ–ถ Try this Find your team in the list. If they're not in the top 16, they're outside the headline contenders โ€” but the group view (next step) shows exactly how far they're expected to go.
3

Dive into the groups & match predictions

The Groups tab shows all 12 groups. Each card has two parts: a mini-table and the six match predictions for that group.

  • Qual = chance of reaching the knockouts. Win = chance of finishing top of the group. (Colours: green = likely through, amber = on the bubble, grey = long shot.)
  • Every match has a three-colour bar: green = left team wins ยท grey = draw ยท red = right team wins.
  • Click any match to expand it: you'll see the three most likely scorelines and a one-line "Model read" explaining the edge.
Group H standings and the expanded Spain versus Uruguay match prediction
Group H, with Spain vs Uruguay expanded. Spain are 56% to win, draw 27%, Uruguay 17%. The most likely scoreline (1โ€“1) is still only 12.5% โ€” proof that even a clear favourite produces a wide spread of plausible scores.
Reading scorelines: notice no single exact score climbs much above ~12โ€“15%. That's correct, not a flaw โ€” exact scores are genuinely hard, so the model spreads its confidence across many plausible results.
โ–ถ Try this Open the most lopsided match you can find (e.g. Spain vs a minnow) and a coin-flip match (two close teams). Watch how the green/red bar goes from one-sided to nearly even โ€” that's the model expressing certainty vs genuine doubt.
4

Run your own what-ifs

The Model controls & what-if panel lets you bend the model's assumptions and watch everything update. The sliders are plain-English knobs:

Model controls panel with sliders and the Temperature slider tooltip open
Model controls. Hover the โ“˜ next to any control for an explanation. Here the Temperature (T) tooltip is open, and the green box shows the live quality readout for the current setting.
  • Host home advantage โ€” how big a boost the USA, Canada & Mexico get at home.
  • Avg goals / match โ€” the tournament's overall scoring level.
  • Strength sensitivity โ€” how much a rating gap translates into a result. Lower = bigger teams dominate more.
  • Draw tendency โ€” nudges how often matches end level.
  • Temperature (T) โ€” the calibration dial (see Step 6).
  • Edit team ratings โ€” the most fun button: drop a team's rating to simulate a key injury, or boost one to test a hot streak.
Two kinds of update: match cards and the Match Lab refresh instantly as you drag. The headline title odds re-run when you press Re-run simulation (or when you move the Temperature slider, which re-runs automatically). Reset defaults puts everything back.
โ–ถ Try this Click Edit team ratings, knock 150 points off your favourite's rating (a "star striker is injured" scenario), then Re-run simulation and watch their title odds drop.
5

Pit any two teams in the Match Lab

Switch to the Match Lab tab to model any head-to-head โ€” even teams in different groups. Pick Team A and Team B, choose a neutral venue or give one side home advantage, and you get a full breakdown:

Example: ๐Ÿ‡ช๐Ÿ‡ธ Spain vs ๐Ÿ‡ซ๐Ÿ‡ท France โ€” neutral venue
๐Ÿ‡ช๐Ÿ‡ธ Spain win
40%
Draw (90')
31%
๐Ÿ‡ซ๐Ÿ‡ท France win
28%
Most likely scorelines
1โ€“114.7% 0โ€“011.0% 1โ€“010.4%
If it goes to a penalty shootout
55% Spain45% France
Shootouts are close to a coin flip โ€” even a clear favourite rarely tops ~55%. Overall, Spain advance 58% of the time.
The shootout is honest on purpose. Many people assume the better team wins penalties 60โ€“40. History says it's far closer to 50โ€“50, so the Lab keeps it near a coin flip.
โ–ถ Try this Pick two giants (say Spain vs Brazil), then toggle "Team B home". Watch the bars shift โ€” then ask yourself whether the swing matches your gut.
6

Check the evidence (and tune the calibration)

This is what sets the engine apart: it doesn't just say "trust me." Scroll to "Is this model actually any good?" The engine replays its exact model over the real 2018 & 2022 World Cup group stages (96 matches) and scores itself. A few terms, in plain English:

  • Accuracy โ€” how often the top pick was right (~57%).
  • Log-loss / RPS / Brier โ€” quality scores that punish confident mistakes. Lower is better.
  • Reliability diagram โ€” the honesty test: when the model says "30%", does it happen ~30% of the time? Dots should sit on the diagonal line.
  • Sharpness โ€” whether it makes bold calls or hedges everything near 33%.

The headline finding is the Temperature (T) calibration. The raw model was over-confident on big favourites (it was 92% sure Brazil would beat Cameroon in 2022 โ€” Brazil lost). Temperature scaling gently tones those extremes down. Here's the before/after:

Reliability diagram before and after calibration at T equals 1.0 and 1.845
Left (T = 1.0, raw): the dot at the bottom-right is the model being 90%+ sure on games it lost. Right (T = 1.845, calibrated): those over-confident dots are gone โ€” nothing now exceeds ~75%.

The Temperature (T) slider in Model Controls lets you explore this yourself: slide it down for bolder, sharper calls; up for safer, smoother ones. The default 1.845 is the cross-validated sweet spot, and the green box shows the quality score updating live as you drag.

โ–ถ Try this Drag Temperature to 0.8 and watch the reliability dots fly above 90% again (over-confident); drag to 3.0 and watch everything hug the diagonal but go timid. 1.845 is the balance.

The one rule: how to read a probability

A win probability is a frequency, not a verdict. "Spain 72% to win the group" means: if this group were played 100 times, Spain top it about 72 of them โ€” and miss out the other 28.

56% SpainDraw 27%17% Uruguay

So even in a match Spain are favoured to win, there's a 44% chance they don't (draw + loss). The engine is being honest about that โ€” and that honesty is the whole point.

Controls cheat-sheet

Every knob, what it does, its default, and a quick experiment.

ControlWhat it doesDefaultTry
Host home advantageBoost for USA / Canada / Mexico at home+70Set 0 to neutralise home edge
Avg goals / matchOverall scoring level of the tournament2.6Raise it for a higher-scoring event
Strength sensitivityHow much a rating gap decides the result180Lower = favourites dominate more
Draw tendency (ฯ)Nudges how often matches end levelโˆ’0.08Push toward 0 for fewer draws
Temperature (T)Calibration dial (confidence of probabilities)1.845Down = bolder, up = safer
SimulationsHow many tournaments are played out10,000More = smoother title odds
Apply calibrationMaster on/off for temperature scalingONToggle off to see the raw model
Edit team ratingsHand-edit any team's strengthโ€”Simulate an injury or hot streak

Frequently asked (and a few gotchas)

Why is Ecuador rated near Germany? That looks wrong.
The engine is seeded with public World Football Elo ratings, which rate strong South-American sides generously. It's faithful to the source โ€” and fully adjustable. If you disagree, click Edit team ratings and change it; everything recomputes.
I moved a slider but the title odds didn't change. Bug?
No โ€” by design. Match cards update instantly, but the headline title odds only re-run when you press Re-run simulation (running 10,000 tournaments takes a moment). The one exception is the Temperature slider, which re-runs automatically.
Are the knockout matchups real fixtures?
No. The 72 group matches are the real, scheduled fixtures. Knockout matchups depend on who finishes where, so they can't be known in advance โ€” the engine projects them probabilistically through 10,000 simulations to produce advancement and title odds.
Can it tell me the exact final score of a match?
It gives you the most likely scorelines with honest probabilities โ€” but no single exact score is ever very likely (the top one is usually ~10โ€“15%). Anyone promising a guaranteed exact score is selling certainty that doesn't exist in football.
Is there a bookmaker comparison?
Not yet. A real Pinnacle closing-odds benchmark would need a historical odds dataset that isn't loaded โ€” and the engine won't invent odds. If you supply the data, it can be wired in as a comparison row.
So what accuracy can I actually expect?
On past World Cup group stages the model gets the result right about 57% of the time โ€” right in line with the best public models and betting markets. There is no 99.99%; football's unpredictability is irreducible, and this tool is built to respect that.