Methodology
ProbFut uses statistical models to calculate the probability of each team reaching their objectives in the league. All predictions are based on real match results and updated after every round.
Elo Ratings
Each team has a score that reflects its relative strength. After every match, scores are adjusted based on the result, the scoreline, and the difference in strength between the teams. Home teams receive a bonus reflecting their home advantage. At the start of each season, scores are partially regressed toward the mean to prevent past seasons from weighing too heavily.
Goal Model
To predict the outcome of each future match, we use the Elo difference between teams to estimate how many goals each should score. From these estimates, we calculate the probability of every possible scoreline. We apply a statistical correction that improves accuracy for low-scoring matches, where the base model tends to be less precise.
Monte Carlo Simulation
We simulate the rest of the season thousands of times. In each simulation, we draw a scoreline for every future match based on the calculated probabilities, update the standings, and record each team's final position. By the law of large numbers, the more simulations we run, the closer the results converge to the true probabilities. In practice, after a few thousand iterations the probabilities stabilize with high precision.
Wins Needed
For each objective (title, qualification for international competitions, relegation), we calculate how many wins a team needs to achieve it at different confidence levels. We force the team to win a specific number of matches and check in how many simulations it reaches the objective. The minimum number of wins that guarantees the objective in 75%, 90%, or 99% of simulations is what we call Wins Needed.
Updates
Data is updated automatically after each round. The pipeline fetches results, recalculates ratings, runs simulations, and publishes the new numbers on the site.
References
- 01Elo rating system — originally developed by Arpad Elo for chess, adapted for football.
- 02Maher, M.J. (1982). Modelling association football scores. Statistica Neerlandica, 36(3), 109-118.
- 03Dixon, M.J. & Coles, S.G. (1997). Modelling Association Football Scores and Inefficiencies in the Football Betting Market. Journal of the Royal Statistical Society: Series C, 46(2), 265-280.
- 04Monte Carlo simulation — computational method that uses random sampling to obtain numerical results.