England football fans are starting to sing ‘It’s coming home’; while followers of Scotland and Wales hope their teams can cause a few shocks at Euro 2020. But the chances of these outcomes are slim, despite what’s suggested by the bookmakers’ odds.
The delayed UEFA European Football Championship starts today, 11 June 2021. The media are ratcheting up the anticipation and pundits are filling the airwaves. Some football fans will fervently believe it is their nation’s time to taste success. Others will just be hoping that their team does well.
The definition of ‘well’, though, is a matter of expectations. For some, making it out of the group stage is a success. For others, anything less than the final is failure. The England manager, Gareth Southgate, has marked out the semi-finals as the minimum his team should achieve – and his job could be on the line if they don’t get that far.
Our analysis – which is based on measures of relative team strengths based on past performance plugged into a statistical model – suggests that the favourite is Belgium with a 24% probability of winning the title. But that still means that there is a 76% chance that Belgium will not lift the trophy.
Read on for the chances of success we calculate for every other team at Euro 2020 – and the opportunity to take part in our Euro 2020 predictions game.
Forecasting the results of Euro 2020
As the physics Nobel laureate Niels Bohr once said: ‘Prediction is very difficult, especially if it's about the future’. Even so, we decided to simulate the European Championships a few thousand times anyway. Why? Because it is fun, and as fans we can get some sense of how excited, or surprised, we might be about our teams’ chances of winning.
Our forecasting model is simple — it asks how many goals each team are expected to score in each match, given how strong they and their opponents are, and assuming that goals arrive at a particular rate (based on a probability distribution called the ‘Poisson’ – a very standard assumption in this context).
To measure relative team strengths, we use the 'Elo ratings' of each team, which are based on the entire history of international football – a recent win against a strong team is worth more than a win against a weak team. On the eve of the tournament, Belgium has the highest Elo rating in Europe, followed by France and Spain. North Macedonia is at the bottom of the list among the countries participating in Euro 2020.
Our model generates probability forecasts for the final score in a match (using the ‘Bivariate Poisson distribution’). For example, a 2-0 win for Italy in their opening game against Turkey is one quite likely outcome. After generating a match outcome from this distribution, we carry on to the next match, updating each team’s Elo ratings, and so on until we reach the final game. We do this whole process 10,000 times and count how often each team wins the tournament or gets to a particular stage.
The order of actions in tournaments can have significant effects on performance. This is as true for a major international tournament as it is for a kickabout at the local park. We have taken this into account in the model. The teams in Groups E and F have an advantage over others because they play on the last day of the group stage (23 June 2021). They will know exactly what results they need to be among the four best-ranked third-placed teams from the groups (which would allow them to qualify for the next stage).
For example, in Euro 2016, Portugal played on the last day of the group stage and knew that a draw would get them into the last 16 from third place. Coincidentally or otherwise, this is exactly what happened. Such a system also occurred in the World Cups of 1986, 1990 and 1994, as well as at Euro 2016. In all these tournaments, the third-ranked teams that played on the last day of the group stage always qualified for the knock-out stage.
A notable example concerns the Russian team in 1994, which finished third in Group B, one of the groups to finish earliest. Russia then needed only one of the following results to qualify: Bulgaria to lose to Argentina, Netherlands to lose to Morocco, Nigeria to lose to Greece; or Saudi Arabia to lose to Belgium. None of these happened and Russia headed home.
To illustrate the impact of this quirk on tournament outcomes, we assume that teams from Groups E and F, which play on the last day of the group stage, always qualify for the knock-out stage. The remaining two qualifying third-ranked teams are randomly picked from Groups A to D. In the forecast model results below, we compare the group stage qualifying probabilities to a hypothetical ‘fair qualifying’ schedule.
The countries that gain most from this are the teams unlikely to qualify in first or second in Groups E and F: Poland most markedly, but also Germany, Hungary and Slovakia. The countries that suffer most are those in Groups A to D that may have qualified in third: Ukraine, Denmark, Austria and Turkey.
Group A
Fair qualifying probability (%) | Qualifying probability adjusted to schedule (%) | Difference in probabilities caused by schedule | |
Italy | 89.1 | 85.4 | -3.8 |
Switzerland | 65.3 | 60.6 | -4.7 |
Turkey | 60.0 | 54.0 | -6.0 |
Wales | 55.3 | 50.2 | -5.0 |
Group B
Fair qualifying probability (%) | Qualifying probability adjusted to schedule (%) | Difference in probabilities caused by schedule | |
Belgium | 95.8 | 93.5 | -2.3 |
Denmark | 85.3 | 78.7 | -6.6 |
Russia | 46.0 | 42.8 | -3.2 |
Finland | 34.4 | 33.5 | -0.9 |
Group C
Fair qualifying probability (%) | Qualifying probability adjusted to schedule (%) | Difference in probabilities caused by schedule | |
Netherlands | 91.4 | 87.0 | -4.5 |
Ukraine | 82.2 | 75.3 | -6.8 |
Austria | 65.3 | 59.0 | -6.2 |
North Macedonia | 30.6 | 29.1 | -1.5 |
Group D
Fair qualifying probability (%) | Qualifying probability adjusted to schedule (%) | Difference in probabilities caused by schedule | |
England | 95.1 | 92.3 | -2.8 |
Croatia | 80.8 | 76.1 | -4.6 |
Czech Republic | 49.5 | 46.6 | -2.9 |
Scotland | 37.7 | 35.9 | -1.8 |
Group E
Fair qualifying probability (%) | Qualifying probability adjusted to schedule (%) | Difference in probabilities caused by schedule | |
Spain | 94.4 | 96.4 | 2.0 |
Sweden | 80.1 | 88.1 | 8.1 |
Poland | 64.7 | 77.9 | 13.2 |
Slovakia | 27.8 | 37.6 | 9.9 |
Group F
Fair qualifying probability (%) | Qualifying probability adjusted to schedule (%) | Difference in probabilities caused by schedule | |
France | 88.3 | 92.5 | 4.2 |
Portugal | 80.5 | 86.9 | 6.4 |
Germany | 71.5 | 81.8 | 10.3 |
Hungary | 29.3 | 38.9 | 9.5 |
Knock-out stage
If a knock-out match finishes level in the model, we decide the penalty shootout by assuming that the probability each team wins would be equal to the Elo prediction for that match (so the better team wins more than the worse team, but not always). This is known as a ‘binomial event’.
Below is a table showing the model’s forecast probabilities of winning Euro 2020 for each country, as well as the probabilities that each country makes the final, semi-finals and quarter-finals.
Team | Champions | Final | Semi-finals | Quarter-finals |
Belgium | 24.02 | 35.23 | 52.71 | 71.53 |
France | 18.45 | 28.85 | 48.50 | 65.93 |
Spain | 11.92 | 26.23 | 42.36 | 73.16 |
Italy | 8.81 | 16.31 | 29.90 | 61.83 |
Portugal | 8.23 | 16.13 | 33.75 | 51.88 |
England | 7.29 | 16.47 | 27.49 | 56.34 |
Denmark | 4.67 | 10.13 | 21.92 | 40.87 |
Germany | 4.64 | 10.84 | 23.88 | 41.18 |
Netherlands | 3.36 | 9.45 | 24.27 | 47.31 |
Sweden | 1.69 | 5.38 | 14.92 | 41.09 |
Switzerland | 1.46 | 4.56 | 12.65 | 30.72 |
Ukraine | 1.00 | 3.00 | 9.79 | 27.03 |
Croatia | 0.92 | 3.36 | 9.42 | 34.01 |
Turkey | 0.84 | 2.68 | 8.11 | 24.60 |
Poland | 0.82 | 3.07 | 9.51 | 29.06 |
Wales | 0.74 | 2.67 | 8.55 | 23.22 |
Russia | 0.34 | 1.25 | 4.19 | 11.61 |
Austria | 0.29 | 1.38 | 5.00 | 15.39 |
Hungary | 0.16 | 0.89 | 3.82 | 9.75 |
Czech Republic | 0.13 | 0.87 | 3.20 | 15.38 |
Slovakia | 0.10 | 0.43 | 1.82 | 7.93 |
Finland | 0.07 | 0.32 | 1.81 | 6.27 |
North Macedonia | 0.03 | 0.22 | 1.08 | 4.79 |
Scotland | 0.02 | 0.28 | 1.35 | 9.12 |
The model suggests that the tournament favourite is Belgium, with a 24% probability of winning the title. But it still means that there is a 76% chance that Belgium will not lift the trophy. The other teams with a greater than 10% chance of winning are France and Spain.
England fans can be happy that their 7% chance of winning the trophy is higher than Germany’s, but it does suggest that the bookmakers are hedging their bets against patriotic fervour, given some have England as equal tournament favourites with France.
Scotland have never reached the knock-out stages of a major tournament, and so will be heartened by an approximately 36% chance of reaching the last 16, and even a 9% chance of reaching the quarter-finals. Wales have a 9% chance of repeating their heroics of 2016 when they raced to the semi-finals.
How hard is it to predict football match outcomes?
The beauty of football, especially at the European Championships, is that seemingly anything can happen. In 1992, Denmark replaced Yugoslavia at the last minute and won the whole tournament. In 2004, Greece beat hosts Portugal in the final. Greece’s pre-tournament odds that year were 150-to-1 to win, which implies a probability of less than 1% – not dissimilar to our forecast for the chances of Wales winning this time. Scotland fans need to hope for a miracle.
Home advantage is not factored into our model, and is typically large in professional football. On a related note, the impact of travel fatigue could come into play. Due to the design of Euro 2020, which is taking place all over Europe, there could be significant scheduling effects. There is evidence from the post-Covid-19 period that the presence of fans does significantly affect football match outcomes, through social pressure and its effects on the referee.
Finally, the pre-tournament Elo ratings that drive the model are mostly based on recent non-tournament match outcomes. Some might argue that Belgium, for example, are overrated and may be underprepared for the grind of tournament football due to having less depth in their playing squad compared with the likes of France and Spain. These are all factors that, to some extent, make a mockery of our forecasts. But they are also why football is such an exciting and unpredictable game.
Where can I find out more?
- The Scorecasting Economists’ Euro 2020 Forecasts in full: Analysis by James Reade.
- Hybrid machine learning forecasts for the UEFA Euro 2020: An alternative Euro 2020 forecasting model by Achim Zeileis (see here for 2018, 2016 and 2008 forecasts of Achim’s model).
- First in first win: Evidence on schedule effects in round-robin tournaments in mega-events: Evidence on scheduling effects in major football tournaments by Alex Krumer.
- Causal effects of an absent crowd on performances and refereeing decisions during Covid-19: A study of home advantage and the impact of the crowd by the authors of this article and colleagues.
- The Cauldron Has Cooled Down: A Systematic Literature Review on COVID-19, Ghost Games, and Home Advantage in Football from a Behavioral Science Perspective: A review of the research evidence by Michael Christian Leitner, Frank Daumann, Florian Follert and Fabio Richlan.
- Is football a matter of life and death – or is it more important than that?: A study of the happiness effects (and importance) of football by Peter Dolton and George MacKerron.
Who are experts on this question?
- Alex Krumer, Molde University College
- James Reade, University of Reading
- Carl Singleton, University of Reading
Do you think you can beat the experts at predicting the Euros?
If so, click this link to submit scoreline predictions for every match during the tournament!