Wangari
Wangari Podcast
Why Most World Cup Predictions Are Wrong (And Why I Wrote a Book About Soccer ML Anyway)
0:00
-25:36

Why Most World Cup Predictions Are Wrong (And Why I Wrote a Book About Soccer ML Anyway)

Every four years, the models say Brazil. Every four years, the World Cup disagrees (except when Brazil actually wins and nobody predicted it).

In this episode, Ari Joury (PhD, particle physics; Founder & CEO of Wangari Global) turns his attention to the 2026 World Cup — and to why the machine learning models built to predict it are confidently wrong in specific, predictable ways. Drawing on his upcoming O’Reilly book Soccer Analytics with Machine Learning, he walks through four failure modes: the distribution shift between club and international football, the small-sample limits of Expected Goals (xG), the form-transfer illusion, and the incentive structures that push analysts to publish flashy numbers over honest ones. He then flips the argument: what actually does predict tournament outcomes, and what does that tell us about where ML earns its keep versus where it just looks like it does? The bigger lesson here is not about soccer — it is about knowing which questions your model can actually answer with the data you have.

Topics covered: Soccer analytics, World Cup prediction, Expected Goals (xG), distribution shift, small-sample statistics, feature engineering, predictive modeling, O’Reilly Media, enterprise AI context.

Wangari is the newsletter and podcast for practitioners and leaders navigating the real work of enterprise AI. New episodes every Friday.

https://wangari.global/contact

Ari’s book (O’Reilly Media, early release out now and officially out around June 25): Soccer Analytics with Machine Learning

Discussion about this episode

User's avatar

Ready for more?