Case Study: FiveThirtyEight’s Elo Beats 98% of Forecasters in NFL Forecasting Game

An in-depth look at how FiveThirtyEight’s Elo algorithm outperformed over 98% of human participants in the NFL Forecasting Game, showcasing AI’s predictive power.

Chart showing Elo algorithm performance against human forecasters

FiveThirtyEight's Elo Beats 98% of Forecasters

Since 2014, FiveThirtyEight's NFL Forecasting Game invites readers to assign win probabilities to upcoming matchups, challenging them to outpredict an algorithmic model built on Elo ratings. This annual contest pits collective human intuition against a transparent, data-driven system to see who forecasts winners more accurately.

Background & Challenge

Since 2014, FiveThirtyEight's NFL Forecasting Game invites readers to assign win probabilities to upcoming matchups, challenging them to outpredict an algorithmic model built on Elo ratings. This annual contest pits collective human intuition against a transparent, data-driven system to see who forecasts winners more accurately.

The Elo Algorithm in Action

Elo assigns every NFL team a power rating—centered around 1500—and updates those ratings after each game based on outcome, opponent strength, home-field advantage, quarterback changes, and rest days. These ratings are transformed into win probabilities, providing a consistent, interpretable forecast for every matchup.

Dynamic Rating System

Team ratings continuously update after each game, incorporating opponent strength, home-field advantage, and situational factors.

Transparent Methodology

Unlike black-box models, Elo's simple, interpretable algorithm provides clear insights into team strength assessments.

Developer vs. Crowd: 2018 Challenge

In the 2018 NFL Forecasting Game, 20,352 participants submitted their predictions throughout the season—but fewer than 2% managed to outperform the Elo model. That placed Elo itself 432nd out of all forecasters, demonstrating that a simple, transparent algorithm beat the vast majority of human competitors.

20,352 Participants

Massive participation in the 2018 challenge created a robust sample size for comparing human vs. algorithmic forecasting.

98% Outperformed

Fewer than 2% of human forecasters managed to beat the algorithmic Elo model throughout the season.

Accuracy Metrics & Key Statistics

Across 798 games since the game's inception, Elo's favorite pick performed exceptionally: out of 527 favorite predictions, 519 resulted in wins (with one tie), yielding a 98.5% success rate on its favorite selections.

98.5% Success Rate

Out of 527 favorite predictions, 519 resulted in wins—dramatically outperforming typical NFL favorite win rates of ~65%.

798 Games Analyzed

Long-term performance across nearly 800 games demonstrates consistent algorithmic superiority over human judgment.

Lessons & Applications for Sports Betting

This case study underscores that transparent, data-driven models can consistently outperform collective human judgment in probabilistic forecasting. Key success factors include a robust rating system, continuous calibration, and a simple, interpretable algorithm that rapidly integrates new information.

Data Beats Intuition

Systematic data processing consistently outperforms human gut feelings and crowd wisdom in sports forecasting.

Continuous Calibration

Regular model updates and performance calibration ensure sustained accuracy over changing competitive landscapes.

The Future of Algorithmic Sports Prediction

FiveThirtyEight's Elo model not only beat 98% of human forecasters but also illustrates the lasting value of algorithmic approaches in sports prediction. By leveraging comprehensive data and proven rating methodologies, platforms can harness AI's predictive power to deliver reliable, actionable insights for modern sports betting.

Experience Algorithm-Driven Predictions

Ready to leverage proven algorithmic approaches for your sports betting strategy? See how data-driven models consistently outperform human intuition.