Soccer Data Evolution: From Manual Logging to Agentic AI
Soccer data analytics has transitioned from manual event logging to AI-driven predictive modeling, with Stats Perform and its Opta division leading the integration of agentic AI for the 2026 FIFA World Cup. This shift enables real-time tactical insights and automated content generation, replacing rudimentary metrics with proprietary data models that quantify player performance and match probability.
How did Opta transform soccer statistics?
Opta began as a marketing project by Opta Consulting, a group formed by former directors of Coopers Lybrand, to replicate cricket rating systems for soccer. According to the company’s history, sales executive Suzi Russell collaborated with the Premier League, Sky Sports, and former England assistant manager Don Howe to create the Opta Index. In the late 1990s, analysts manually logged goals, cards, and corners from match footage, often spending six hours on a single game.
The system gained traction through Sky’s Monday Night Football and The Observer, primarily appealing to fantasy soccer players. By 2006, Opta introduced computerized logging to provide real-time data. The company’s growth led to its acquisition by Perform Group in 2013 for £40 million, eventually merging with US-based Stats to become Stats Perform in 2019.
Why is Expected Goals (xG) more accurate than total shots?
Expected Goals (xG) provides a quality-based assessment of attacking opportunities, contrasting with traditional shot counts that treat all attempts equally. According to Stats Perform, xG differentiates between a speculative 30-yard effort and a high-probability shot from six yards out. This metric uses historical data and computer vision technology to assign a probability value to every shot, allowing coaches and analysts to identify underperforming strikers or efficient attacking patterns that raw totals hide.

Patrick Lucey, chief scientist at Stats Perform, notes that the rise of data literacy among fans was accelerated by the movie Moneyball and the prevalence of statistics in video games. Lucey states that modern analysis focuses on player roles and formations rather than simple totals like yellow cards or corner counts.
What role will AI play in the 2026 FIFA World Cup?
The 2026 World Cup in North America will use AI to manage an expanded 104-match schedule and bridge time-zone gaps for global audiences. The BBC plans to use AI-driven metrics to create content for fans unable to watch matches live, according to company projections. This follows the BBC’s use of AI metrics during UEFA Euro 2024 to provide immediate context across live broadcasts and social media.
Stats Perform is developing a sports tech platform that pairs Opta’s proprietary historical database with large language models (LLMs). Patrick Lucey explains that while general AI tools like ChatGPT often “hallucinate” because they rely on public internet data, proprietary enterprise data allows for accurate, conversational responses tailored to the specific language of sports.
How does agentic AI differ from standard LLMs in sports?
Agentic AI differs from standard LLMs by its ability to autonomously break down complex queries into multiple tasks and execute them, such as writing code or generating a video highlight. While a standard LLM can describe a player’s season, agentic AI can run long-term simulations to predict how a season’s outcome might change if specific players are removed from a squad, according to Lucey.
This capability reduces the reliance on human resources for live reporting. Agentic AI can automate the creation of reports and graphics in real-time, which is critical for broadcasters and bookmakers operating in live environments where manual analysis is too slow.
Comparison: The Evolution of Soccer Data Collection
| Feature | Early Opta Era (Late 90s) | Modern AI Era (2024-2026) |
|---|---|---|
| Collection Method | Manual logging from tapes | Computer vision & automated tracking |
| Processing Time | Overnight via single computer | Real-time cloud processing |
| Primary Metrics | Corners, Yellow Cards, Goals | xG, Player Positioning, Agentic Simulations |
| Delivery | Newspapers & Matchday programs | AI-generated highlights & Conversational LLMs |
Frequently Asked Questions
Who owns Opta?
Opta is part of Stats Perform, a global sports data company formed by the 2019 merger of Perform Group and Stats.

Can coaches use AI for tactics?
Yes. Laura Harvey, coach of the NWSL’s Seattle Reign, has acknowledged using AI tools for tactical analysis ahead of fixtures.
What is the main limitation of general AI in soccer?
General LLMs lack access to proprietary, deep-level match data, which often leads to factual errors or “hallucinations” regarding specific player stats or match events.