4.7 Article

The Open International Soccer Database for machine learning

Journal

MACHINE LEARNING
Volume 108, Issue 1, Pages 9-28

Publisher

SPRINGER
DOI: 10.1007/s10994-018-5726-0

Keywords

Open International Soccer Database; 2017 Soccer Prediction Challenge; Open science; Soccer analytics

Funding

  1. KU Leuven Research Fund [C14/17/070, C22/15/015, C32/17/036]
  2. FWO-Vlaanderen [SBO-150033]
  3. Interreg V A project NANO4Sports

Ask authors/readers for more resources

How well can machine learning predict the outcome of a soccer game, given the most commonly and freely available match data? To help answer this question and to facilitate machine learning research in soccer, we have developed the Open International Soccer Database. Version v1.0 of the Database contains essential information from 216,743 league soccer matches from 52 leagues in 35 countries. The earliest entries in the Database are from the year 2000, which is when football leagues generally adopted the three points for a win rule. To demonstrate the use of the Database for machine learning research, we organized the 2017 Soccer Prediction Challenge. One of the goals of the Challenge was to estimate where the limits of predictability lie, given the type of match data contained in the Database. Another goal of the Challenge was to pose a real-world machine learning problem with a fixed time line and a genuine prediction task: to develop a predictive model from the Database and then to predict the outcome of the 206 future soccer matches taking place from 31 March 2017 to the end of the regular season. The Open International Soccer Database is released as an open science project, providing a valuable resource for soccer analysts and a unique benchmark for advanced machine learning methods. Here, we describe the Database and the 2017 Soccer Prediction Challenge and its results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available