The application of data mining techniques and statistical analysis to the sports field has received increasing attention in the last decade. One of the most famous sports in the world is soccer, and the present work deals with it, using data from the 2009/2010 season to the 2015/2016 season from nine European leagues extracted from the Kaggle European Soccer database. Overall performance indicators of the four roles in a soccer team (forward, midfielder, defender and goalkeeper) for home and away teams are used to investigate the relationships between them and the results of matches, and to predict the wins of the home team. The model used to answer both these demands is the Bayesian Network. This study shows that this model can be very useful for mining the relations between players’ performance indicators and for improving knowledge of the game strategies applied by coaches in different leagues. Moreover, it is shown that the ability to predict match results of the proposed Bayesian Network is roughly the same as that of the Naive Bayes model.

Discovering associations between players’ performance indicators and matches’ results in the European Soccer Leagues

Maurizio Carpita;Silvia Golia
2021-01-01

Abstract

The application of data mining techniques and statistical analysis to the sports field has received increasing attention in the last decade. One of the most famous sports in the world is soccer, and the present work deals with it, using data from the 2009/2010 season to the 2015/2016 season from nine European leagues extracted from the Kaggle European Soccer database. Overall performance indicators of the four roles in a soccer team (forward, midfielder, defender and goalkeeper) for home and away teams are used to investigate the relationships between them and the results of matches, and to predict the wins of the home team. The model used to answer both these demands is the Bayesian Network. This study shows that this model can be very useful for mining the relations between players’ performance indicators and for improving knowledge of the game strategies applied by coaches in different leagues. Moreover, it is shown that the ability to predict match results of the proposed Bayesian Network is roughly the same as that of the Naive Bayes model.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11379/530117
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 9
social impact