The application of data mining techniques and statistical analysis to the sports field has received increasing attention in the last decade. One of the most famous sports in the world is soccer, and the present work deals with it, using data from the 2009/2010 season to the 2015/2016 season from nine European leagues extracted from the Kaggle European Soccer database. Overall performance indicators of the four roles in a soccer team (forward, midfielder, defender and goalkeeper) for home and away teams are used to investigate the relationships between them and the results of matches, and to predict the wins of the home team. The model used to answer both these demands is the Bayesian Network. This study shows that this model can be very useful for mining the relations between players’ performance indicators and for improving knowledge of the game strategies applied by coaches in different leagues. Moreover, it is shown that the ability to predict match results of the proposed Bayesian Network is roughly the same as that of the Naive Bayes model.
Discovering associations between players’ performance indicators and matches’ results in the European Soccer Leagues
Maurizio Carpita;Silvia Golia
2021-01-01
Abstract
The application of data mining techniques and statistical analysis to the sports field has received increasing attention in the last decade. One of the most famous sports in the world is soccer, and the present work deals with it, using data from the 2009/2010 season to the 2015/2016 season from nine European leagues extracted from the Kaggle European Soccer database. Overall performance indicators of the four roles in a soccer team (forward, midfielder, defender and goalkeeper) for home and away teams are used to investigate the relationships between them and the results of matches, and to predict the wins of the home team. The model used to answer both these demands is the Bayesian Network. This study shows that this model can be very useful for mining the relations between players’ performance indicators and for improving knowledge of the game strategies applied by coaches in different leagues. Moreover, it is shown that the ability to predict match results of the proposed Bayesian Network is roughly the same as that of the Naive Bayes model.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.