This study investigates the effectiveness of machine learning algorithms, including Neural Networks, Bayesian Networks, Support Vector Machines, and Random Forests, in predicting football match outcomes using data from the English Premier League (2018–2022).By incorporating user-generated probabilities for home win, away win, and draw alongside conventional features, the models were evaluated under binary and multi-class classification scenarios. The Support Vector Machine achieved the highest accuracy (69%) in the win-loss scenario, while the Neural Network reached 51% in the win-draw-loss scenario. Results indicate that user-derived features enhance predictive performance, though user predictions show a bias toward home teams, especially in uncertain cases. These findings highlight the potential of integrating user perspectives into predictive modeling and underscore the importance of addressing cognitive bias in sports analytics.
Ghasemi et al. (Thu,) studied this question.