TY - JOUR
T1 - Predicting the NCAA basketball tournament using isotonic least squares pairwise comparison model
AU - Neudorfer, Ayala
AU - Rosset, Saharon
N1 - Publisher Copyright:
© 2018 Walter de Gruyter GmbH. All rights reserved.
PY - 2018
Y1 - 2018
N2 - Each year, millions of people fill out a bracket to predict the outcome of the popular NCAA men's college basketball tournament, known as March Madness. In this paperwe present a new methodology for team ranking and use it to predict the NCAA basketball tournament.We evaluate our model in Kaggle'sMarchMachine LearningMania competition, inwhich contestantswere required to predict the results of all possible games in the tournament. Our model combines two methods: The least squares pairwise comparison model and isotonic regression. We use existing team rankings (such as seeds, Sagarin and Pomeroy ratings) and look for a monotonic, non-linear relationship between the ranks' differences and the probability to win a game. We use the isotonic property to get new rankings that are consistent with both the observed outcomes of past tournaments and previous knowledge about the order of the teams. In the 2016 and 2017 competitions, submissions based on our methodology consistently placed in the top 5% out of over 800 other submissions. Using simulations, we show that the suggested model is usually better than commonly used linear and logistic models that use the same variables.
AB - Each year, millions of people fill out a bracket to predict the outcome of the popular NCAA men's college basketball tournament, known as March Madness. In this paperwe present a new methodology for team ranking and use it to predict the NCAA basketball tournament.We evaluate our model in Kaggle'sMarchMachine LearningMania competition, inwhich contestantswere required to predict the results of all possible games in the tournament. Our model combines two methods: The least squares pairwise comparison model and isotonic regression. We use existing team rankings (such as seeds, Sagarin and Pomeroy ratings) and look for a monotonic, non-linear relationship between the ranks' differences and the probability to win a game. We use the isotonic property to get new rankings that are consistent with both the observed outcomes of past tournaments and previous knowledge about the order of the teams. In the 2016 and 2017 competitions, submissions based on our methodology consistently placed in the top 5% out of over 800 other submissions. Using simulations, we show that the suggested model is usually better than commonly used linear and logistic models that use the same variables.
KW - least squares pairwise comparison
KW - multivariate isotonic regression
KW - ranking
UR - http://www.scopus.com/inward/record.url?scp=85054280376&partnerID=8YFLogxK
U2 - 10.1515/jqas-2018-0039
DO - 10.1515/jqas-2018-0039
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85054280376
SN - 1559-0410
VL - 14
SP - 173
EP - 183
JO - Journal of Quantitative Analysis in Sports
JF - Journal of Quantitative Analysis in Sports
IS - 4
ER -