Article

Predicting Seasonal Performance in Professional Sport: A 30-Year Analysis of Sports Illustrated Predictions

Department of Kinesiology and Health Science, York University, Toronto, ON M3J 1P3, Canada
* Author to whom correspondence should be addressed.
Submission received: 14 October 2021 / Revised: 24 November 2021 / Accepted: 25 November 2021 / Published: 1 December 2021

Abstract

In 2017, Sports Illustrated (SI) made headlines when their remarkable 2014 prediction that the Houston Astros (a team then ranked near the bottom of its Major League Baseball (MLB) division) would win the World Series came true. The less-publicised story was that in 2017, SI predicted the Los Angeles Dodgers to win the MLB title. Assessing the forecasting accuracy of experts is critical as it explores the difficulty and limitations of forecasts and can help illuminate how predictions may shape sociocultural notions of sport in society. To thoroughly investigate SI’s forecasting record, predictions were collected from the four major North American sporting leagues (the National Football League, National Basketball Association, Major League Baseball, and National Hockey League) over the last 30 years (1988–2018). Kruskal–Wallis H tests and Mann–Whitney U tests were used to evaluate the absolute and relative accuracy of predictions. Results indicated that SI had the greatest predictive accuracy in the National Basketball Association and was significantly more likely to correctly predict divisional winners than conference and league champions. Future work in this area may seek to examine multiple media outlets to gain a more comprehensive perspective on forecasting accuracy in sport.

1. Introduction

In 2014, Sports Illustrated (SI) made the lofty prediction that the Houston Astros (a team in one of the lowest MLB divisional rankings) would win the 2017 World Series. Three years later, to the public’s surprise, this prediction came true. The media marvelled at this occurrence, giving credit to SI’s predictive accuracy [1,2]. The less-publicised story, however, was that the team SI predicted in 2017 to win the 2017 championship was the Los Angeles Dodgers. Surprisingly, SI was praised for the accuracy of their 2014 prediction, not criticised for changing that prediction to one that was incorrect in the same year. The original prediction was especially odd since the Astros were at the bottom of the league standings when the prediction was made. It is possible that the bizarreness of this prediction (moving from last to first place) may have influenced how it was perceived and evaluated [3,4,5,6]. However, these differing forecasts highlight the complexities of talent identification and prediction decisions in sport. In this study, we examine the accuracy of SI’s pre-season predictions across the four major North American professional sports leagues—Major League Baseball (MLB), the National Football League (NFL), the National Basketball Association (NBA), and the National Hockey League (NHL)—over the last 30 years.
Since the turn of the century, the sports industry has seen a rise in the use of analytics [7]. A large catalyst for this shift was the Major League Baseball (MLB) team Oakland Athletics’ ‘Moneyball’ movement, in which the team experienced tremendous success (including an American League record 20-game winning streak) despite financial restrictions, by using objective measures and statistics [8]. This phenomenon raised awareness of the value of using quantitative approaches to minimise the biases of more subjective approaches to decision-making. Currently, computer technology and statistical acumen occupy a growing role in the development of accurate forecasting models, and although expert opinions have been evaluated in fields like meteorology [9] and finance [10], there has been less attention to the accuracy of experts in predicting sport outcomes [11,12]. Recent years, however, have brought new attention to evaluating expert and non-expert (also referred to as ‘crowd’) forecasts in sports. Specifically, recent research includes (i) evaluating expert forecasters over an extended period of time [13,14], (ii) evaluating crowds of tipsters (or semi-expert forecasters) [15,16], (iii) evaluating the informational content of sports experts through their contributions on social media [17], and (iv) evaluating semi-expert judgments regarding sporting predictions to improve forecast accuracy [18]. Assessing the forecasting accuracy of ‘experts’ is critical as it explores the difficulty and limitations of forecasts and challenges us to re-think notions of athlete development and the path to elite performance.
Evidence comparing the accuracy of expert opinions and computer-generated models is mixed. On the one hand, there is a substantial body of literature showing the benefits of removing the ‘human element’ from the decision-making process [8,19,20], while on the other hand, there is evidence highlighting the importance of the human ‘eye’ in judgements [21,22]. For example, computer-based models made more correct forecasts, on average, at the 2003 Rugby Union World Cup than human experts [23]. However, certain human experts were more accurate than the computer-based models, with one individual predicting 46 of 48 matches correctly. Perhaps the greater average accuracy of these computer-generated models can be attributed to the greater agreement of predictions across the different systems, as such consensus is associated with more accurate predictions [24,25,26]. The capacity of computerised models to analyse data consistently and efficiently is certainly a strength of these systems [24].
Human judgements are often prone to biases, which can result in overconfidence by experts [27]. Such overconfidence, coupled with the public’s tendency to overlook the numerous inaccurate forecasts and celebrate the rare accurate ones, forms the basis of the bizarreness effect [3,4,5,6] and inflates claims about the accuracy of expert forecasts in sport [28]. This explains why bold predictions capture the public’s attention and disrupt memory of other events [6], as exemplified by SI’s Houston Astros prediction and the resulting impact on the public’s perception of SI’s predictive accuracy [1,2].
With greater access to performance statistics and other information than ever before, a relevant question is whether this surplus of data helps or hinders human forecasts [10,29,30,31]. Coincident with this increased access to data, there has been a greater focus on the value of ‘simple heuristics’ for improving the accuracy of decision-making and forecasting [32]. For instance, several studies have explored the availability heuristic—a phenomenon whereby a well-recognised competitor (i.e., one that is most readily ‘available’ in our minds) is judged more likely to win than a less recognised one [10,31,32]. This work suggests that having less information may be more beneficial when making decisions under uncertainty, although results in this area have been mixed [11,29,30,33]. The sociocultural implications and the way sport is constructed in society can thus influence individuals when making forecasts.
Despite the evidence presented above that may question the validity of experts, individuals continue to rely on expert opinions [30], presumably, at least in part, due to experts’ access to useful (and often proprietary) information. However, it is important to critically examine the forecasting records of ‘experts’ to evaluate whether the public should continue to deem these predictions valuable. Furthermore, given that their ideas are released on behalf of large, seemingly reputable organisations, sports media outlets such as SI have the propensity to shape overall public opinion about the value and accuracy of sports forecasting. Given these implications, the predictive accuracy of popular media outlets is an area ripe for further exploration. SI is among the most distinguished sports magazines worldwide. Their inaugural issue in 1954 featured “Scouting Reports”, and their seasonal performance forecasts have continued to grow and evolve ever since (i.e., in frequency, variables included, and time to prediction). A focused examination of SI’s predictive accuracy provides an opportunity for a unique case study of predictive accuracy in professional sports over a longer time frame than is possible in events such as the World Cup of Soccer [34], World Cup of Rugby [10,23], and Wimbledon Tennis Championship [31]. One of the few studies to examine the predictive accuracy of season-long MLB forecasts found that SI and the New York Times made more accurate predictions than random guesses, except for the National League Conference winners [35]. Although informative, this study deserves re-assessment, as the MLB has been expanded and re-aligned since this 1994 study. Therefore, the objective of this study was to assess the accuracy of SI’s pre-season forecasts of the divisional standings, conference standings, and championship winners of the four main North American sport leagues (the NFL, NBA, MLB, and NHL) over the past 30 years.

2. Materials and Methods

2.1. Procedures

Quantitative content analysis of the popular magazine SI was used as the primary research method. Content analysis refers to the objective, systematic, and quantitative analysis of communications content [36]. Pre-season predictions made by SI between 1988 and 2018 were collected from the magazine’s online repository, which consists of digitised versions of the paper magazines, from SI’s inaugural 1954 issue to the latest issues from 2021. This archive can be freely accessed at https://vault.si.com/vault/archives (accessed on 14 August 2021). All four major North American sporting leagues (the NFL, NBA, MLB, and the NHL) were examined, and predictions included the championship winner, runner-up, and score (for the NFL only), as well as divisional standings and conference winners where available. In years where SI did not explicitly state their prediction, inferences were made for each division based on their rankings of the conference (i.e., the team in each division with the highest predicted conference ranking was inferred to be the predicted divisional winner), as sketched below. This method was used for 8.1% of the data. Predictions were then compared to official results for the respective leagues. Throughout the process, the number of teams within each league, conference, and division was noted. Some data (13.7%) were missing from the dataset for reasons such as league lockouts, missing pages in the magazines, or unclear predictions.
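For illustration, a minimal Python sketch of this inference rule is shown below (team and division names are hypothetical; the actual predictions were extracted by hand from the archive):

```python
from collections import defaultdict

# Hypothetical predicted conference rankings (1 = best) and division labels.
predicted_conference_rank = {"Team A": 1, "Team B": 4, "Team C": 2, "Team D": 7}
division_of = {"Team A": "East", "Team B": "East", "Team C": "West", "Team D": "West"}

def infer_divisional_winners(ranks, divisions):
    """For each division, return the team with the best (numerically lowest)
    predicted conference ranking as the inferred divisional winner."""
    by_division = defaultdict(list)
    for team, rank in ranks.items():
        by_division[divisions[team]].append((rank, team))
    return {div: min(teams)[1] for div, teams in by_division.items()}

print(infer_divisional_winners(predicted_conference_rank, division_of))
# {'East': 'Team A', 'West': 'Team C'}
```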

2.2. Data Analysis

Several, largely descriptive, approaches were taken in the analyses. Data were analysed using the Statistical Package for the Social Sciences (SPSS). Two areas of interest were highlighted: the absolute accuracy of SI’s predictions and the relative accuracy of their predictions (i.e., exactly how far off the experts were when they were incorrect). Initially, a simple binary score was calculated to determine the level of overall agreement for the predictions (i.e., was the prediction correct? Yes/No). The total number of correct forecasts within each division, conference, and league was tallied, and the average percentage of correct predictions was then calculated for each domain to yield a comparable value (i.e., this number considered the total predictions made and controlled for elements such as the changing structural compositions of each league, lockouts, missing data, and unclear predictions). Subsequently, comparisons between predicted and actual results for the winner and runner-up provided information on relative accuracy. Predicted vs. actual results were calculated via discrepancy scores, where the team’s predicted standing was subtracted from their actual standing. Thus, discrepancy scores closer to zero were associated with greater relative accuracy (i.e., the team’s actual finish was close to where SI predicted they would finish). Since the playoff rounds yielded ambiguous final standings (i.e., at the conclusion of each round, numerous teams are eliminated at once), a relative scale was created to clarify the actual results for the winner and runner-up predictions. The scale was as follows: 1 = Won Championship, 2 = Lost in Championship, 3 = Lost in Third Round, 4 = Lost in Second Round, 5 = Lost in First Round, 6 = Did Not Qualify for Playoffs. The average discrepancy scores for each division, conference, and league were then calculated, permitting comparisons between these categories.
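As a concrete illustration, the sketch below (Python; teams and outcomes are hypothetical) computes both measures using the 1–6 scale. The sign convention shown (predicted standing minus actual standing) is our assumption, chosen so that scores come out negative when a team finishes worse than predicted, consistent with the negative median scores reported in the Results.

```python
# Playoff scale from the Methods: 1 = Won Championship ... 6 = Did Not Qualify.
PLAYOFF_SCALE = {
    "won_championship": 1, "lost_in_championship": 2, "lost_third_round": 3,
    "lost_second_round": 4, "lost_first_round": 5, "missed_playoffs": 6,
}

# Hypothetical predictions and outcomes on the scale.
predicted = {"Team A": 1, "Team B": 2}
actual = {"Team A": PLAYOFF_SCALE["lost_first_round"],
          "Team B": PLAYOFF_SCALE["won_championship"]}

# Absolute accuracy: binary agreement, averaged into percent correct.
percent_correct = sum(predicted[t] == actual[t] for t in predicted) / len(predicted)

# Relative accuracy: discrepancy score; values nearer zero indicate the
# team's actual finish was close to where SI predicted it would finish.
discrepancy = {t: predicted[t] - actual[t] for t in predicted}
print(percent_correct, discrepancy)  # 0.0 {'Team A': -4, 'Team B': 1}
```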
As expected, the data violated the assumption of normality required for parametric statistics. Therefore, non-parametric Mann–Whitney U and Kruskal–Wallis H tests were used to compare groups. The exact significance level is reported where applicable, and Monte Carlo estimation was used as a substitute when computational limits did not allow the exact significance to be calculated. For Kruskal–Wallis H tests that met the significance level of 0.05, Mann–Whitney U tests were used as ‘post hoc’ tests to further investigate the finding.
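Although the original analyses were run in SPSS, the same comparisons can be reproduced with open-source tools; below is a minimal SciPy sketch with hypothetical discrepancy scores.

```python
from scipy import stats

group_a = [-4, -2, 0, -1, -3, 0]
group_b = [-1, 0, 0, -2, 1, 0]
group_c = [-5, -3, -4, -2, -6, -1]

# Mann-Whitney U for two groups; SciPy picks an exact or asymptotic p-value
# automatically, loosely analogous to SPSS's exact/Monte Carlo options.
u_stat, p_u = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")

# Kruskal-Wallis H for three or more groups; a significant H would be
# followed by pairwise Mann-Whitney U tests as 'post hoc' comparisons.
h_stat, p_h = stats.kruskal(group_a, group_b, group_c)
print(u_stat, p_u, h_stat, p_h)
```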
Due to the exploratory nature of this study, we used two-tailed tests for all comparisons. Furthermore, a Bonferroni correction was applied to our alpha level to reduce the likelihood of Type I errors. However, given the small sample sizes, we focused on measures of effect size over indicators of statistical significance. Effect sizes are reported for the Mann–Whitney U tests as Pearson’s correlation coefficient (r) and for the Kruskal–Wallis tests as epsilon-squared (E²R) [37]. For the purposes of this study, effect sizes below 0.2 were categorised as small, those from 0.2 to 0.5 as moderate, and those above 0.5 as large.
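Both effect sizes are simple to compute from the test statistics; the sketch below follows the formulas in Tomczak and Tomczak [37] (r = Z/√N for Mann–Whitney; E²R = H(n + 1)/(n² − 1), equivalently H/(n − 1), for Kruskal–Wallis), with hypothetical statistics and sample sizes.

```python
import math

def mann_whitney_r(u, n1, n2):
    """r = Z / sqrt(N), with Z from the normal approximation of U
    (tie correction omitted in this sketch)."""
    n = n1 + n2
    z = (u - n1 * n2 / 2) / math.sqrt(n1 * n2 * (n + 1) / 12)
    return z / math.sqrt(n)

def epsilon_squared(h, n):
    """Epsilon-squared for Kruskal-Wallis: H * (n + 1) / (n**2 - 1)."""
    return h * (n + 1) / (n ** 2 - 1)

print(round(mann_whitney_r(u=120, n1=30, n2=30), 2))  # -0.63, a large effect
print(round(epsilon_squared(h=16.61, n=240), 3))      # 0.069, a small effect
```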
The variables (number of correct predictions, average percentage of correct predictions, and average discrepancy score) were also totalled for the divisions and conferences within each league to provide a comparison between leagues (i.e., to determine whether SI made the most correct forecasts in the NFL, NBA, MLB, or NHL). For the winner and runner-up predictions for each league, we used Kruskal–Wallis H tests to compare across three time brackets (with approximately the same number of years in each bracket) to determine changes in the percentage of correct predictions (hereinafter referred to as “percent correct”) and discrepancy score. For both NFL predictions, both NBA predictions, and the NHL champion prediction, the time brackets were as follows: 1988–1997 vs. 1998–2007 vs. 2008–2018. SI made fewer predictions for the MLB champion and runner-up and for the NHL runner-up; as a result, their time brackets were as follows: 1996–2003 vs. 2004–2011 vs. 2012–2018. For the discrepancy scores, a Jonckheere–Terpstra test of trend was chosen to examine changes across these time brackets (a sketch of the statistic follows).
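SciPy does not ship a Jonckheere–Terpstra test, so the sketch below computes the J statistic directly for ordered time brackets (scores are hypothetical); in practice, the p-value is obtained from a normal approximation or a permutation test on J.

```python
from itertools import combinations

def jonckheere_j(*ordered_groups):
    """Sum, over every ordered pair of groups (earlier, later), of the
    number of value pairs with earlier < later; ties count 0.5. A J far
    from its chance expectation indicates a monotone trend across groups."""
    j = 0.0
    for early, late in combinations(ordered_groups, 2):
        for x in early:
            for y in late:
                j += 1.0 if x < y else (0.5 if x == y else 0.0)
    return j

# Hypothetical discrepancy scores for three ordered brackets,
# e.g., 1996-2003, 2004-2011, 2012-2018.
print(jonckheere_j([-2, -3, -1], [-5, -4, -3], [-6, -5, -4]))  # 2.5
```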
Due to changes in the sports leagues over the past 30 years, we attempted to standardise the accuracy values relative to the number of teams in each division, conference, and league. The average number of teams per division, conference, and league was calculated over the 30-year time frame, and these values were taken into account when examining the number of correct predictions and the average percentage of correct predictions in each respective category.
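The exact form of this adjustment is not spelled out here, but one plausible standardisation (a sketch, assuming a uniform random-guess baseline of 1/N) is to compare percent correct against chance; the illustrative values below follow the MLB rows of Table 2.

```python
# Average team counts and percent correct at each level (MLB, from Table 2).
avg_teams = {"division": 5.2, "conference": 14.5, "league": 29.0}
percent_correct = {"division": 0.5, "conference": 0.2, "league": 0.1}

for level, n in avg_teams.items():
    chance = 1.0 / n  # probability of a correct pick by uniform random guessing
    print(f"{level}: {percent_correct[level] / chance:.1f}x chance")
# division: 2.6x chance; conference: 2.9x chance; league: 2.9x chance
```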

3. Results

Results from the Kruskal–Wallis and Mann–Whitney U tests are presented below and in Table 1. Descriptive statistics for each league are presented below (as well as in Table 2), followed by analyses comparing leagues (Table 3 and Table 4) and trends over time (Table 5, Table 6 and Table 7).

3.1. NFL

NFL league-wide data (i.e., winner vs. runner-up) revealed medium-sized effects, but only the discrepancy scores displayed significant differences (see Table 1). Comparing the American Football Conference (AFC) to the National Football Conference (NFC) yielded medium effect sizes, with NFC discrepancy scores significantly further from zero than those of the AFC. Interestingly, the percent correct and discrepancy scores for the NFC did not significantly differ from the league winner category, despite the NFC having half as many teams as the entire NFL. Divisionally, non-significant small effect sizes were found when examining the AFC and NFC divisions, as well as when comparing the AFC to the NFC. Furthermore, as the average number of teams increased, percent correct decreased and discrepancy scores moved further from zero (i.e., SI was less accurate with larger groups). Divisional predictions showed significantly higher percent correct and discrepancy scores closer to zero than conference and league-wide data. Moderate and small-to-moderate effect sizes were found when examining divisional vs. league-wide and conference-wide predictions, respectively.
Examining league winner forecasts over the last 30 years revealed small effect sizes for both percent correct and discrepancy scores, although none of these effects met the criteria for statistical significance (see Table 5). Additionally, the Jonckheere–Terpstra Test of Trend revealed small effect sizes and no discernible trends (see Table 7).

3.2. NBA

The NBA data (see Table 1) revealed non-significant small effects between the league winner and runner-up predictions. Small effect sizes were also seen when comparing the predictions for the Eastern Conference and Western Conference Champions. Non-significant small effects were also found when examining the NBA divisions, indicating that the division being predicted had little relation to SI’s resulting accuracy. SI was, however, significantly more likely to correctly predict divisional winners than league and conference champions. Specifically, moderate and small-to-moderate effect sizes were found when comparing SI’s accuracy in forecasting divisional champions to league and conference champions, respectively.
A small-to-moderate effect size was found when examining the discrepancy scores of NBA Champion predictions over the last 30 years. Significant differences and a large effect size were found between the 1988–1997 and 1998–2007 time brackets (see Table 5), where the latter had a median discrepancy score further from zero (see Table 6). A moderate effect size was found when comparing 1988–1997 vs. 2008–2018 and when comparing 1998–2007 vs. 2008–2018. Despite this, the Jonckheere–Terpstra test indicated there were no significant trends (see Table 7).

3.3. MLB

Discrepancy scores of the MLB league winner and runner-up were significantly different, with a large effect size (Table 1). At the conference level, non-significant small effects were noted between the American League (AL) and the National League (NL) winners. Divisionally, non-significant small effects were found, denoting that SI had no more success in one division than another. SI had significantly more success (i.e., higher percent correct and discrepancy scores closer to zero) in predicting MLB divisional winners compared to league and conference champions, with moderate-to-large and small-to-moderate effects, respectively. Non-significant small effects were found for the forecasts of league winners compared to conference winners, indicating similar margins of error, despite conferences having half as many teams as the league.
Examining World Series Champion predictions via the Jonckheere–Terpstra test revealed a significant trend over time with a moderate-to-large effect size (see Table 7). Specifically, 1996–2003 had discrepancy scores significantly closer to zero than 2004–2011, with a large effect size between these two time periods. A non-significant small effect size was found between 2004–2011 and 2012–2018, suggesting an initial decrease in predictive accuracy (beginning in 2004) that has since remained consistent. Although no significant difference was found between the discrepancy scores of 1996–2003 and 2012–2018, there was a large effect size, indicating SI’s predictive accuracy decreased over time.

3.4. NHL

Comparing the forecasted winner vs. runner-up in the NHL revealed no significant differences (see Table 1), and effect sizes were small-to-moderate for percent correct and small for discrepancy score. The Eastern Conference had discrepancy scores significantly further from zero than the Western Conference, with a moderate-sized effect. Divisionally, non-significant small effects suggested the division being forecasted had little relation to SI’s predictive accuracy. Furthermore, as the average number of teams decreased, predictive accuracy increased; for example, percent correct was significantly higher and discrepancy scores significantly closer to zero for divisions compared to league-wide predictions, with moderate effect sizes. Comparisons between divisional and conference forecasts indicated a significant moderate effect size for discrepancy score but non-significant small effects for percent correct.
Examining SI’s predictions for the Stanley Cup winner over time (see Table 5) yielded small effects for percent correct and discrepancy scores, implying little correlation between time and SI’s predictive accuracy. Overall, no significant differences were found between time brackets, and the Jonckheere–Terpstra test confirmed there were no distinct trends over time (see Table 7).

3.5. Comparing across Leagues

An analysis combining the league-wide predictions from each league (i.e., NFL championship winner and runner-up vs. NBA championship winner and runner-up, etc.) revealed the NBA had the largest percent correct (see Table 3). Moderate effect sizes and significant differences were revealed between the NBA and NFL, as well as between the NBA and MLB. A non-significant small effect was found when comparing the NBA to the NHL. SI also experienced the greatest relative accuracy in the NBA, with discrepancy scores significantly closer to zero than in the other three leagues and moderate effect sizes. Non-significant small effects were found when comparing the other leagues on both percent correct and discrepancy scores (see Table 3).
The percent correct and discrepancy scores for the divisions and conferences within each league were combined to yield statistics that could be used for comparison across leagues. Generally, at the conference level, small effects and no significant differences were found for percent correct (see Table 3). SI experienced the greatest relative accuracy with the NBA at the conference level. However, the only significant difference was found between the NBA and MLB, and this comparison yielded a moderate effect size.
Divisionally, there were significant differences in percent correct and discrepancy scores (see Table 3), although effect sizes were small. Again, SI’s divisional NBA predictions had significantly higher percent correct than divisional predictions for the NFL and NHL. SI also experienced the greatest relative accuracy with their divisional NBA predictions, which had discrepancy scores significantly closer to zero than those of the other leagues. An analysis of all predictions found SI’s overall percent correct was 32.8%.

4. Discussion

Technology and the internet have only recently made it possible to examine forecasts at this scale. Analyses of SI’s predictive accuracy over the last 30 years revealed several intriguing findings. These results help to critically examine the forecasting accuracy of ‘experts’ and how their predictions shape sociocultural notions of sport in society.
First, there was a clear, and unsurprising, trend of increased forecasting accuracy when the average number of teams was lowest (i.e., divisions had significantly higher percent correct and discrepancy scores closer to zero than conference and league-wide predictions). However, non-significant small effect sizes were found between divisions, indicating that the division being forecasted had little relation to SI’s resulting predictive accuracy. This was somewhat surprising, as certain divisions (e.g., the AFC East in football, the AL East in baseball) are known for ‘powerhouse’ organisations that have historically been consistent contenders for divisional championships. Perhaps the emergence of teams in these divisions such as the Tampa Bay Rays (who finished in last place every year from 1998–2007, yet since 2008 have had a winning record in eight seasons) has led to more uncertainty in these perceived ‘easy-to-predict’ divisions. Alternatively, if there is no clear divisional favourite, the small pool of teams to choose from may increase the likelihood of correct predictions. Finally, each league has implemented numerous structural changes over the last 30 years (i.e., divisional adjustments, expansion teams). This may also explain why SI had no more success in predicting the winners of one division compared to another, as continual divisional re-alignments may hinder any emerging trends.

4.1. SI Accuracy Is Greatest in the NBA

At the divisional, conference, and league levels, SI had the greatest predictive success in the NBA. This may reflect how basketball is structured in relation to other sports. For example, with more scoring attempts per game (i.e., due to a ‘shot clock’ that requires teams to shoot the ball within 24 s of gaining possession), the likelihood of random outcomes decreases as the variance in points regresses towards a mean value determined by the ‘talent’ on a given team [38]. Moreover, the NBA playoffs are structured as four rounds of best-of-seven series, and lengthening a playoff series reduces the likelihood of a weaker team ‘upsetting’ the stronger team [39]. In comparison, playoff formats of other leagues rely on single-elimination games (e.g., the NFL) or shorter series (e.g., the MLB). Similarly, NBA rosters possess the fewest athletes, which further reduces uncertainty by limiting the potential for variance (i.e., fewer players to project).
It is also likely that the greater prediction accuracy in the NBA is associated with the league’s financial structure, which differs from other North American leagues in that it operates under a ‘soft’ salary cap (i.e., franchise payrolls are limited, yet some exceptions are permitted). The salary cap was implemented in 1984 with the hope of increasing competitive balance. However, it has been largely ineffective, as the lack of parity in the NBA has remained consistent over the last 30 years [38,40], seemingly making outcomes easier to predict than in other leagues. It has been argued that the large exemptions to the cap have been detrimental to league parity [41]. For example, the NBA is renowned for dynasties such as the Boston Celtics, Chicago Bulls, and more recently the Golden State Warriors. While other leagues also possess dominating organisations, the NBA differs in that its “Larry Bird Exemption” is a clause under the ‘soft’ salary cap that allows teams to re-sign one of their own players without regard to the cap [41,42]. These ‘loopholes’ in the salary cap damage competitive balance, as large-market franchises can match outside offers and keep their star players together for successive seasons [41]. In comparison, the NHL’s revised Collective Bargaining Agreement (CBA), implemented in 2005, has decreased an organisation’s capacity to ‘hold together’ championship teams; no NHL team has since won the Stanley Cup more than two years in a row [42]. Even prior to the new CBA, a team winning the Stanley Cup in more than two consecutive years had happened only twice in the post-expansion era (1967 to present). Similarly, the NFL illustrates greater competitive balance, as no team has yet won three consecutive Super Bowl titles [43]. While the primary mechanisms explaining these effects are unknown, the existence of these trends suggests a range of additional factors that may explain SI’s greater predictive accuracy in the NBA.

4.2. No Changes in Prediction Accuracy over Time

In general, results indicated no significant differences or meaningful effects in SI’s predictive accuracy over time. This was somewhat surprising considering the increased attention given to the use of predictive analytics in short- and long-term sport forecasting. However, despite advancements in analytics over the last 30 years, emphasis has also been placed on increasing competitive balance within leagues in order to attract fans, prevent bankruptcies, and discourage the creation of rival leagues [43,44,45,46]. Evidence suggests that parity has generally improved for most leagues since the 1990–1999 decade [38], and as a consequence, accurate forecasting may have become more difficult (i.e., a more ‘balanced’ league with numerous contenders is theoretically more difficult to predict).
In an effort to achieve competitive balance, leagues have introduced various redistribution mechanisms (e.g., revenue-sharing, reverse-order drafts, free agency, and salary caps). Ultimately, these measures must be considered, as their proposed benefits to competitive balance [38,41,42,46] may influence forecasting ability. However, their effectiveness is largely unknown, as recent increases in competitive balance may not be a direct result of these redistribution mechanisms (i.e., causation cannot be inferred). For example, North American sports also use unbalanced schedules (i.e., teams do not play all opponents the same number of times), a format that traditionally produces more uncertainty [47] and may also have contributed to greater league parity. Revenue-sharing is among the most traditional redistribution mechanisms, and numerous studies have explored its effects with mixed results [48,49,50,51,52,53,54]. Additionally, the effectiveness of the ‘reverse-order’ draft is largely unknown, as it has been argued that adding only one player may not be impactful [55]. Free agency is another redistribution mechanism implemented by all four leagues, yet it is uncertain whether it helps or hinders predictive accuracy: it allows for player mobility yet may also permit the formation of ‘super-teams’. Finally, salary caps (which are designed to prevent large-market teams from signing all the elite athletes) [56] are another redistribution mechanism that may impact predictive accuracy. While their effects are also ambiguous [38,41,46], the NHL and NFL are argued to have greater parity due to their ‘hard’ salary caps [38] compared to the NBA’s ‘soft’ salary cap and the MLB’s luxury tax (i.e., a surcharge for teams that exceed a pre-determined payroll amount) [41]. Overall, it is unclear how these redistribution mechanisms may have affected the predictive accuracy in our analyses. The possible direct and interactive effects provide a host of intriguing areas for further work.
While SI’s forecasting accuracy remained largely consistent over time, there was a trend of decreasing accuracy in the MLB. Specifically, 1996–2003 had discrepancy scores significantly closer to zero than 2004–2011. Although this suggests that SI’s accuracy in predicting World Series Champions has decreased over time, the dominance of the New York Yankees, who won four league titles during 1996–2003, needs to be considered. Nonetheless, as previously noted, parity in the MLB has generally grown over time [48,57], creating greater difficulty in forecasting the World Series Champion. Other factors may also be especially relevant for MLB forecasts (e.g., performance-enhancing drugs; MLB’s 162-game regular season) but are largely unaccounted for in SI’s pre-season predictions. Last, although SI’s remarkable Houston Astros prediction, made three years in advance, sparked this study, recent evidence revealed unfair play on the Astros’ part [58], and this must be acknowledged as a potential confounding factor in SI’s and our analyses.

4.3. Limitations and Future Research

This study is one of few to examine long-term forecasting accuracy using data from a popular American media outlet. Although interesting results were discovered, the study had some notable limitations. First, 30 years of data, although impressive as a longitudinal data source, provided a relatively small number of data points, some of which were missing due to reasons such as league lockouts, missing pages in the magazine, or SI not making a prediction that year. As a result of this small sample size, the statistical tests conducted were generally crude and conservative, increasing the possibility of Type II error. While the purpose of this research project was to present an initial exploration into SI’s forecasting accuracy, future studies may seek to use more rigorous statistical tests as data become more readily available (i.e., as the forecasting pool grows when SI makes pre-season predictions in subsequent years). In an effort to increase sample size, future studies may also seek to compare the forecasting accuracy of multiple news outlets (i.e., to determine if certain sport media outlets are more successful than others) or compare the predictive accuracy of media outlets to betting odds or other forecasting forums. Finally, the composition of SI’s “team of editors” who made the predictions changed over time, which confounded our ability to discern predictive accuracy over time. Future work could overcome this limitation by examining a single forecaster, for example, a notable sports commentator, over time.
Additionally, the present study could be enhanced by comparing SI’s pre-season predictions to ‘futures’ betting markets or other sporting prediction magazines. Positioning SI’s projections against different measures of across-season competitive balance could help to provide a benchmark for SI’s forecasting abilities and highlight the strengths and weaknesses of each source. Using the futures betting market could be an especially fruitful comparison, as returns could be calculated to see whether following SI’s predictions would have made money over this time frame.
It is also worth considering that forecasting accuracy is a small segment in the entirety of sports culture and mass media communications. The changing dynamics within the sociocultural vehicle of sports and how SI represents those values must be considered when examining their predictive accuracy. Several other studies have analysed periodicals over time and noted how their content is impacted by the time period in which they are published [59,60].
Last, the authors were unable to determine how SI predictions were/are made. For this reason, future work could benefit from understanding whether SI forecasts were the result of one person’s gut instinct (i.e., a tipster/pundit), an aggregation of several people’s judgments, some form of statistical model, or some combination of the above.

5. Conclusions

This study presented an initial exploration into SI’s pre-season forecasting accuracy for the four major North American sport championships over the last 30 years. While results varied across leagues, SI was generally more successful at predicting divisional winners than conference and league champions. Interestingly, in all leagues, SI was not significantly better at predicting certain divisions compared to others. A direct comparison of leagues revealed SI experienced the greatest predictive accuracy in the NBA. Furthermore, an examination of 30 years of predictions revealed no distinct trends over time, although there is some evidence to suggest that forecasting accuracy for the MLB may have decreased. This study suggests that, despite technological advancements, expert opinions still carry some value in modern-day sport forecasting. Findings from this study highlight that expert opinions should be viewed through the sociocultural lens of sport, as they help to shape the way society views sport as a whole. Although further research is required to fully optimise predictive success in North American sporting leagues, this study presents a valuable baseline analysis of SI’s forecasting accuracy.

Author Contributions

Conceptualisation, J.J., K.J., L.F. and J.B.; Methodology, J.J., K.J., L.F. and J.B.; Formal Analysis, J.J., K.J., L.F. and J.B.; Investigation, J.J., K.J., L.F. and J.B.; Writing—Original Draft Preparation, J.J.; Writing—Review & Editing, J.J., K.J., L.F. and J.B.; Visualisation, J.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data can be made available upon request. All requests for data can be made directly to [email protected].

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Kay, S. The Best Tweets about Sports Illustrated’s Prophetic 2014 Astros Cover. Sports Illustrated. 2017. Available online: https://www.si.com/mlb/2017/11/02/sports-illustrated-houston-astros-world-series-prediction-twitter-reaction (accessed on 9 March 2021).
2. Lauletta, T. In 2014, a Writer Predicted the Woeful Houston Astros Would Win the World Series in 2017—Here Is the Story behind That Prediction. Business Insider. 2017. Available online: https://www.businessinsider.com/the-story-behind-the-si-cover-predicting-the-astros-world-series-run-2017-10 (accessed on 27 January 2021).
3. Benson, B. Cognitive Bias Cheat Sheet. Better Humans. 2016. Available online: https://betterhumans.pub/cognitive-bias-cheat-sheet-55a472476b18 (accessed on 3 March 2021).
4. Cipolli, C.; Bolzani, R.; Cornoldi, C.; De Beni, R.; Fagioli, I. Bizarreness effect in dream recall. Sleep 1993, 16, 163–170.
5. McDaniel, M.A.; Einstein, G.O. Bizarre imagery as an effective memory aid: The importance of distinctiveness. J. Exp. Psychol. Learn Mem. Cogn. 1986, 12, 54–65.
6. Worthen, J.B.; Wood, V.V. A disruptive effect of bizarreness on memory for relational and contextual details of self-performed and other-performed acts. Am. J. Psych. 2001, 114, 535–546.
7. Nandakumar, N.; Jensen, S.T. Historical perspectives and current directions in hockey analytics. Annu. Rev. Stat. Appl. 2019, 6, 19–36.
8. Lewis, M. Moneyball: The Art of Winning an Unfair Game; W.W. Norton & Company Inc.: New York, NY, USA, 2003.
9. Hoffman, R.R.; LaDue, D.S.; Mogil, H.M.; Trafton, J.G.; Roebber, P.J. Minding the Weather: How Expert Forecasters Think; MIT Press: Cambridge, MA, USA, 2017.
10. Sherden, W.A. The Fortune Sellers: The Big Business of Buying and Selling Predictions; Wiley: New York, NY, USA, 1998.
11. Andersson, P.; Ekman, M.; Edman, J. Forecasting the Fast and Frugal Way: A Study of Performance and Information-Processing Strategies of Experts and Non-Experts When Predicting the World Cup 2002 in Soccer. Working Paper Series in Business Administration, No. 9; 2003. Available online: https://swoba.hhs.se/hastba/papers/hastba2003_009.pdf (accessed on 3 March 2021).
12. Stekler, H.O.; Sendor, D.; Verlander, R. Issues in sports forecasting. Int. J. Forecast. 2010, 26, 606–621.
13. Butler, D.; Butler, R.; Eakins, J. Expert performance and crowd wisdom: Evidence from English Premier League predictions. Eur. J. Oper. Res. 2021, 288, 170–182.
14. Reade, J.; Singleton, C.; Brown, A. Evaluating strange forecasts: The curious case of football match scorelines. Scott. J. Polit. Econ. 2021, 68, 261–285.
15. Brown, A.; Reade, J.J. The wisdom of amateur crowds: Evidence from an online community of sports tipsters. Eur. J. Oper. Res. 2019, 272, 1073–1081.
16. Singleton, C.; Reade, J.; Brown, A. Going with your gut: The (In)accuracy of forecast revisions in a football score prediction game. J. Behav. Exp. Econ. 2020, 89, 101502.
17. Brown, A.; Rambaccussing, D.; Reade, J.J.; Rossi, G. Forecasting with social media: Evidence from Tweets on soccer matches. Econ. Inq. 2018, 56, 1748–1763.
18. Peeters, T. Testing the wisdom of crowds in the field: Transfermarkt valuations and international soccer results. Int. J. Forecast. 2018, 34, 17–29.
19. Den Hartigh, R.J.; Niessen, A.S.M.; Frencken, W.G.; Meijer, R.R. Selection procedures in sports: Improving predictions of athletes’ future performance. Eur. J. Sport Sci. 2018, 18, 1191–1198.
20. Grove, W.M.; Zald, D.H.; Lebow, B.S.; Snitz, B.E.; Nelson, C. Clinical versus mechanical prediction: A meta-analysis. Psychol. Assess. 2000, 12, 19–30.
21. Christensen, M.K. “An eye for talent”: Talent identification and the “practical sense” of top-level soccer coaches. Sociol. Sport J. 2009, 26, 365–382.
22. Schorer, J.; Rienhoff, R.; Fischer, L.; Baker, J. Long-term prognostic validity of talent selections: Comparing national and regional coaches, laypersons and novices. Front. Psychol. 2017, 8, 1146.
23. O’Donoghue, P.; Williams, J. An evaluation of human and computer-based predictions of the 2003 Rugby Union World Cup. Int. J. Comput. Sci. Sport 2004, 3, 5–22.
24. Song, C.U.; Boulier, B.L.; Stekler, H.O. Measuring consensus in binary forecasts: NFL game predictions. Int. J. Forecast. 2009, 25, 182–191.
25. Gregory, A.W.; Yetman, J. The evolution of consensus in macroeconomic forecasting. Int. J. Forecast. 2004, 20, 461–473.
26. Gregory, A.W.; Smith, G.W.; Yetman, J. Testing for forecast consensus. J. Bus. Econ. Stat. 2001, 19, 34–43.
27. Paul, R.J.; Weinbach, A.P. Bettor misperceptions in the NBA: The overbetting of large favorites and the “hot hand”. J. Sports Econ. 2005, 6, 390–400.
28. Aukutsionek, S.A.; Belianin, A.V. Quality of forecasts and business performance: A survey study of Russian managers. J. Econ. Psychol. 2001, 22, 661–692.
29. Gröschner, C.; Raab, M. Predicting soccer outcomes: Descriptive and normative aspects of prediction models in sport. J. Sports Psychol. 2006, 13, 23–36.
30. Pachur, T.; Biele, G. Forecasting from ignorance: The use and usefulness of recognition in lay predictions of sports events. Acta Psychol. 2007, 125, 99–116.
31. Scheibehenne, B.; Bröder, A. Can lay people be as accurate as experts in predicting the results of Wimbledon 2005? Int. J. Forecast. 2007, 23, 415–426.
32. Raab, M. Simple heuristics in sports. Int. Rev. Sport Exerc. Psychol. 2012, 5, 104–120.
33. Khazaal, Y.; Chatton, A.; Billieux, J.; Bizzini, L.; Monney, G.; Fresard, E.; Thorens, G.; Bondolfi, G.; El-Guebaly, N.; Zullino, D.; et al. Effects of expertise on football betting. Subst. Abuse Treat. Prev. Policy 2012, 7, 1–6.
34. O’Donoghue, P. Factors influencing the accuracy of predictions of the 2014 FIFA World Cup. Int. J. Comput. Sci. Sport 2014, 13, 32–49.
35. Smyth, D.J.; Smyth, S.J. Major League Baseball division standings, sports journalists’ predictions and player salaries. Manag. Decis. Econ. 1994, 15, 421–429.
36. Kassarjian, H.H. Content analysis in consumer research. J. Consum. Res. 1977, 4, 8–18.
37. Tomczak, M.; Tomczak, E. The need to report effect size estimates revisited. An overview of some recommended measures of effect size. Trends Sport Sci. 2014, 1, 19–25.
38. Rockerbie, D.W. Exploring inter-league parity in North America: The NBA anomaly. J. Sports Econ. 2014, 17, 286–301.
39. Sanderson, A.R. The many dimensions of competitive balance. J. Sports Econ. 2002, 3, 204–228.
40. Foraker, S. The National Basketball Association salary cap: An antitrust violation? South Calif. Law Rev. 1985, 59, 157–181.
41. Staudohar, P.D. Salary caps in professional team sports. Compens. Work. Conditions 1998, 3, 3–11.
42. Zimbalist, A.S. Competitive balance in sports leagues: An introduction. J. Sports Econ. 2002, 3, 111–121.
43. Humphreys, B.R. Alternative measures of competitive balance in sports leagues. J. Sports Econ. 2002, 3, 133–148.
44. Michie, J.; Oughton, C. Competitive Balance in Football: Trends and Effects; Football Governance Research Centre Paper; Birkbeck College: London, UK, 2004.
45. Szymanski, S. The economic design of sporting contests. J. Econ. Lit. 2003, 41, 1137–1187.
46. Dietl, H.M.; Franck, E.; Lang, M.; Rathke, A. Salary cap regulation in professional team sports. Contemp. Econ. Policy 2011, 30, 307–319.
47. Noll, R.G. The organization of sports leagues. Oxf. Rev. Econ. Policy 2003, 19, 530–551.
48. Crooker, J.R.; Fenn, A.J. Sports leagues and parity: When league parity generates fan enthusiasm. J. Sports Econ. 2007, 8, 139–164.
49. Fort, R.; Quirk, J. Cross-subsidization, incentives, and outcomes in professional team sports leagues. J. Econ. Lit. 1995, 33, 1265–1299.
50. Késenne, S. Revenue sharing and competitive balance in professional sports. J. Sports Econ. 2000, 1, 56–65.
51. Marburger, D.R. Gate revenue sharing and luxury taxes in professional sports. Contemp. Econ. Policy 1997, 15, 114–123.
52. Solow, J.L.; Krautmann, A.C. Leveling the playing field or just lowering the salaries? The effects of redistribution in baseball. South Econ. J. 2007, 73, 947–958.
53. Szymanski, S.; Késenne, S. Competitive balance and gate revenue sharing in team sports. J. Ind. Econ. 2004, 52, 165–177.
54. Vrooman, J. A general theory of professional sports leagues. South Econ. J. 1995, 61, 971–990.
55. Sanderson, A.R.; Siegfried, J.J. Thinking about competitive balance. J. Sports Econ. 2003, 4, 255–279.
56. Quirk, J.; Fort, R. Competitive Balance in Sports Leagues. In Pay Dirt: The Business of Professional Team Sports; Princeton University Press: Princeton, NJ, USA, 1997.
57. Cain, L.P.; Haddock, D.D. Measuring parity: Tying into the idealized standard deviation. J. Sports Econ. 2006, 7, 330–338.
58. Vigdor, N. The Houston Astros’ Cheating Scandal: Sign-Stealing, Buzzer Intrigue and Tainted Pennants. The New York Times. 2020. Available online: https://www.nytimes.com/article/astros-cheating.html (accessed on 5 May 2021).
59. Pawelec, P. “Ido Movement for Culture. Journal of Martial Arts Anthropology” as a scientific journal. Kultura Bezpieczeństwa. Nauka Prakt. Refleksje 2016, 23, 213–227.
60. Ingham, A.G.; Loy, J.W. The social system of sport: A humanistic perspective. Quest 1973, 19, 3–23.
Table 1. Results from Kruskal–Wallis and Mann–Whitney U Tests for each league.

Comparison | Percent Correct (U/H, p, R) | Discrepancy Score (U/H, p, R)

NFL
W vs. RU | 434.00, 0.24, −0.22 | 344.50, 0.05, −0.25
AFC vs. NFC | 157.50, 0.07, −0.33 | 101.00, 0.002, −0.47
W vs. RU vs. AFC vs. NFC | 17.24, 0.001, 0.17 | 15.64, 0.001, 0.15
W vs. AFC | 233.00, 0.02, −0.34 | 161.50, 0.001, −0.43
W vs. NFC | 325.00, 1.00, −0.002 | 274.00, 0.34, −0.13
RU vs. AFC | 201.50, <0.001, −0.51 | 200.50, 0.02, −0.33
RU vs. NFC | 294.50, 0.16, −0.24 | 225.50, 0.06, −0.26
L vs. C vs. D | 35.34, <0.001, 0.11 | 64.10, <0.001, 0.20
L vs. C | 1055.00, 0.006, −0.28 | 1164.50, 0.36, −0.09
L vs. D | 41,430.00, <0.001, −0.34 | 2679.00, <0.001, −0.45
C vs. D | 3683.00, 0.02, −0.15 | 2775.00, <0.001, −0.27
AFC E vs. W vs. N vs. S vs. C | 1.95, 0.75, 0.02 | 1.24, 0.88, 0.01
NFC E vs. W vs. N vs. S vs. C | 5.56, 0.23, 0.05 | 6.80, 0.15, 0.06
AFC divs vs. NFC divs | 5335.00, 0.10, −0.12 | 5367.00, 0.13, −0.10

NBA
W vs. RU | 420.00, 0.78, −0.01 | 400.00, 0.45, −0.01
East vs. West | 264.00, 0.77, −0.08 | 285.00, 0.95, −0.01
W vs. RU vs. East vs. West | 0.85, 0.84, 0.01 | 0.83, 0.85, 0.01
L vs. C vs. D | 16.61, <0.001, 0.07 | 23.48, <0.001, 0.09
L vs. C | 1380.00, 0.69, −0.04 | 1328.00, 0.48, −0.07
L vs. D | 3170.00, <0.001, −0.25 | 2790.00, <0.001, −0.31
C vs. D | 2682.00, 0.007, −0.20 | 2522.50, 0.001, −0.23
Atlantic vs. Central vs. SE | 0.74, 0.65, 0.01 | 0.88, 0.64, 0.01
SW vs. NW vs. Pacific vs. MW | 2.20, 0.53, 0.03 | 1.58, 0.66, 0.02
East divs vs. West divs | 2336.00, 0.17, −0.13 | 2330.00, 0.13, −0.12

MLB
W vs. RU | 231.00, 1.00, −0.09 | 102.00, 0.001, −0.50
AL vs. NL | 182.50, 1.00, −0.05 | 169.50, 0.57, −0.09
W vs. RU vs. AL vs. NL | 4.67, 0.20, 0.06 | 7.78, 0.05, 0.09
W vs. AL | 175.00, 0.09, −0.29 | 173.50, 0.24, −0.18
W vs. NL | 174.50, 0.16, −0.25 | 201.00, 0.84, −0.03
RU vs. AL | 185.00, 0.23, −0.21 | 181.00, 0.32, −0.15
RU vs. NL | 184.00, 0.39, −0.17 | 151.00, 0.13, −0.24
L vs. C vs. D | 28.44, <0.001, 0.11 | 52.07, <0.001, 0.21
L vs. C | 718.50, 0.06, −0.23 | 799.50, 0.59, −0.06
L vs. D | 2163.00, <0.001, −0.34 | 1467.50, <0.001, −0.44
C vs. D | 2443.50, 0.007, −0.19 | 1686.50, <0.001, −0.34
AL East vs. Central vs. West | 2.48, 0.28, 0.03 | 2.88, 0.24, 0.04
NL East vs. Central vs. West | 1.80, 0.44, 0.02 | 2.87, 0.24, 0.04
AL divs vs. NL divs | 3114.00, 0.28, −0.10 | 3179.50, 0.36, −0.07

NHL
W vs. RU | 278.50, 0.06, −0.27 | 313.00, 0.53, −0.09
East vs. West | 180.00, 0.60, −0.11 | 128.50, 0.05, −0.31
W vs. RU vs. East vs. West | 5.37, 0.14, 0.06 | 5.75, 0.13, 0.06
L vs. C vs. D | 11.77, 0.002, 0.06 | 36.43, <0.001, 0.17
L vs. C | 955.00, 0.29, −0.12 | 1027.00, 0.78, −0.03
L vs. D | 2292.50, 0.001, −0.25 | 1440.50, <0.001, −0.44
C vs. D | 1960.00, 0.09, −0.14 | 1421.00, <0.001, −0.30
Atlantic vs. Met vs. NE vs. SE | 1.34, 0.71, 0.02 | 3.87, 0.28, 0.07
Central vs. NW vs. Pacific | 3.42, 0.19, 0.06 | 2.62, 0.27, 0.05
East divs vs. West divs | 1363.00, 0.06, −0.19 | 1403.50, 0.10, −0.15
Note: W = League Winner; RU = League Runner-Up; AFC = American Football Conference; NFC = National Football Conference; L = League-Wide forecasts; C = Conference-Wide forecasts; D = Divisional forecasts; E = East; W = West; N = North; S = South; C = Central; SE = South East; SW = South West; NW = North West; MW = Mid-West; AL = American League; NL = National League; Met = Metropolitan; NE = North East.
Table 2. Descriptive statistics for each league.

Category | Avg. N | Avg. % Correct | Mdn. Discrepancy Score

NFL
Winner | 30.8 | 0.1 | −4.0
Runner-Up | 30.8 | 0 | −2.0
AFC | 15.4 | 0.4 | −1.0
NFC | 15.3 | 0.1 | −3.0
AFC East | 4.5 | 0.6 | 0
AFC North | 4.0 | 0.4 | −1.0
AFC West | 4.5 | 0.5 | −1.0
AFC South | 4.0 | 0.5 | −1.0
AFC Central | 4.7 | 0.6 | 0
NFC East | 4.5 | 0.3 | −1.0
NFC North | 4.0 | 0.5 | 0
NFC West | 4.2 | 0.5 | 0
NFC South | 4.0 | 0.3 | −1.0
NFC Central | 5.0 | 0.2 | −1.0
League | 30.8 | 0.05 | −2.0
Conference | 15.4 | 0.2 | −2.0
Division | 4.3 | 0.4 | −1.0

NBA
Winner | 29.0 | 0.4 | −1.5
Runner-Up | 29.0 | 0.3 | −1.5
Eastern Conference | 14.6 | 0.3 | −1.0
Western Conference | 14.4 | 0.4 | −1.0
Atlantic | 5.9 | 0.7 | 0
Central | 6.3 | 0.7 | 0
South East | 5.0 | 0.6 | 0
South West | 5.0 | 0.7 | 0
North West | 5.0 | 0.5 | −1.0
Pacific | 6.0 | 0.6 | 0
Mid-West | 6.8 | 0.4 | −1.0
League | 29.0 | 0.3 | −1.5
Conference | 14.5 | 0.4 | −1.0
Division | 5.7 | 0.6 | 0

MLB
Winner | 29.0 | 0.04 | −4.0
Runner-Up | 29.0 | 0.1 | −2.0
AL | 14.2 | 0.2 | −3.0
NL | 14.9 | 0.2 | −4.0
AL East | 5.4 | 0.5 | 0
AL Central | 5.0 | 0.6 | 0
AL West | 4.8 | 0.4 | −1.0
NL East | 5.2 | 0.5 | −0.5
NL Central | 5.6 | 0.5 | −1.0
NL West | 5.1 | 0.3 | −1.0
League | 29.0 | 0.1 | −3.0
Conference | 14.5 | 0.2 | −3.0
Division | 5.2 | 0.5 | −1.0

NHL
Winner | 27.9 | 0.2 | −3.0
Runner-Up | 27.9 | 0.04 | −2.5
Eastern Conference | 15.0 | 0.2 | −3.5
Western Conference | 14.3 | 0.3 | −1.5
Atlantic | 6.1 | 0.3 | −1.0
Metropolitan | 8.0 | 0.2 | −2.0
North East | 5.3 | 0.3 | −1.0
South East | 4.9 | 0.4 | −1.0
Central | 5.6 | 0.4 | −1.0
North West | 4.9 | 0.7 | 0
Pacific | 5.8 | 0.5 | −1.0
League | 27.9 | 0.1 | −3.0
Conference | 14.6 | 0.2 | −2.0
Division | 5.8 | 0.4 | −1.0
Note: Mdn = Median; League = all League-Wide predictions (e.g., forecasts for league winner and league runner up); Conference = predictions from both conferences (e.g., the AFC and NFC); Division = predictions from all the divisions within the league.
Table 3. Comparison across leagues at the League, Conference, and Divisional levels.

Comparison | Percent Correct (U/H, p, R) | Discrepancy Score (U/H, p, R)
LNFL vs. LNBA vs. LMLB vs. LNHL | 22.36, <0.001, 0.10 | 22.21, <0.001, 0.10
NFL vs. NBA | 1330.00, <0.001, −0.36 | 1091.50, <0.001, −0.36
NFL vs. MLB | 1337.00, 0.69, −0.04 | 1319.50, 0.77, −0.03
NFL vs. NHL | 3427.50, 0.11, −0.17 | 3431.00, 0.35, −0.09
NBA vs. MLB | 970.00, 0.002, −0.31 | 753.50, <0.001, −0.37
NBA vs. NHL | 1300.00, 0.03, −0.21 | 1035.00, 0.001, −0.31
MLB vs. NHL | 1069.50, 0.33, −0.13 | 1019.00, 0.28, −0.11
CNFL vs. CNBA vs. CMLB vs. CNHL | 3.15, 0.37, 0.02 | 11.84, 0.01, 0.07
NFL vs. NBA | 870.00, 0.18, −0.15 | 723.50, 0.02, −0.25
NFL vs. MLB | 813.00, 1.00, −0.01 | 717.50, 0.33, −0.11
NFL vs. NHL | 830.00, 1.00, −0.01 | 836.00, 0.97, −0.004
NBA vs. MLB | 801.00, 0.17, −0.15 | 576.00, 0.002, −0.34
NBA vs. NHL | 840.00, 0.25, −0.13 | 690.00, 0.02, −0.25
MLB vs. NHL | 765.00, 1.00, −0.02 | 661.50, 0.24, −0.13
DNFL vs. DNBA vs. DMLB vs. DNHL | 13.75, 0.004, 0.02 | 12.57, 0.005, 0.02
NFL vs. NBA | 13,351.00, 0.002, −0.16 | 13,200.00, 0.002, −0.16
NFL vs. MLB | 17,621.00, 0.54, −0.03 | 18,035.00, 0.82, −0.01
NFL vs. NHL | 12,304.00, 0.56, −0.03 | 12,516.00, 0.76, −0.02
NBA vs. MLB | 10,498.00, 0.02, −0.13 | 10,194.00, 0.01, −0.15
NBA vs. NHL | 6737.00, 0.001, −0.20 | 6749.00, 0.002, −0.19
MLB vs. NHL | 8947.00, 0.27, −0.07 | 9324.00, 0.63, −0.03
Note: L = League-Wide predictions (e.g., the forecasts for League-Winner and Runner-Up); C = Conference-Wide predictions (e.g., the forecasts for both conferences); D = Divisional predictions for each league.
Table 4. Summary of statistics across leagues.

Category | League | Avg. N | Avg. % Correct | Mdn. Discrepancy Score
League-Wide | NFL | 30.7 | 0.05 | −2.0
League-Wide | NBA | 29.0 | 0.3 | −1.5
League-Wide | MLB | 29.0 | 0.1 | −3.0
League-Wide | NHL | 27.9 | 0.1 | −3.0
Conference-Wide | NFL | 15.4 | 0.2 | −2.0
Conference-Wide | NBA | 14.5 | 0.4 | −1.0
Conference-Wide | MLB | 14.5 | 0.2 | −3.0
Conference-Wide | NHL | 14.6 | 0.2 | −2.0
Divisional | NFL | 4.3 | 0.4 | −1.0
Divisional | NBA | 5.7 | 0.6 | 0
Divisional | MLB | 5.2 | 0.5 | −1.0
Divisional | NHL | 5.8 | 0.4 | −1.0
Table 5. Results from Kruskal–Wallis and Mann–Whitney U Tests for each league over time.

Comparison | League | Percent Correct (U/H, p, R) | Discrepancy Score (U/H, p, R)
W vs. W vs. W | NFL | 2.32, 0.29, 0.08 | 0.36, 0.84, 0.01
RU vs. RU vs. RU | NFL | 0, 1.00, 0 | 0.99, 0.62, 0.03
W vs. W vs. W | NBA | 5.27, 0.09, 0.18 | 6.82, 0.03, 0.23
1998–2007 vs. 2008–2018 | NBA | - | 29.50, 0.12, −0.35
1988–1997 vs. 1998–2007 | NBA | - | 17.00, 0.01, −0.58
1988–1997 vs. 2008–2018 | NBA | - | 38.50, 0.40, −0.21
RU vs. RU vs. RU | NBA | 2.76, 0.40, 0.09 | 2.32, 0.34, 0.08
W vs. W vs. W | MLB | 1.75, 0.42, 0.08 | 7.02, 0.02, 0.33
2004–2011 vs. 2012–2018 | MLB | - | 23.50, 1.00, −0.04
1996–2003 vs. 2004–2011 | MLB | - | 8.50, 0.02, −0.62
1996–2003 vs. 2012–2018 | MLB | - | 12.00, 0.05, −0.50
RU vs. RU vs. RU | MLB | 3.67, 0.30, 0.18 | 5.69, 0.05, 0.27
W vs. W vs. W | NHL | 4.78, 0.10, 0.17 | 3.54, 0.17, 0.13
RU vs. RU vs. RU | NHL | 2.14, 0.64, 0.10 | 1.93, 0.40, 0.09
Note: W = League Winner; RU = League Runner-Up.
Table 6. Descriptive statistics over time.

Time Bracket | League | Avg. N | Avg. Percent Correct | Mdn. Discrepancy Score
1988–1997 | NFL | 28.6 | 0.2 | −3.0
1998–2007 | NFL | 31.5 | 0.1 | −3.5
2008–2018 | NFL | 32.0 | 0 | −4.0
1988–1997 | NBA | 27.6 | 0.6 | 0
1998–2007 | NBA | 29.4 | 0.1 | −3.0
2008–2018 | NBA | 30.0 | 0.4 | −1.0
1996–2003 | MLB | 29.5 | 0.1 | −3.0
2004–2011 | MLB | 30.0 | 0 | −5.0
2012–2018 | MLB | 30.0 | 0 | −5.0
1988–1997 | NHL | 23.9 | 0 | −4.0
1998–2007 | NHL | 29.0 | 0.4 | −1.0
2008–2018 | NHL | 30.2 | 0.3 | −2.0
Table 7. Jonckheere–Terpstra Test of Trend results.

Comparison | League | J | p | R
W vs. W vs. W | NFL | 175.00 | 0.58 | 0.10
RU vs. RU vs. RU | NFL | 160.50 | 0.99 | 0.003
W vs. W vs. W | NBA | 174.00 | 0.35 | 0.17
RU vs. RU vs. RU | NBA | 144.50 | 0.82 | −0.04
W vs. W vs. W | MLB | 115.00 | 0.02 | 0.48
2004–2011 vs. 2012–2018 | MLB | 23.50 | 1.00 | −0.04
1996–2003 vs. 2004–2011 | MLB | 47.50 | 0.02 | 0.62
1996–2003 vs. 2012–2018 | MLB | 44.00 | 0.08 | 0.50
RU vs. RU vs. RU | MLB | 106.00 | 0.12 | 0.34
W vs. W vs. W | NHL | 111.50 | 0.25 | −0.22
RU vs. RU vs. RU | NHL | 77.00 | 0.84 | −0.05
Note: W = League Winner; RU = League Runner-Up.