Sports Datasets for Data Modeling, Visualization, Predictions, Machine-Learning

Sports Data Sets / October 31, 2020

🏈 Football Data Sets

  • NFL Stats data compiled from publicly available NFL play-by-play data.
  • Detailed NFL Play-by-Play Data 2009-2018: Regular season plays from 2009-2016 containing information on players, game situation, results, win probabilities, and miscellaneous advanced metrics.
  • NFL Draft Outcomes: All players selected in the NFL Draft from 1985-2015 including outcome stats.

College Football Stats & Data Sets

  • CFB Stats – Detailed, downloadable CFB stats in CSV format. Data includes kicking, kickoff, kickoff-return, passing, punt-return, rushing, scoring, and punting statistics. Conferences: Sun Belt, SEC, Pac-12, Mountain West, MAC, Independents, CUSA, Big Ten, American Conference, Big 12, ACC, and FBS.

🎾 Tennis Data Sets

  • ATP World Tour tennis data: ATP tournaments, match scores, match stats, rankings, and player overview data extracted from the ATP World Tour website. The dataset is updated annually in October.

⚽ Soccer Data Sets

FIFA Video-game Data Sets

Results Data Sets

  • Historical soccer results datasets – A reference for historical soccer data, featuring half-time and full-time scores and player stats from European and international soccer leagues, going back to 1994. Downloadable in CSV format. Updated weekly.
  • football.db: Open source, free and public domain soccer database & schema for use in any (programming) language.
  • International football results from 1872 to 2018: 40,000 results of soccer matches from the very first official match in 1872 up until 2018.
  • German Bundesliga Data – Data for the last 10 seasons of the German Bundesliga, including the current season. The data is updated weekly and contains statistics such as full-time and half-time results and corners.

World Cup Data Sets

  • World Cup Dataset: This data set covers the historical World Cups, including all match data.

🏀 Basketball Data Sets

🏎️ Racing Data Sets

  • Ergast Formula One Dataset: An experimental web service which provides a historical record of motor racing data for non-commercial purposes.
  • Formula 1 Race Data: Results dataset covering Formula 1 seasons from 1950 to 2017. Data includes constructors, drivers, lap times, and pit stops.
  • MotoGP Dataset – The statistics database

⚾ Baseball Data Sets

  • Historical MLB Scores & Odds Dataset 2010-2020. Historical odds & scores data from MLB seasons 2010-2020 inclusive – including run-lines, opening and closing moneylines and totals (over/under).
  • Lahman’s Baseball Database: A complete history of Major League Baseball stats from 1871 to 2018, including batting and pitching stats, standings, team stats, managerial records, post-season data, and more.
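
When modeling with the moneylines in an odds dataset like the one above, a common first step is converting each American moneyline into the win probability it implies. The sketch below is a minimal illustration of that standard conversion, not something taken from any dataset's documentation; the function names are hypothetical, and it assumes the moneylines are stored as American odds (for example -150 or +130).

```python
from typing import Tuple


def moneyline_to_implied_probability(moneyline: int) -> float:
    """Convert an American moneyline to the implied win probability."""
    if moneyline == 0:
        raise ValueError("American moneyline odds cannot be zero")
    if moneyline < 0:
        # Favorite: the bettor risks |moneyline| to win 100.
        stake = -moneyline
        return stake / (stake + 100)
    # Underdog: the bettor risks 100 to win `moneyline`.
    return 100 / (moneyline + 100)


def remove_vig(prob_a: float, prob_b: float) -> Tuple[float, float]:
    """Rescale two implied probabilities so they sum to 1.

    The raw implied probabilities of both sides usually add up to more
    than 1 because of the bookmaker's margin; proportional rescaling is
    one simple (assumed) way to remove it.
    """
    total = prob_a + prob_b
    return prob_a / total, prob_b / total


if __name__ == "__main__":
    home = moneyline_to_implied_probability(-150)
    away = moneyline_to_implied_probability(130)
    print(remove_vig(home, away))
```

Comparing the rescaled probabilities for the opening and closing lines is one simple way to turn the raw odds columns into model features.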

🏒 Hockey Data Sets

Miscellaneous Sports Data Sets and Databases

  • Structured ball-by-ball data for international and IPL cricket matches, 2015 to 2019 inclusive.
  • FiveThirtyEight – Data-driven sports journalism and analysis, with datasets regularly published to GitHub.
  • SPORTS-1M: 1M sports videos, averaging 5.5 minutes in length, labelled with 487 sports classes.
  • 120 years of Olympic history: A historical data set on the Olympic Games, including all the Games from Athens 1896 to Rio 2016.
  • Daily and Sports Activities Data Set: Motion sensor data of 19 sports activities, each performed by 8 subjects in their own style for 5 minutes.
  • NHL Game Data: Game, team, player and play data including x,y coordinates measured for each game in the NHL in the past 6 years.

Extracting / Scraping Sports Data from websites

Document Last Updated November 2020.

Thomas Nielsen
Tom loves the NBA, NFL, and hockey. When he is not analyzing sports data, he is watching sports or writing about them!