|
| 1 | +# Processed Data |
| 2 | + |
| 3 | +This folder contains the processed data files that are used for visualization. |
| 4 | + |
| 5 | +Can be found here: |
| 6 | + |
| 7 | +## Description of files |
| 8 | + |
| 9 | +### player_seasons.csv |
| 10 | + |
| 11 | +Contains aggregated season-level statistics for players. Each row corresponds to a player's performance in a specific season, with the following columns: |
| 12 | + |
| 13 | +- `season`: The NBA season (e.g., 2020 for the 2020-2021 season) |
| 14 | +- `personId`: Unique identifier for the player (NBA player ID) |
| 15 | +- `firstName`, `lastName`: Player's first and last name |
| 16 | +- `gameType`: Regular Season or Playoffs |
| 17 | +- `teamScore`: Average number of points scored by the player's team in the season |
| 18 | +- `opponentScore`: Average number of points scored by the opponent teams in the season |
| 19 | +- `points`, `assists`, `rebounds`, `blocks`, `steals`, `turnovers`: Per-game averages of the player's general statistics in the season |
| 20 | +- `pointsTotal`, `assistsTotal`, `reboundsTotal`, `blocksTotal`, `stealsTotal`, `turnoversTotal`: Sum throughout the season of the corresponding statistics for the player |
| 21 | +- `fieldGoalsMade`, `fieldGoalsAttempted`, `threePointersMade`, `threePointersAttempted`, `freeThrowsMade`, `freeThrowsAttempted`: Sum throughout the season of the corresponding shot statistics for the player |
| 22 | +- `plusMinusPoints`: Average plus/minus points per game for the player in the season (i.e., the average point differential when the player is on the court) |
| 23 | +- `foulsPersonal`: Average personal fouls per game for the player in the season |
| 24 | +- `numMinutes`: Average minutes played per game for the player in the season |
| 25 | +- `win`: Number of games won by the player's team in the season |
| 26 | +- `gamesPlayed`: Number of games played by the player in the season |
| 27 | +- `proportionThreePoint`: Proportion of three-point shots attempted out of total field goal attempts for the player in the season |
| 28 | +- `fieldGoalsPercentage`, `threePointersPercentage`, `freeThrowsPercentage`: Shooting percentages for the player in the season |
| 29 | +- `salary`: Average salary for the player in the season (if available, otherwise NaN) |
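
The ratio columns above can be cross-checked against the season totals. The following is a minimal sketch, assuming the percentages are stored as plain made/attempted fractions (the README does not state the scale) and that `proportionThreePoint` is `threePointersAttempted / fieldGoalsAttempted`. The `_check` suffix and the `add_shooting_ratios` helper are hypothetical names used only for illustration.

```python
import numpy as np
import pandas as pd


def add_shooting_ratios(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with recomputed ratio columns (suffix `_check`)."""
    out = df.copy()
    # new column -> (numerator column, denominator column)
    pairs = {
        "fieldGoalsPercentage_check": ("fieldGoalsMade", "fieldGoalsAttempted"),
        "threePointersPercentage_check": ("threePointersMade", "threePointersAttempted"),
        "freeThrowsPercentage_check": ("freeThrowsMade", "freeThrowsAttempted"),
        "proportionThreePoint_check": ("threePointersAttempted", "fieldGoalsAttempted"),
    }
    for new_col, (num, den) in pairs.items():
        # Zero attempts would divide by zero; treat them as missing instead.
        denominator = out[den].replace(0, np.nan)
        out[new_col] = out[num] / denominator
    return out


# Tiny made-up sample (not real data): one regular row, one with no attempts.
sample = pd.DataFrame({
    "fieldGoalsMade": [400, 0],
    "fieldGoalsAttempted": [800, 0],
    "threePointersMade": [100, 0],
    "threePointersAttempted": [250, 0],
    "freeThrowsMade": [150, 0],
    "freeThrowsAttempted": [200, 0],
})
checked = add_shooting_ratios(sample)
```

Comparing the `_check` columns with the stored ones (for example with `numpy.isclose`) would show whether the file uses fractions or a 0-100 scale.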
| 30 | + |
| 31 | +### player_games.csv |
| 32 | + |
| 33 | +TODO |
| 34 | + |
| 35 | +### shot_events.csv |
| 36 | + |
| 37 | +TODO |
| 38 | + |
| 39 | +### player_metadata.csv |
| 40 | + |
| 41 | +Contains metadata about players (from nba_api), with the following columns: |
| 42 | + |
| 43 | +- `personId`: Unique identifier for the player (NBA player ID) |
| 44 | +- `firstName`: Player's first name |
| 45 | +- `lastName`: Player's last name |
| 46 | +- `birthDate`: Player's birth date (in datetime format) |
| 47 | +- `height`: Player's height in centimeters |
| 48 | +- `weight`: Player's weight in pounds |
| 49 | +- `nbSeasons`: Number of seasons the player has played in the NBA |
| 50 | +- `jerseyNumber`: The jersey number the player wore in the NBA (e.g., 23) |
| 51 | +- `position`: The position the player played in the NBA (e.g., "Guard", "Forward", "Center") |
| 52 | +- `startYear`: The year the player started playing in the NBA |
| 53 | +- `endYear`: The year the player ended playing in the NBA |
| 54 | +- `draftYear`: The year the player was drafted into the NBA |
| 55 | +- `draftRound`: The round in which the player was drafted |
| 56 | +- `draftNumber`: The number of the pick in the draft |
| 57 | + |
| 58 | +### team_seasons.csv |
| 59 | + |
| 60 | +Contains aggregated season-level statistics for teams. Each row corresponds to a team's performance in a specific season, with the following columns: |
| 61 | + |
| 62 | +- `season`: The NBA season (e.g., 2020 for the 2020-2021 season) |
| 63 | +- `teamId`: Unique identifier for the team (NBA team ID) |
| 64 | +- `gameType`: Regular Season or Playoffs |
| 65 | +- `teamName`: Name of the team (e.g., "Lakers") |
| 66 | +- `teamCity`: City of the team (e.g., "Los Angeles") |
| 67 | +- `teamScore`: Average number of points scored by the team in the season |
| 68 | +- `opponentScore`: Average number of points scored by the opponent teams in the season |
| 69 | +- `assists`, `rebounds`, `blocks`, `steals`, `turnovers`: Per-game averages of the team's general statistics in the season |
| 70 | +- `teamScoreTotal`, `opponentScoreTotal`, `assistsTotal`, `reboundsTotal`, `blocksTotal`, `stealsTotal`, `turnoversTotal`: Sum throughout the season of the corresponding statistics for the team |
| 71 | +- `fieldGoalsMade`, `fieldGoalsAttempted`, `threePointersMade`, `threePointersAttempted`, `freeThrowsMade`, `freeThrowsAttempted`: Average per game shot statistics for the team in the season |
| 72 | +- `fieldGoalsPercentage`, `threePointersPercentage`, `freeThrowsPercentage`: Shooting percentages for the team in the season |
| 73 | +- `proportionThreePoint`: Proportion of three-point shots attempted out of total field goal attempts for the team in the season |
| 74 | +- `plusMinusPoints`: Average plus/minus points per game for the team in the season (i.e., the team's average point differential per game) |
| 75 | +- `foulsPersonal`: Average personal fouls per game for the team in the season |
| 76 | +- `numMinutes`: Average minutes played per game for the team in the season |
| 77 | +- `win`: Number of games won by the team in the season |
| 78 | +- `losses`: Number of games lost by the team in the season |
| 79 | +- `gamesPlayed`: Number of games played by the team in the season (normally 82 for the regular season, but can be fewer for older seasons or playoffs) |
| 80 | +- `salary`: Average salary for the team in the season (if available, otherwise NaN) |
| 81 | + |
| 82 | + |
| 83 | +### team_games.csv |
| 84 | + |
| 85 | +Contains detailed game-level statistics for teams. Each row corresponds to a team's performance in a specific game, with the following columns: |
| 86 | + |
| 87 | +- `gameId`: Unique identifier for the game |
| 88 | +- `gameDateTimeEst`: Date and time of the game in Eastern Standard Time |
| 89 | +- `teamCity`: City of the team (e.g., "Los Angeles") |
| 90 | +- `teamName`: Name of the team (e.g., "Lakers") |
| 91 | +- `teamId`: Unique identifier for the team (NBA team ID) |
| 92 | +- `opponentTeamCity`: City of the opponent team (e.g., "Boston") |
| 93 | +- `opponentTeamName`: Name of the opponent team (e.g., "Celtics") |
| 94 | +- `opponentTeamId`: Unique identifier for the opponent team (NBA team ID) |
| 95 | +- `home`: Boolean indicating if the team was playing at home (1) or away (0) |
| 96 | +- `win`: Boolean indicating if the team won (1) or lost (0) the game |
| 97 | +- `season`: The NBA season (e.g., 2020 for the 2020-2021 season) |
| 98 | +- `gameType`: Regular Season or Playoffs |
| 99 | +- `teamScore`: Number of points scored by the team in the game |
| 100 | +- `opponentScore`: Number of points scored by the opponent team in the game |
| 101 | +- `numMinutes`: Duration of the game in minutes (should be 48 for a regulation game, but can be more for overtime games) |
| 102 | +- `assists`, `rebounds`, `reboundsDefensive`, `reboundsOffensive`, `blocks`, `steals`, `turnovers`: General statistics for the team in the game |
| 103 | +- `foulsPersonal`: Number of fouls committed by the team in the game |
| 104 | +- `fieldGoalsMade`, `fieldGoalsAttempted`, `threePointersMade`, `threePointersAttempted`, `freeThrowsMade`, `freeThrowsAttempted`: Shot statistics for the team in the game |
| 105 | +- `fieldGoalsPercentage`, `threePointersPercentage`, `freeThrowsPercentage`: Shooting percentages for the team in the game |
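
The game-level rows described above can be rolled up into season-level rows similar to those in `team_seasons.csv`. This is a rough sketch on made-up data, not necessarily the procedure used to build the file. It assumes one row per team per game and that `win` is coded 1 for a win and 0 for a loss.

```python
import pandas as pd

# Made-up sample with the same column names as team_games.csv (not real data).
games = pd.DataFrame({
    "gameId": [1, 1, 2, 2],
    "teamId": [10, 20, 10, 20],
    "season": [2020, 2020, 2020, 2020],
    "gameType": ["Regular Season"] * 4,
    "teamScore": [110, 100, 95, 105],
    "opponentScore": [100, 110, 105, 95],
    "win": [1, 0, 0, 1],
})

# One output row per (season, team, game type).
season_rows = (
    games.groupby(["season", "teamId", "gameType"], as_index=False)
    .agg(
        teamScore=("teamScore", "mean"),          # per-game average
        opponentScore=("opponentScore", "mean"),  # per-game average
        teamScoreTotal=("teamScore", "sum"),      # season total
        win=("win", "sum"),                       # number of wins
        gamesPlayed=("gameId", "count"),          # number of games
    )
)
# Losses follow from games played minus wins.
season_rows["losses"] = season_rows["gamesPlayed"] - season_rows["win"]
```

Grouping on `gameType` as well keeps regular-season and playoff aggregates in separate rows, which matches the column list of `team_seasons.csv`.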
| 106 | + |
| 107 | + |
| 108 | +### team_metadata.csv |
| 109 | + |
| 110 | +Contains metadata about teams, with the following columns: |
| 111 | + |
| 112 | +- `teamId`: Unique identifier for the team (NBA team ID) |
| 113 | +- `teamAbbrev`: Abbreviation of the team name (e.g., "LAL" for Los Angeles Lakers) |
| 114 | +- `teamSlug`: Slug version of the team name (e.g., "los-angeles-lakers"), used to build the team logo URL `https://i.logocdn.com/nba/{year}/{teamSlug}.svg` |
| 115 | +- `Color1`, `Color2`, `Color3`, `Color4`, `Color5`: Colors associated with the team (in hexadecimal format, e.g., "#552583") |
| 116 | +- `seasonFounded`: The season in which the team was founded (e.g., 1947 for the first NBA season) |
| 117 | +- `seasonActiveTill`: The most recent season in which the team was active (2100 is used as a placeholder for teams that are still active; defunct teams have a past season) |
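
As an illustration of how these columns might be used, the sketch below builds a logo URL from the template given for `teamSlug` and checks whether a team was active in a given season. The helper names `logo_url` and `was_active` are hypothetical, and which value `{year}` should take (for example the season start year) is an assumption, since the README does not specify it.

```python
LOGO_URL_TEMPLATE = "https://i.logocdn.com/nba/{year}/{teamSlug}.svg"


def logo_url(team_slug: str, year: int) -> str:
    """Fill the logo URL template with a team slug and a year."""
    return LOGO_URL_TEMPLATE.format(year=year, teamSlug=team_slug)


def was_active(season: int, season_founded: int, season_active_till: int) -> bool:
    """True if `season` lies between seasonFounded and seasonActiveTill (inclusive).

    Active teams carry a far-future seasonActiveTill (2100 in this dataset),
    so the same comparison works for active and defunct teams.
    """
    return season_founded <= season <= season_active_till


lakers_logo = logo_url("los-angeles-lakers", 2020)
```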