Skip to content

Latest commit

 

History

History
44 lines (23 loc) · 2.13 KB

README.md

File metadata and controls

44 lines (23 loc) · 2.13 KB

NBARefs

A repository with our CS122 Project about NBA refereeing

The compressed file NBA.tar.gz contains the large folder that configures the website. I was able to unzip the file on my VM to get the folder so hopefully that shouldn't be a problem. There is another README in that folder to explain how to get to the website.

When using the exported VM, please use the cs122-project directory for the website configuration.

Contents:

A database of NBA .. .docx - Word document containing the original proposal of our project

NBA.tar.gz - Compressed folder containing the website configuration. See the introduction of this file for more info

NBARefs.db - The final database used for statistical analysis

NBAStat_Final.xlsx - The cleaned up version of the PDFs using Adobe Acrobat

Output.csv - The file mapping each referee to the game codes of the games they officiated. It is the csv version of the referees table in the final database

NBAStat.txt - A tab delimited txt file containing the same data as NBAStat_Final.xlsx

Referees.py - A file containing the functions that generate Output.csv

Sorted300, Sorted400 - The lists of referees sorted by accuracy with cutoffs at 300 and 400 respectively

create-db.sql - The sql query that generates NBARefs.db

data_dict.csv - The output of parser.py. It is the csv version of the calls table in the final database

parser.py - The code that generates data_dict.csv

proportion_shooting.csv - The list of referees by proportion of shooting fouls over total fouls

ranking.csv - The list of all referees in the dataset, with incorrect calls, incorrect non calls, total incorrect calls, total calls and error ratio

ranking.sql - The sql query that generates ranking.csv

star_players.csv - A csv file containing the error ratios for star players in our dataset.

statistical_analysis.py - The code that generates all of the stats that we used in our paper

top_teams.csv - A file containing the ratio of games including top teams officiated by the top and bottom 10 referees by accuracy

util.py - A function containing auxiliary methods used to generate Output.csv. It is the same file that was provided to us in PA2