Instructions on How to Run This Code

To make requests to Spotify's Web API, a Node.js application was used. This application, which is called app.js, is under the spotify-webAPI/authorization_code folder. To run the appplication, enter the following command:

'node app.js'

After the application is running, the user should go to the 'localhost:8888' url to enter their Spotify credentials (i.e. Spotify only grants authorization to use the API to users that have a Spotify account). The app.js code makes three get requests: two to the search endpoint using randomly generated queries and offsets, and one to the search endpoint with a restriction by year (2018) but again using random queries and offsets. The information gathered by these requests iss then used to search for the meta-data of each track. The resulting json data is saved under the spotify-webAPI/authorization_code/json_files folder. Note, this folder contains the files used to process the data.

To pre-process the data, run:

python process_data.py total_data.json

To get the histogram of popularity, run:

python visualize_data.py data/processed_50_total_data.json

To see the chi-squared and covariance analysis, run:

python chi_squared_and_covariance_analysis.py data/processed_50_total_data.json

To make naive bayes predictions without PCA, run the following command. It may take a while since it's running 5,000 trials. Since we use random holdout we use just one input file.

python make_naive_bayes_predictions.py <num_trials> data/processed_50_total_data.json

e.g.

python make_naive_bayes_predictions.py 500 data/processed_50_total_data.json

To make naive bayes predictions with PCA, run the following command. <num_components> must be a number between 1 and 10. <filter_c_features> and <filter_d_features> must be 1 or 0. If 1, it only uses the best features from covariance or chi-squared analysis respectively. If you set <filter_c_features> to 1 then <num_components> is automatically capped at 5.

python make_naive_bayes_predictions_with_pca.py <num_trials> <num_components> <filter_c_features> <filter_d_features> data/processed_50_total_data.json

e.g.

python make_naive_bayes_predictions_with_pca.py 500 10 0 0 data/processed_50_total_data.json

To graphs of predictions from continuous features when using different PCA configurations, run:

python pca_graphs.py data/processed_50_total_data.json

To make SVM predictions and get the confusion matrix, run following command:

'python svc_predictions.py <num_trials> data/processed_50_total_data.json'

e.g. 'python svc_predictions.py 100 data/processed_50_total_data.json'

To make SVM predictions with bagging and get the confusion matrix, run following command:

'python svm_bagging_predictions.py <num_trials> <num_estimators> data/processed_50_total_data.json'

e.g. 'python svm_bagging_predictions.py 50 15 data/processed_50_total_data.json'

Note that the standard deviation will only be calculated for the SVM classifier and for the SVM classifier with bagging if <num_trials> is greater than 1.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Instructions on How to Run This Code

About

Uh oh!

Releases

Packages

Contributors 9

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
data		data
spotify-webAPI		spotify-webAPI
.gitignore		.gitignore
200_000_eng_tracks		200_000_eng_tracks
50_000_eng_tracks_with_good_utf8_encoding		50_000_eng_tracks_with_good_utf8_encoding
README.md		README.md
chi_squared_and_covariance_analysis.py		chi_squared_and_covariance_analysis.py
lyrics.py		lyrics.py
make_naive_bayes_predictions.py		make_naive_bayes_predictions.py
make_naive_bayes_predictions_with_pca.py		make_naive_bayes_predictions_with_pca.py
pca_graphs.py		pca_graphs.py
process_data.py		process_data.py
svc_predictions.py		svc_predictions.py
svm_bagging_predictions.py		svm_bagging_predictions.py
visualize_data.py		visualize_data.py

bzanardo/data-science-project

Folders and files

Latest commit

History

Repository files navigation

Instructions on How to Run This Code

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Uh oh!

Languages

Packages