Author: Ruben R. Kazumov
URL: https://github.com/kazumov/pulsar
University: HarvardX
Course: PH125.9x
Program: Data Science Professional
Course URL: HarvardX PH125.9x
The course graduation project. The project studies the HTRU2 data set with the Rborist, Random Forest, and KNN classification algorithms.
It compares prediction accuracy and calculation time and analyzes the prediction errors.
| File | Description |
|---|---|
| report.pdf | Detailed description of analysis and the building of classification and regression predictive systems |
| report.Rmd | Source code of the report.pdf file |
| report.R | Source code of the analysis, the building of predictive systems, and the RMSE tests of the predicted vectors |
| htru2.Rds | HTRU2 data set in RDS format |
| correlationMatrix.png | The 9x9 correlation matrix of the data set. Exported gg object. |
| errors3d.png | The classification errors. Exported image. |
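
To take a quick look at the data before opening the report, `htru2.Rds` can be read with base R's `readRDS()`. The snippet below is only a minimal sketch: the helper name `load_htru2` is hypothetical (it is not part of this repository), the default path assumes the working directory is the repository root, and no column names are assumed.

```r
# Hypothetical helper (not part of the repository): read the RDS file
# and print a quick overview without assuming any column names.
load_htru2 <- function(path = "htru2.Rds") {
  if (!file.exists(path)) {
    stop("File not found: ", path)
  }
  data <- readRDS(path)  # base R reader for single-object RDS files
  str(data)              # print the object's structure (names and types)
  invisible(data)        # return the object without printing it again
}
```

From the repository root, `htru2 <- load_htru2()` should load the object and print its structure; pass another path if the file lives elsewhere.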