Driver Telematics Analysis is a Kaggle challenge. For more details, see the challenge page. Besides solving a machine learning problem, we want to learn how to use git and scikit-learn.
Submissions can be generated by running scripts from the scripts directory, using the repository root as the working directory. Features implement a common interface and are stored in the features package. Utilities such as plotting and I/O are part of the utils package. Working notes are stored as IPython notebooks in the notebooks directory.
- replace power computation in the distance function with NumPy's squared distance (should be more efficient)
- plot angle feature in color over trip track
- plot angle feature histograms, before and after RDP
- compute best RDP epsilon value
- create script that reduces trips using RDP and stores them as *.npy
- run RDP, recompute angle feature submissions
- analyze article by Olariu
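
Several of the items above depend on RDP (Ramer-Douglas-Peucker) trip reduction. As a starting point, here is a minimal sketch of what that step could look like. It is not code from this repository: it assumes a trip is an (n, 2) NumPy array of x, y positions, and the names `point_line_distance`, `rdp` and `reduce_and_save` are hypothetical. The iterative form avoids hitting Python's recursion limit on long trips.

```python
import numpy as np


def point_line_distance(points, start, end):
    """Perpendicular distance of each 2-D point to the line through start and end."""
    line = end - start
    norm = np.hypot(line[0], line[1])
    rel = points - start
    if norm == 0.0:
        # Degenerate case (start == end): use the distance to the start point.
        return np.hypot(rel[:, 0], rel[:, 1])
    # Magnitude of the 2-D cross product divided by the segment length.
    return np.abs(line[0] * rel[:, 1] - line[1] * rel[:, 0]) / norm


def rdp(points, epsilon):
    """Simplify an (n, 2) array of points, keeping deviations larger than epsilon."""
    points = np.asarray(points, dtype=float)
    n = len(points)
    if n < 3:
        return points
    keep = np.zeros(n, dtype=bool)
    keep[0] = keep[-1] = True
    stack = [(0, n - 1)]  # index ranges that still need to be checked
    while stack:
        first, last = stack.pop()
        if last - first < 2:
            continue  # no interior points between first and last
        dists = point_line_distance(
            points[first + 1:last], points[first], points[last]
        )
        offset = int(np.argmax(dists))
        if dists[offset] > epsilon:
            # The farthest point deviates too much: keep it and split there.
            idx = first + 1 + offset
            keep[idx] = True
            stack.append((first, idx))
            stack.append((idx, last))
    return points[keep]


def reduce_and_save(trip, epsilon, path):
    """Reduce a trip with RDP and store the result as a .npy file (hypothetical helper)."""
    reduced = rdp(trip, epsilon)
    np.save(path, reduced)
    return reduced


# Small example: an L-shaped track collapses to its three corner points.
track = np.array([[0, 0], [1, 0], [2, 0], [2, 1], [2, 2]])
simplified = rdp(track, epsilon=0.5)
```

Larger epsilon values drop more points, so the choice of epsilon directly affects the angle feature computed afterwards; the value used here is only for illustration.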