Data Science BootCamp¶
Python Packages¶
- Pandas
- Data Frames
- Numpy
- Numberical Python
- Scipy
- Scientific Python
- Sklearn
- Neural Network library
- Matplotlib
- Matlab style plotting library
- Plotly
- Interactive plotting library
- Lasagne
- Neural Networks in Python
Data Modelling¶
k-nearest neighbours¶
Radial Neighbors kNearest Neighbors
Computationaly expensive Data Data needs to be normalised Performance Evaluation
Decision Trees¶
Decision Trees: Prone to overfitting Works with Categorical Works with unnormalised data Based on asking questions Cost is logarthimic Depth of tree is main issue