Group 1: Daan van der Valk and Sandesh Manganahalli Jayaprakash
Lab assignment 1 for Cyber Data Analytics, the TU Delft course.
- Task 1: visualisation: heatmap_categorical.py
- Task 2: imbalance (SMOTE): ROC_analysis.py
- Task 3: Decision Tree (white-box algorithm):
  - Parameter tuning: BestParams.py
  - Performance measurement: Scores-DecisionTree.py
- Task 4: Random Forest (black-box algorithm):
  - Parameter tuning: random_forest_parameters.py
  - Performance measurement: random_forest_performance.py
All included scripts should be run with Python 3. We used Python 3.6.4, but other Python 3 versions should also work.
The following packages are required. Install them with pip (pip install <package>) or Conda (conda install <package>), whichever you prefer:
- matplotlib
- scipy
- sklearn
- pydotplus
- graphviz (depending on the environment, also python-graphviz)
- imblearn
- joblib
- seaborn
- pandas
Some of these packages may already be installed.
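
To see which packages are still missing before running the scripts, a small check like the one below may help. It is only a sketch and not part of the assignment code: the helper name find_missing and the list REQUIRED_MODULES are made up for illustration, and the names are treated as Python import names. Note that for some of them the install name can differ from the import name (for example, sklearn is normally installed as scikit-learn and imblearn as imbalanced-learn).

```python
import importlib.util

# Import names taken from the package list above (assumption: these are
# the names the scripts actually import; adjust for your environment).
REQUIRED_MODULES = [
    "matplotlib", "scipy", "sklearn", "pydotplus", "graphviz",
    "imblearn", "joblib", "seaborn", "pandas",
]


def find_missing(modules):
    """Return the names in `modules` that cannot be located for import."""
    return [name for name in modules
            if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are available.")
```

Any module reported as missing can then be installed with pip or Conda as described above. The check only confirms that a module can be found, not that its version is compatible with the scripts.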