You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repo provides Julia based benchmarks for ML algo on tabular data.
It was developed to support both NeuroTabModels.jl and EvoTrees.jl projects.
Methodology
For each dataset and algo, the following methodology is followed:
Data is split in three parts: train, eval and test
A random grid of 16 hyper-parameters is generated
For each parameter configuration, a model is trained on train data until the evaluation metric tracked against the eval stops improving (early stopping)
The trained model is evaluated against the test data
The metric presented below are the ones obtained on the test for the model that generated the best eval metric.
Datasets
Datasets are now sourced from OpenML, usingOpenML: