- The
p1/1_data_exploration.ipynbfile contains all the necessary code for this section. - It uses the raw data (
train_val_split.csvandtest_split.csv) in thep1/datafolder and also saves processed data into the same folder (train_val.parquetandtest.parquet). - The file contains code for visualizing the feature distributions and processing the data.
- The code for the section is in the
p1/2_lasso_regression.ipynbfile. - It contains all the necessary functions but uses the previously processed data in the
p1/datafolder (train_val.parquet,test.parquet).
note: The part of the code that saves the model and the results on the test set is commented out.
- The
p1/3_multi-layer-perceptrons.ipynbfile contains all the necessary functions and classes for this part. - It also uses the processed data files in the
p1/datafolder (train_val.parquet,test.parquet).
note: The part of the code that saves the model and the results on the test set is commented out.
- The same way as previously, the
p1/4_neural_additive_models.ipynbfile contains all the necessary functions and classes for this part. - It also uses the processed data files in the
p1/datafolder (train_val.parquet,test.parquet).
note: The part of the code that saves the model and the results on the test set is, again, commented out.