This repository contains the analysis code to reproduce the results and plots from the main manuscript and supplementary information.
See pyproject.toml for complete dependency specifications. Most of the dependencies are installed with reafs.
uv sync
The reafs package is expected to be located in the parent directory. If it is located elsewhere, modify its path in pyproject.toml accordingly.
plot_cc_model_cards_1.py - Generate parts of the model "cards" for the featurization comparison as in the main manuscript
plot_cc_model_cards_SI_1.py - Supplementary information version of model cards for the featurization comparison
plot_data_diagno_1.py - Diagnostic plots for data analysis
plot_data_g23_outliers_1.py - Outlier analysis and visualization
plot_fs_common_1.py - Visualize common feature selections across analyses
plot_fs_sequential_extension_1.py - Sequential feature selection extension plots
plot_oa_prospect_1.py - Prospective analysis "cards" parts
plot_oa_prospect_SI_1.py - Supplementary information for the prospective analysis
svg_composer_feat.py - Compose the final featurization comparison plot from all the parts
svg_composer_prosp.py - Compose the final prospective analysis plot from all the parts
To run a specific analysis script:
uv run python script_name.py
Or,
python script_name.py
Results and generated figures are saved to the results/ directory.
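To regenerate everything in one go, the per-script command above can be wrapped in a small driver. The following is only a sketch and is not part of the repository: the helper names `ordered_scripts` and `run_all` are hypothetical, and it assumes that the plot_*.py scripts must finish before the svg_composer_*.py scripts, since the composers assemble the final figures from the parts.

```python
import subprocess
from pathlib import Path


def ordered_scripts(directory="."):
    """Return script names: plot_*.py first, then svg_composer_*.py.

    Assumption: the composer scripts need the parts produced by the
    plot scripts, so they are scheduled last.
    """
    root = Path(directory)
    plots = sorted(p.name for p in root.glob("plot_*.py"))
    composers = sorted(p.name for p in root.glob("svg_composer_*.py"))
    return plots + composers


def run_all(directory=".", launcher=("uv", "run", "python")):
    """Run every script in order and stop at the first failure."""
    for name in ordered_scripts(directory):
        print(f"Running {name}")
        # check=True raises CalledProcessError if a script exits non-zero
        subprocess.run([*launcher, name], cwd=directory, check=True)
```

Calling `run_all()` from the repository root would run each script through `uv run python`; passing `launcher=("python",)` would use the plain interpreter instead.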
This project was created as part of NCCR Catalysis (grant number 225147), a National Centre of Competence in Research funded by the Swiss National Science Foundation.