Digitalhub-tutorials

This repository provides documented scenarios that showcase how to use the platform. Each scenario folder contains:

Executable Jupyter Notebook
Project descriptor files and source code

In-depth descriptions of these scenarios, as well as more details on the platform, can be found in the documentation.

A short description of each scenario follows:

ETL (Extract, Transform, Load)

This scenario demonstrates how to collect traffic data, analyze and transform it, then expose the resulting dataset.
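The extract, transform and load steps can be sketched with the Python standard library alone. This is only an illustration of the pattern, not the scenario's actual code: the sample data, the column names (`gate`, `hour`, `vehicles`) and the three helper functions are all hypothetical, and no platform API is used.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample of raw traffic measurements (illustrative only).
RAW_CSV = """gate,hour,vehicles
A,08:00,120
A,09:00,95
B,08:00,40
B,09:00,
"""


def extract(text):
    """Extract: parse the raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: skip rows with a missing count and total the vehicles per gate."""
    totals = defaultdict(int)
    for row in rows:
        if row["vehicles"]:
            totals[row["gate"]] += int(row["vehicles"])
    return [{"gate": g, "total_vehicles": t} for g, t in sorted(totals.items())]


def load(records):
    """Load: serialize the result as CSV text, standing in for a stored dataset."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=["gate", "total_vehicles"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()


result = load(transform(extract(RAW_CSV)))
print(result)
```

Keeping the three steps as separate functions makes each one testable on its own and mirrors the way a pipeline would chain them.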

DBT (Data Build Tool) Scenario

This scenario demonstrates how to collect data about organizations, analyze and transform it, then expose the resulting dataset.

Scikit Learn Scenario

This scenario provides a quick overview of developing and deploying a scikit-learn machine learning application using the functionalities of the platform. We will prepare data, train a generic model, and expose it as a service.

MLflow Model Training and Serving

This scenario provides a quick overview of developing and deploying a machine learning application based on a model tracked with the MLflow framework, using the functionalities of the platform.

LLM Flow Model Training and Serving

This scenario demonstrates how to create and serve HuggingFace-compatible LLMs. A model can be served directly from the HuggingFace catalog, given its model id, or a fine-tuned model can be served from a specified path, such as an S3 location. The scenario uses a GPU and a profile defined by the cluster owner.

Custom ML Flow Model Training and Serving

This scenario provides a quick overview of developing and deploying generic machine learning applications using the functionalities of the platform. For this purpose, we use ML algorithms for time series management provided by the Darts framework.

Retrieval-Augmented Generation (RAG)

This scenario builds a RAG application, using a chat model and an embedding model, to provide a chatbot.
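The retrieval step of a RAG application can be illustrated with a toy example. In the sketch below, `embed` is a hypothetical stand-in that counts words instead of calling an embedding model, the documents are invented for illustration, and the resulting prompt is only printed rather than sent to a chat model. It shows the general pattern, not the scenario's implementation.

```python
import math
from collections import Counter

# Hypothetical knowledge base (illustrative only).
DOCUMENTS = [
    "Whisper is a model for speech-to-text recognition.",
    "Flower is a framework for federated learning.",
    "Frictionless validates tabular data such as CSV files.",
]


def embed(text):
    """Toy embedding: lowercase word counts (a real app would call an embedding model)."""
    words = [w.strip(".,?!").lower() for w in text.split()]
    return Counter(w for w in words if w)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question, documents, k=1):
    """Return the k documents most similar to the question."""
    q = embed(question)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(question, documents):
    """Augment the question with retrieved context before it goes to a chat model."""
    context = "\n".join(retrieve(question, documents))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


prompt = build_prompt("Which framework supports federated learning?", DOCUMENTS)
print(prompt)
```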

Whisper fine-tuning

This scenario demonstrates how to fine-tune Whisper, a model for speech-to-text recognition.

Flower federated learning

This scenario introduces the Flower federated learning framework, which runs federated learning tasks in which different client nodes perform local training and cooperatively build a more robust solution. The clients never exchange their data; they share only the model weights needed to advance the training process.
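The idea of sharing weights instead of data can be sketched in plain Python with weighted federated averaging. This does not use the Flower API: the one-parameter linear model, the client datasets, and the helpers `local_update` and `federated_average` are hypothetical and chosen only to keep the example small.

```python
def local_update(weights, data, lr=0.05):
    """One gradient-descent step for the model y = w * x on a client's private data."""
    w = weights[0]
    grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
    return [w - lr * grad]


def federated_average(updates):
    """Average the client weights, weighted by each client's number of samples."""
    total = sum(n for _, n in updates)
    size = len(updates[0][0])
    return [sum(w[i] * n for w, n in updates) / total for i in range(size)]


# Hypothetical private datasets; every point follows y = 2 * x.
CLIENT_DATA = [
    [(1.0, 2.0), (2.0, 4.0)],
    [(3.0, 6.0)],
]

global_weights = [0.0]
for _ in range(50):
    # Each client trains locally and returns only its weights and sample count.
    updates = [
        (local_update(global_weights, data), len(data)) for data in CLIENT_DATA
    ]
    # The server combines the weights; it never sees the raw data points.
    global_weights = federated_average(updates)

print(global_weights)
```

Only the lists returned by `local_update` cross the client boundary, which is the property the scenario description highlights.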

Data validation

This scenario implements a simple data validation function, which evaluates the correctness of a CSV table by leveraging an open source library, Frictionless.
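To give a feel for what validating a CSV table involves, here is a minimal sketch that uses only the Python standard library. It does not use Frictionless and is not the scenario's code: the schema, the sample tables and the `validate_csv` helper are hypothetical, and a library such as Frictionless covers many more checks.

```python
import csv
import io

# Hypothetical schema: column name -> type each value must parse as.
SCHEMA = {"id": int, "name": str, "score": float}

# Hypothetical sample tables (illustrative only).
GOOD = "id,name,score\n1,Ada,9.5\n2,Linus,8.0\n"
BAD = "id,name,score\n1,Ada,high\nx,,7.0\n"


def validate_csv(text, schema):
    """Return a list of error messages; an empty list means the table is valid."""
    reader = csv.DictReader(io.StringIO(text))
    if reader.fieldnames != list(schema):
        return [f"header mismatch: expected {list(schema)}, got {reader.fieldnames}"]
    errors = []
    # Data rows start at 2 because row 1 is the header.
    for row_no, row in enumerate(reader, start=2):
        for column, cast in schema.items():
            value = row[column]
            if value is None or value == "":
                errors.append(f"row {row_no}: missing value in '{column}'")
                continue
            try:
                cast(value)
            except ValueError:
                errors.append(
                    f"row {row_no}: {value!r} is not a valid {cast.__name__} in '{column}'"
                )
    return errors


print(validate_csv(GOOD, SCHEMA))
for message in validate_csv(BAD, SCHEMA):
    print(message)
```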

