Author: Muhammad Ehsan
Program: DevelopersHub Corporation AI/ML Engineering Internship
Welcome to my repository for the AI/ML Engineering Internship at DevelopersHub Corporation. This repository contains my implementations of various artificial intelligence and machine learning tasks, showcasing a progression of skills from fundamental data exploration to advanced predictive modeling and natural language processing.
These projects highlight my proficiency in exploratory data analysis (EDA), regression and classification algorithms, and the deployment of Large Language Models (LLMs) using modern prompt engineering and fine-tuning techniques.
developershub-ai-projects/
│
├── task1-iris-eda/ # Task 1: Exploratory Data Analysis
│ ├── explore_iris.py # Main EDA script
│ └── README.md
│
├── task2-stock-prediction/ # Task 2: Time-Series Forecasting
│ ├── predict_stocks.py # Stock prediction script
│ └── README.md
│
├── task3-heart-disease/ # Task 3: Binary Classification
│ ├── train_heart_model.py # Heart disease model training
│ ├── heart.csv # Dataset
│ └── README.md
│
├── task4-health-chatbot/ # Task 4: LLM-based API Chatbot
│ ├── chatbot.py # Agent LLM script
│ ├── Dockerfile # Containerization
│ └── README.md
│
├── task5-empathetic-chatbot/ # Task 5: LLM Fine-Tuning & RAG
│ ├── finetune_model.py # Model tuning script
│ ├── hybrid_chatbot.py # RAG and inference script
│ ├── app.py # Streamlit UI
│ └── README.md
│
├── task6-house-pricing/ # Task 6: Multivariable Regression
│ ├── main.py # House pricing model training
│ ├── test_model.py # Model evaluation
│ ├── house_prices.csv # Dataset
│ └── README.md
│
└── README.md # Root documentation
Here is a summary of the internship tasks completed in this repository:
-
Task 1: Exploring and Visualizing a Simple Dataset
- Domain: Data Analytics & Visualization
- Focus: Performing EDA on the standard Iris dataset to uncover data trends, distributions, and feature correlations using
pandas,matplotlib, andseaborn.
-
Task 2: Predict Future Stock Prices (Short-Term)
- Domain: Time-Series Forecasting & Regression
- Focus: Fetching historical stock data dynamically via
yfinanceand implementing machine learning models to predict short-term stock closing prices.
-
Task 3: Heart Disease Prediction
- Domain: Medical Diagnostics & Classification
- Focus: Developing robust binary classifiers (Logistic Regression and Decision Trees) to predict a patient's risk of heart disease based on clinical metrics.
-
Task 4: General Health Query Chatbot
- Domain: Conversational AI & Prompt Engineering
- Focus: Building a conversational agent leveraging external LLM APIs (OpenAI/Mistral), featuring strict safety guardrails to ethically answer health-related queries.
-
Task 5: Mental Health Support Chatbot
- Domain: LLM Fine-Tuning & Retrieval-Augmented Generation (RAG)
- Focus: Fine-tuning base models (DistilGPT2) on empathetic dialogue datasets to create a supportive mental health chatbot, augmented with vector database knowledge retrieval and presented via a Streamlit UI.
-
Task 6: House Price Prediction
- Domain: Multivariable Regression
- Focus: Creating an end-to-end ML pipeline to predict real estate values based on property features, including thorough feature scaling, model evaluation (MAE/RMSE), and persistent model saving.
Follow these instructions to clone the repository and run the projects locally on your machine.
- Python 3.8 or higher installed
pip(Python package installer)- Git
-
Clone the repository:
git clone https://github.com/MuhammadEhsan02/developershub-ai-projects.git cd developershub-ai-projects -
Navigate into a specific task directory: Each task operates independently. It is recommended to navigate into the task folder you wish to execute. For example:
cd task1-iris-eda -
Install the required dependencies: Many tasks require specific libraries such as
scikit-learn,pandas,transformers, orstreamlit. Check the individual task'sREADME.mdorpyproject.tomlfiles, or install the standard data science packages globally:pip install pandas numpy scikit-learn matplotlib seaborn yfinance transformers torch streamlit openai-agents faiss-cpu
-
Run the project script: Execute the primary Python file for the respective task. For example:
python explore_iris.py
(Please refer to the
README.mdinside each specific task folder for detailed execution commands, especially for Streamlit apps or Docker deployments.)