<[STEG-DETECTOR]>

The Future of Steganography Detection — Machine Learning powered, Wavelet-Enhanced, GUI-Driven

🚀 Recent Major Upgrade!

This project has undergone a significant overhaul for improved maintainability, performance, accuracy, and robust error handling. The core functionality has been modularized, and several critical issues have been fixed.

🌌 Vision & Inspiration

Steganography is not just about hiding information — it’s about the invisible war in cyberspace.
Traditional security tools often fail to detect hidden payloads inside images, videos, or signals.
That’s why Steg-Detector Pro was born: a futuristic, AI-driven, and research-grade framework that blends:

🧠 Machine Learning for smart detection
🌊 Wavelet Transform (PyWavelets) for frequency feature extraction
🖼️ Computer Vision (OpenCV) for image processing
🖥️ Beautiful GUI (Tkinter) for easy use
📊 Dataset Training so it keeps evolving with your data

⚡ Key Changes & Improvements (v3.0)

Modular Architecture: The entire codebase has been refactored into a src/ directory with logical submodules (src/logic, src/gui, src/utils). This greatly enhances maintainability and extensibility.
SSIM Bug Fix: Corrected a critical bug in the Structural Similarity Index (SSIM) feature extraction for images. Previously, SSIM was computed by comparing an image to itself, which always yields 1.0 and carries no information. It now compares the image to a slightly blurred copy, so the score reflects how much fine detail the blur removes, which is a more useful signal for detecting steganographic artifacts.
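
To illustrate why the self-comparison was uninformative, here is a minimal sketch using a simplified, single-window (global) SSIM and a 3x3 mean blur written in NumPy. The function names (`box_blur`, `global_ssim`, `ssim_vs_blur`) are hypothetical, and the project itself may use a windowed SSIM (for example from skimage) and a different blur.

```python
import numpy as np

def box_blur(img, k=3):
    """Blur a 2-D grayscale image with a k x k mean filter (edge-padded)."""
    pad = k // 2
    padded = np.pad(img.astype(np.float64), pad, mode="edge")
    out = np.zeros(img.shape, dtype=np.float64)
    # Sum the k*k shifted copies of the image, then average.
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out / (k * k)

def global_ssim(x, y, data_range=255.0):
    """Simplified SSIM computed over the whole image as one window."""
    x = x.astype(np.float64)
    y = y.astype(np.float64)
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + c1) * (2 * cov + c2)) / (
        (mx ** 2 + my ** 2 + c1) * (vx + vy + c2)
    )

def ssim_vs_blur(img):
    """Feature sketch: similarity between an image and its blurred copy."""
    return global_ssim(img, box_blur(img))
```

Comparing an image to itself with `global_ssim` gives 1.0 for any input, while `ssim_vs_blur` drops below 1.0 as the image contains more fine-grained noise.
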
Enhanced Audio Feature Extraction:
- MFCCs Integration: Added Mel-frequency cepstral coefficients (MFCCs) using librosa, significantly improving the accuracy of audio steganography detection.
- Performance Optimization: Optimized audio processing by calculating rfftfreq outside loops, reducing computational overhead.
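
The rfftfreq optimization can be sketched as follows. The frequency bins depend only on the frame length and sample rate, so they can be computed once before the frame loop. The spectral-centroid feature, the function name, and the frame parameters here are assumptions chosen for illustration, not the project's actual feature set.

```python
import numpy as np

def spectral_centroids(signal, sample_rate, frame_len=1024, hop=512):
    """Return one spectral centroid (in Hz) per frame of the signal."""
    # Hoisted out of the loop: identical for every frame of the same length.
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    centroids = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        magnitude = np.abs(np.fft.rfft(frame))
        total = magnitude.sum()
        # Guard against silent frames to avoid division by zero.
        centroids.append(float((freqs * magnitude).sum() / total) if total > 0 else 0.0)
    return centroids
```
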
Robust Dependency Handling: Implemented conditional imports for optional libraries (librosa, pywt, skimage, pydub). The application now logs warnings and gracefully falls back to alternative methods or defaults if these libraries are not installed, preventing crashes.
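
A minimal sketch of this conditional-import pattern is shown below. The helper `optional_import`, the logger name, and the `wavelet_features` function are hypothetical and only illustrate the idea; the actual code in src/ may be organized differently.

```python
import importlib
import logging

logger = logging.getLogger("steg_detector.deps")  # hypothetical logger name

def optional_import(module_name):
    """Import a module if available; log a warning and return None otherwise."""
    try:
        return importlib.import_module(module_name)
    except ImportError:
        logger.warning(
            "Optional dependency %r is not installed; related features are disabled.",
            module_name,
        )
        return None

pywt = optional_import("pywt")

def wavelet_features(signal):
    """Return per-band wavelet energies, or an empty list without PyWavelets."""
    if pywt is None:
        return []  # graceful fallback instead of crashing
    coeffs = pywt.wavedec(signal, "db1", level=2)
    return [float((band ** 2).sum()) for band in coeffs]
```
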
Centralized Configuration: All application-wide constants are now managed in src/config.py, making it easier to customize settings.
Improved Error Handling: Replaced generic except Exception blocks with more specific exception types across the codebase for clearer error reporting and more robust application behavior.
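
As a rough illustration of catching specific exception types, the sketch below reads a media file and reports each failure mode separately. The function name `read_media_bytes` is hypothetical, and the returned dictionary only mimics the `ok` / `error` keys used in the example section of this README.

```python
def read_media_bytes(path):
    """Read a file and report failures with specific, descriptive errors."""
    try:
        with open(path, "rb") as fh:
            data = fh.read()
    except FileNotFoundError:
        return {"ok": False, "path": path, "error": "file not found"}
    except PermissionError:
        return {"ok": False, "path": path, "error": "permission denied"}
    except IsADirectoryError:
        return {"ok": False, "path": path, "error": "path is a directory"}
    except OSError as exc:
        # Remaining I/O problems (device errors, invalid names, and so on).
        return {"ok": False, "path": path, "error": f"I/O error: {exc}"}
    return {"ok": True, "path": path, "size": len(data)}
```
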
Comprehensive Unit Tests: Added a new tests/ directory with unit tests for core functionalities, ensuring code reliability and preventing regressions.

🛠️ Installation

Clone the repository:

git clone https://github.com/0warn/STEG-Detector.git
cd STEG-Detector

Install dependencies:
Ensure you have Python 3.8+ and pip installed. Then, install all required libraries:
```
pip install -r requirement.txt
```
⚡ Note: Some optional dependencies like librosa and pywt (PyWavelets) might require system-level dependencies depending on your OS. The application will function without them, but certain features (MFCCs, Wavelet features) will be disabled.

🚀 Usage

Run the detector:
```
python3 main.py
```

⚠️ Crucial First Step: Retrain Your Model!

Saved models are tied to the scikit-learn version that produced them. Because of significant internal changes between scikit-learn versions, any previously trained model files (steg_detector_model.joblib) are incompatible with this updated version of STEG-Detector.

Upon first launch (or if an incompatible model exists), the application will start, but detection features will not work until a new model is trained and saved.
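
The startup behavior described above could look roughly like the following sketch, which returns None instead of crashing when the model file is missing or cannot be unpickled. The function name `load_model_or_none` and the exact set of caught exceptions are assumptions; the application's real loader may differ.

```python
import logging
import pickle
from pathlib import Path

try:
    import joblib
except ImportError:  # joblib missing: treat as "no model available"
    joblib = None

logger = logging.getLogger("steg_detector.model")  # hypothetical logger name

def load_model_or_none(path="steg_detector_model.joblib"):
    """Load a saved model, or return None if it is missing or incompatible."""
    model_file = Path(path)
    if joblib is None or not model_file.is_file():
        logger.warning("No usable model at %s; train and save one first.", model_file)
        return None
    try:
        return joblib.load(model_file)
    except (ValueError, AttributeError, ImportError, EOFError, pickle.UnpicklingError) as exc:
        # Typical symptoms of a model pickled under a different library version.
        logger.warning("Model at %s is incompatible (%s); please retrain.", model_file, exc)
        return None
```

A caller can then disable detection features whenever the result is None.
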

Steps to Retrain Your Model:

1. Launch the application (python3 main.py).
2. Navigate to the "Train / Evaluate" tab in the GUI.
3. Select your Dataset Source: Choose either "Folders (clean/stego)" or "CSV (path,label)".
   - If "Folders", browse to your root dataset directory (e.g., a folder containing clean/ and stego/ subfolders).
   - If "CSV", browse to your CSV file containing paths and labels.
4. Select Algorithm: Choose rf (RandomForest) or xgb (XGBoost, requires the xgboost package to be installed).
5. Click the "Train & Evaluate" button.
6. Once training is complete and the evaluation results are displayed, click the "Save Model" button. This saves a new, compatible model file (steg_detector_model.joblib) in your project's root directory.

After saving the new model, the application's detection capabilities will be fully functional.

GUI Options

🔍 Detect → Upload a media file (image, audio, or video) to check for hidden data.
📊 Train / Evaluate → Train new ML models with your datasets and evaluate their performance.
⚙️ Dataset → Access tools for generating synthetic stego datasets (useful for bootstrapping training data).
ℹ️ About / Updates → View application information and check for updates.

🧩 Dataset Training

Prepare your dataset:
- For "Folders" mode: Create a root directory containing two subfolders:
  - clean/ → Contains original, clean media files.
  - stego/ → Contains steganographically altered media files.
- For "CSV" mode: Create a CSV file where each row has the form path_to_media_file,label, with label 0 for clean and 1 for stego (e.g., image1.png,0 and stego_image1.png,1).
Use the "Train / Evaluate" tab in the GUI, select your source, and follow the retraining steps mentioned above.
The trained model is saved as steg_detector_model.joblib for future detection.
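
To make the two dataset layouts concrete, here is a small sketch that turns either one into a list of (path, label) pairs. It assumes subfolders named clean/ and stego/, matching the GUI's "Folders (clean/stego)" option, and labels 0 and 1 as described above. The function names are hypothetical and this is not the application's own loader.

```python
import csv
from pathlib import Path

FOLDER_LABELS = {"clean": 0, "stego": 1}  # assumed subfolder names

def samples_from_folders(root):
    """Collect (path, label) pairs from root/clean and root/stego."""
    root = Path(root)
    samples = []
    for folder, label in FOLDER_LABELS.items():
        for file in sorted((root / folder).rglob("*")):
            if file.is_file():
                samples.append((str(file), label))
    return samples

def samples_from_csv(csv_path):
    """Collect (path, label) pairs from a CSV with rows of path,label."""
    samples = []
    with open(csv_path, newline="") as fh:
        for row in csv.reader(fh):
            if len(row) < 2:
                continue  # skip blank or malformed rows
            path, label = row[0].strip(), row[1].strip()
            if label not in ("0", "1"):
                continue  # skip a header row or unknown labels
            samples.append((path, int(label)))
    return samples
```
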

🧑‍💻 Example

# Assuming you have trained and saved a model using the GUI.
# If you wish to use the detector programmatically:

from src.logic.detector import Detector

# Initialize the detector (it will attempt to load the model saved as steg_detector_model.joblib)
detector = Detector()

# Example detection:
result = detector.detect("path/to/your/sample_image.png")

if result.get('ok'):
    print(f"Detection Result for {result['path']}:")
    print(f"  Type: {result['type']}")
    print(f"  Heuristics Flag: {result['heuristics'].get('flag')}")
    if result.get('ml_prediction') is not None:
        prediction_label = "Stego Detected ✅" if result['ml_prediction'] == 1 else "Clean ❌"
        print(f"  ML Prediction: {prediction_label}")
    else:
        print("  ML Prediction: Model not loaded or available.")
else:
    print(f"Error detecting steganography: {result.get('error')}")
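
When scanning many files, the per-file results can be tallied with a small helper such as the sketch below. It relies only on the `ok` and `ml_prediction` keys shown in the example above; the function name `summarize_results` and the category names are hypothetical.

```python
def summarize_results(results):
    """Count outcomes across a list of detection result dictionaries."""
    summary = {"stego": 0, "clean": 0, "no_model": 0, "errors": 0}
    for result in results:
        if not result.get("ok"):
            summary["errors"] += 1
        elif result.get("ml_prediction") is None:
            summary["no_model"] += 1  # heuristics only, no ML verdict
        elif result["ml_prediction"] == 1:
            summary["stego"] += 1
        else:
            summary["clean"] += 1
    return summary
```

For example, `summarize_results([detector.detect(p) for p in paths])` would give a quick overview of a folder, assuming `paths` is a list of media file paths.
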

🩺 Error Handling & Fixes

| Error / Warning | Cause | Solution |
| --- | --- | --- |
| `ModuleNotFoundError: No module named 'pywt'` / `librosa` / `skimage` / `pydub` | Optional dependency not installed. | Run `pip install -r requirement.txt` (`pywt` is provided by the PyWavelets package and `skimage` by scikit-image). If installation still fails, check whether `librosa` or `pywt` needs system-level libraries on your OS. The application will still run, but the related features may be unavailable. |
| `ValueError: node array from the pickle has an incompatible dtype` (on startup or load) | Model file (`steg_detector_model.joblib`) was trained with an older `scikit-learn` version. | Retrain the model using the "Train / Evaluate" tab in the GUI, then click "Save Model". The new model will be compatible with your current environment. |
| GUI crashes on Linux (`_tkinter` error) | Tkinter is not installed for your Python interpreter. | On Debian/Ubuntu, run `sudo apt install python3-tk`. On other systems, consult your package manager. |
| Model not found (no detection results) | No model has been trained or saved yet, or the loaded model is incompatible. | Use the "Train / Evaluate" tab in the GUI to train a new model with your dataset, then click "Save Model". |

🌍 Future Roadmap

🎥 Video steganography detection (Initial support added)
🔊 Audio steganography analysis (Initial support and MFCC features added)
🔮 Deep Learning (CNN, ResNet) integration
🕸️ Web-based dashboard
📡 Real-time stego-sniffer for network traffic

🧑‍🚀 Author

👨‍💻 Developed with dedication by CODE 💡 "Because hidden data should never remain invisible."
