A semi-supervised approach for few-shot object detection using contrastive learning with YOLOv8.
SSL-YOLO pretrains the backbone of YOLOv8 models with self-supervised contrastive representation learning on unlabeled data, then fine-tunes the detector on a small labeled dataset for few-shot object detection.
- Self-supervised pretraining using contrastive learning
- Support for YOLOv8 model variants (n, s, m, l, x)
- Few-shot object detection capability
- Customizable data augmentation pipeline
- Based on Ultralytics v8.0.117 framework (modified the `ultralytics/yolo/engine/trainer.py` file to enable loading and freezing of the pretrained backbone)
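The loading-and-freezing step enabled by the trainer modification can be sketched roughly as follows. This is an illustrative sketch, not the repo's actual API: the function name and the `backbone_prefix` argument are hypothetical, and in a real YOLOv8 model the backbone corresponds to a range of layer indices rather than a single prefix.

```python
import torch

def load_and_freeze_backbone(model, ckpt_path, backbone_prefix="model.0."):
    """Load pretrained backbone weights into a detection model and freeze them.

    `backbone_prefix` is illustrative: in YOLOv8 the backbone spans the first
    several modules of `model.model`, so a real implementation would match a
    range of layer indices rather than one prefix.
    """
    state = torch.load(ckpt_path, map_location="cpu")
    # keep only the keys that belong to the backbone
    backbone_state = {k: v for k, v in state.items() if k.startswith(backbone_prefix)}
    model.load_state_dict(backbone_state, strict=False)
    # freeze the loaded parameters so fine-tuning only updates the remaining layers
    for name, p in model.named_parameters():
        if name.startswith(backbone_prefix):
            p.requires_grad = False
    return model
```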
```bash
git clone https://github.com/Rayen023/ssl-yolo.git
cd ssl-yolo
uv sync
```

Or install the dependencies with pip:

```bash
pip install -r requirements.txt
```

- Semi-Supervised Learning: Collect unlabeled images related to your domain
- Few-Shot Object Detection: Prepare a small dataset (~10 images per class) in YOLOv8 format
All parameters are configured in `config.yaml`. Note that the number of classes (`nc`) must match the value in your YOLOv8 dataset configuration.
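A minimal sanity check for the `nc` requirement might look like the following. The `nc` key follows the YOLO dataset-YAML convention; the exact layout of this repo's `config.yaml` may differ, so treat the key names as assumptions.

```python
import yaml  # PyYAML

def check_nc(config_path="config.yaml", data_path="data.yaml"):
    """Fail fast if the class count in the SSL config disagrees with the
    YOLOv8 dataset definition. Key names ('nc') follow the YOLO convention;
    the actual layout of config.yaml may differ."""
    with open(config_path) as f:
        cfg = yaml.safe_load(f)
    with open(data_path) as f:
        data = yaml.safe_load(f)
    if cfg["nc"] != data["nc"]:
        raise ValueError(
            f"nc mismatch: config has {cfg['nc']}, dataset has {data['nc']}"
        )
```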
```bash
python ssl_training.py
```

This script will:
- Train the backbone using contrastive learning on unlabeled data
- Save the pretrained backbone weights
- Fine-tune the model on your few-shot dataset with the backbone frozen
- Save the resulting model
- Data Augmentation: Each image undergoes two different random augmentations
- Feature Extraction & Projection: Both augmented versions pass through the backbone and are projected to a lower-dimensional space
- Contrastive Loss: NT-Xent loss pulls together features from the two views of the same image and pushes apart features from different images
- Backbone Transfer: The pretrained backbone is loaded into a YOLOv8 model
- Fine-tuning: The model is trained on a small labeled dataset (10-shot)
- Evaluation: The model is evaluated on the test set
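The NT-Xent loss used in the pretraining stage can be sketched in plain PyTorch. This is a minimal version of the SimCLR-style loss, not necessarily the repo's exact implementation:

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.5):
    """NT-Xent (normalized temperature-scaled cross-entropy) loss.

    z1, z2: (N, D) projections of two augmented views of the same N images.
    For each embedding, its counterpart from the other view is the positive;
    all other 2N - 2 embeddings in the batch are negatives.
    """
    n = z1.size(0)
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)  # (2N, D) unit vectors
    sim = z @ z.t() / temperature                       # scaled cosine similarities
    sim.fill_diagonal_(float("-inf"))                   # mask self-similarity
    # the positive for sample i is at index i + N (and vice versa)
    targets = torch.cat([torch.arange(n) + n, torch.arange(n)])
    return F.cross_entropy(sim, targets)
```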
- Dataset Selection: Use an unlabeled dataset contextually similar to your target domain
- Augmentation Strategy: Customize based on your specific use case
- Batch Size: Use the largest batch size your GPU memory allows
- Training Duration: Longer pretraining generally leads to better representations
- Learning Rate Scheduling: Adjust for optimal convergence
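On the learning-rate point, a common choice for contrastive pretraining (used by SimCLR) is linear warmup followed by cosine decay. The sketch below uses illustrative default values, not this repo's actual settings:

```python
import math

def cosine_with_warmup(step, total_steps, base_lr=1e-3,
                       warmup_steps=500, min_lr=1e-6):
    """Linear warmup to base_lr, then cosine decay down to min_lr.

    All hyperparameter values here are illustrative defaults.
    """
    if step < warmup_steps:
        # ramp up linearly over the warmup phase
        return base_lr * (step + 1) / warmup_steps
    # cosine decay over the remaining steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (base_lr - min_lr) * (1 + math.cos(math.pi * progress))
```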
We evaluated our methodology on the NEU-DET dataset in a 10-shot setting, systematically comparing against various Few-Shot Learning (FSL) representation paradigms. The performance, measured by Mean Average Precision (mAP@50), is summarized below:
| Strategy | Validation Paradigm | mAP@50 |
|---|---|---|
| ISS-NFT | In-Domain Self-Supervised pre-training & Novel-class Fine-Tuning* | 57.1% |
| ISS-FFT | In-Domain Self-Supervised pre-training & Full Fine-Tuning | 72.9% |
| CDT | Cross-Domain Transfer (pre-trained on COCO) | 32.8% |
* Evaluated on the FS-ND dataset split, SSL-YOLO improved the mAP@50 from a baseline of 0.127 to 0.571. Paper link.
```bibtex
@INPROCEEDINGS{11394884,
  author={Ghali, Rayen and Benhafid, Zhor and Selouani, Sid Ahmed},
  booktitle={2025 IEEE Smart World Congress (SWC)},
  title={Benchmarking Few-Shot Learning Techniques for Steel Surface Defect Detection},
  year={2025},
  pages={9-14},
  doi={10.1109/SWC65939.2025.00031}
}
```
- Based on the Ultralytics YOLOv8 implementation
- Contrastive learning approach based on SimCLR
