The official implementation of the MVMA framework.
- Multi-view data augmentation: generate local and global views of the same object via random cropping.
- Multi-data augmentation: apply different augmentation techniques (random and searched policies) to different views of an image.
- Configurable pipeline: easily define your data augmentation pipeline by specifying the desired transformations and their parameters.
- Batch processing: augment multiple images in parallel to speed up the data generation process.
- Compatibility: integrates with popular deep learning libraries such as PyTorch and PyTorch Lightning.
- Installation Requirement
- Configure Self-Supervised Pretraining
- Pretrained model
- Downstream Tasks
- Contributing
python -m venv myenv
source myenv/bin/activate
pip install -r requirements.txt
conda create --name myenv
conda activate myenv
conda install --file requirements.txt
NOTE: Pretraining is done on the ImageNet-1K dataset.
- Download the ImageNet-1K dataset (https://www.image-net.org/download.php), then unzip it following the original data file structure.
NOTE: All settings can be configured through the bash file.
Navigate to bash_files/pretrain/imagenet/MV_MA.sh.
1. Change the dataset directories according to your paths:
--train_dir ILSVRC2012/train \
--val_dir ILSVRC2012/val \
2. Set the number of global and local views:
--crop_size_glob 224 \ # Global View Crop Size
--num_crop_glob 2 \ # Number of Global Views
--crop_size_loc 96 \ # Local View Crop Size
--num_crop_loc 7 \ # Number of Local Views
3. Set the data augmentation policies:
- SimAug: SimCLR augmentation policies.
- RA: RandAugment (random augmentation) policies.
- FA: Fast AutoAugment policies.
- AA: AutoAugment policies.

Set the augmentation strategy flags for pretraining:
--num_augment_strategy SimCLR_FA \ # Select the combination of augmentation strategies
--num_augment_strategies 2 \ # Adjust the number of strategies to match the num_augment_strategy flag so it takes effect in the dataloader
4. Other hyperparameter settings
Use a large initial learning rate {0.2, 0.3} for short training schedules; this achieves better performance that could otherwise be hidden by the initialization if the learning rate is too small. For longer training schedules, use a smaller initial learning rate, around 0.2.

--max_epochs 100 \
--batch_size 256 \
--lr 0.2 \
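One common way to pick the learning rate across batch sizes is the linear-scaling heuristic; it is an assumption here that this repo follows it, so treat this as a rule of thumb rather than the project's documented behavior.

```python
# Linear learning-rate scaling heuristic (assumption: lr grows linearly
# with batch size from a base batch of 256, as in many SSL codebases).
def scaled_lr(base_lr, batch_size, base_batch=256):
    return base_lr * batch_size / base_batch

scaled_lr(0.2, 256)  # -> 0.2
scaled_lr(0.2, 512)  # -> 0.4
```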
5. Distributed training on a single node

Control the number of GPUs used by changing the --gpus flag:
--gpus 0,1,2,3,4,5,6,7 \ # GPU indices to use
--accelerator gpu \
--strategy ddp \ # Training strategy in PyTorch Lightning
**1. We open-sourced a total of 10 pretrained models:**
- Augmentation strategies: AA (AutoAugment), FA (Fast AutoAugment), RA (RandAugment), SimAug (SimCLR augmentation pipeline)
- The checkpoints are stored in Google Drive:
| Pre-trained Models | Width | Param (M) | Pretrained epochs | Augmentation Strategies | Number of Crops |
|---|---|---|---|---|---|
| ResNet50 (1x) | 1X | 24 | 100 | SimAug-FA | 2 View-(224), 4 View-(96) |
| ResNet50 (1x) | 1X | 24 | 100 | SimAug-AA-FA | 2 View-(224), 4 View-(96) |
| ResNet50 (1x) | 1X | 24 | 200 | SimAug-RA | 2 View-(224), 2 View-(96) |
| ResNet50 (1x) | 1X | 24 | 300 | SimAug-RA-FA | 2 View-(224), 4 View-(96) |
| ResNet50 (2x) | 2X | 94.0 | 100 | SimAug-RA | 2 View-(224), 3 View-(96) |
| ViT Small | 1X | 22.2 | 100 | SimAug-RA | 2 View-(224), 10 View-(96) |
| ViT Small | 1X | 22.2 | 100 | SimAug-RA-FA | 2 View-(224), 10 View-(96) |
| ViT Small | 1X | 22.2 | 100 | SimAug | 2 View-(224), 10 View-(96) |
| ViT Small | 1X | 22.2 | 200 | SimAug-RA | 2 View-(224), 10 View-(96) |
| ViT Small | 1X | 22.2 | 300 | SimAug-RA | 2 View-(224), 10 View-(96) |
2. Model performance monitored during training with an attached linear classification layer
- MVMA (ResNet-50) pretrained for 100 epochs on ImageNet-1K, compared with BYOL
- MVMA (ResNet-50, 2x wider) pretrained for 100 epochs on ImageNet-1K, compared with BYOL
- MVMA (ResNet-50) pretrained for 300 epochs on ImageNet-1K, compared with BYOL
- ViT Small
- ViT Base
3. Self-supervised pretraining logs
To fine-tune a linear head (with a single GPU), try the following command:
For fine-tuning a linear head on ImageNet using GPUs, first set CHKPT_DIR to the pretrained model directory and set a new MODEL_DIR, then use the following command:
Stay tuned! The instructions will be updated soon.
**Performance of Linear Evaluation on the ImageNet Validation Set**
Convolutional ResNet (ResNet-50)
You can access the 1% and 10% ImageNet subsets used for semi-supervised learning via TensorFlow Datasets: simply set dataset=imagenet2012_subset/1pct or dataset=imagenet2012_subset/10pct on the command line when fine-tuning on these subsets.
You can also find image IDs of these subsets in imagenet_subsets/.
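The image-ID files in imagenet_subsets/ can also be used directly to filter a local ImageNet copy; the one-ID-per-line file format assumed below follows the usual convention, so check the shipped files before relying on it.

```python
# Read an image-ID file from imagenet_subsets/ and filter local file paths.
# The one-ID-per-line format is an assumption; check the shipped files.
import os

def read_subset_ids(path):
    with open(path) as f:
        return {line.strip() for line in f if line.strip()}

def filter_subset(all_paths, ids):
    # keep only files whose basename appears in the subset ID list
    return [p for p in all_paths if os.path.basename(p) in ids]
```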
To fine-tune the whole network on ImageNet (1% of labels), refer to the following command:
Stay tuned! The instructions will be updated soon.
- [Solo-Learn SSL Library](https://github.com/vturrisi/solo-learn)






