Skip to content

[RFC] YOLO v5 #48

@oke-aditya

Description

@oke-aditya

🚀 Feature

Implement YOLO v5 from torch.hub.
This library removes such dataset abstraction and aims to provides a clean modular interface to models.

Some key points to note: -

  1. YOLO Achieves SOTA using Mosiac transforms as @zhiqwang mentioned.
  2. We don't need to define the model/weights/backbones, we can load using torch hub.
  3. We should provide an easy to use model definition, a simple train_step, val_step, fit API and a lightning trainer. Datasets, augmentations, transforms are not needed.

Note that none of quickvision models can achieve SOTA, limitations being torchvision's implementations and not using transforms/datasets. But they are faster, easier and flexible to train. Something which torchvision too does.

With this context, we can start adding YOLO v5.

Depedencies: -

  1. Avoid opencv-python at all costs. Opencv is not like PIL a library for image reading. It is huge and has lot of subdependencies. Keeping library light will enable us to use PyTorch Docker containers and directly infer using torchserve.

Evaluation mode: -
. We don't need .fuse() .eval()such methods. We only need to load the model fromtorch.hub.
Currently, we do not have inference scripts for any models, but surely in future #2 . So right now let's focus on training.

Metadata

Metadata

Labels

Medium PriorityShould be addressed in few daysModel to ImplementThis model should be implmentedfeatureA new feature request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions