Commit code to repositority for initial basic training runs

Dependent on #11 and #9.

Do as much of our testing on SmolLM2 135M/400M models before we scale up.

- Make sure rsLoRA, LoRA, DoRA works on the models we pick and we have the command to run it all
- Perform rigorous hyperparameter sweep to understand differences betwen rsLoRA, LoRA and DoRA. Don't have to be full runs and so can be done parallel with faster training pipelines being implemented. 
- We need tests that our LoRA/rsLoRA/DoRA training result matches torchtune/HF/etc.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit code to repositority for initial basic training runs #13

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Commit code to repositority for initial basic training runs #13

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions