Amazon Web Services
- 15.2k followers
- United States of America
- https://amazon.com/aws
- open-source-github@amazon.com
Repositories
Showing 10 of 541 repositories
- sagemaker-hyperpod-cluster-setup Public
This repository provides setup assets to create Amazon SageMaker HyperPod clusters orchestrated with either Slurm or Amazon EKS. These clusters help you quickly scale model development tasks such as training, fine-tuning, or inference across a cluster of hundreds or thousands of AI accelerators.
- aws-ofi-nccl Public
This plugin lets EC2 developers use libfabric as the network provider when running NCCL applications.