ray-project/ray Contributor
vllm-project/vllm Contributor
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A high-throughput and memory-efficient inference and serving engine for LLMs
Forked from vllm-project/vllm-omni
A framework for efficient model inference with omni-modality models
Python
Forked from rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
Jupyter Notebook