Pull requests: unixsysdev/llama-turboquant
ggml: add Vulkan and SYCL backends for TQ3_0 KV cache quantization
Labels: ggml, SYCL, Vulkan
#7 opened May 7, 2026 by metalchef1
feat: add PrismML Q1_0/Q1_0_G128 1-bit ternary quantization support
Labels: ggml, python
#6 opened Apr 1, 2026 by carlosfundora
feat: add flash attention support for TQ3_0 K-cache on CUDA/Blackwell
Labels: examples, ggml, Nvidia GPU
#4 opened Mar 28, 2026 by marinero2k