Github Repos 📦
Home
About
Back
1Cat-vLLM
1CatAI
vLLM fork for Tesla V100 (SM70) with AWQ 4-bit support, CUDA 12.8 build flow, and validated Qwen3.5 27B/35B deployment on multi-GPU V100.
Python
View on Github