first commit

This commit is contained in:
2026-05-28 07:21:15 +00:00
commit 6465520041
57 changed files with 942 additions and 0 deletions

View File

@ -0,0 +1,8 @@
# vllm-server
OpenAI-compatible model serving with vLLM.
The base is CPU-safe YAML. Add `components/gpu-nvidia` in environments that
provide NVIDIA GPUs, and let the instance overlay patch model name, resources,
and cache size.