Files
2026-05-28 07:21:41 +00:00
..
2026-05-28 07:21:41 +00:00
2026-05-28 07:21:41 +00:00
2026-05-28 07:21:41 +00:00

vllm-server

OpenAI-compatible model serving with vLLM.

The base is CPU-safe YAML. Add components/gpu-nvidia in environments that provide NVIDIA GPUs, and let the instance overlay patch model name, resources, and cache size.