Files
ocdp-workload-manifests/apps/vllm-server/README.md
2026-05-28 07:21:41 +00:00

9 lines
231 B
Markdown

# vllm-server
OpenAI-compatible model serving with vLLM.
The base is CPU-safe YAML. Add `components/gpu-nvidia` in environments that
provide NVIDIA GPUs, and let the instance overlay patch model name, resources,
and cache size.