first commit

2026-05-28 07:21:15 +00:00
commit 6465520041
57 changed files with 942 additions and 0 deletions
--- a/apps/vllm-server/README.md
+++ b/apps/vllm-server/README.md
@@ -0,0 +1,8 @@
+# vllm-server
+
+OpenAI-compatible model serving with vLLM.
+
+The base is CPU-safe YAML. Add `components/gpu-nvidia` in environments that
+provide NVIDIA GPUs, and let the instance overlay patch model name, resources,
+and cache size.
+