Home β€Ί LiteLLM Gateway

πŸ€– LiteLLM Gateway

Port 4000. Auth required. R1: never add a second instance.

Model Aliases

AliasBacking ModelVRAMUse Case
qwen-coder-fastqwen2.5-coder:7b4.7 GBDefault β€” Ralph loops, quick tasks
qwen-coder-deepqwen2.5-coder:14b-instruct9.0 GB (solo)Deep coding, architecture
deepseek-reasondeepseek-r1:14b9.0 GB (solo)Reasoning, debugging
embeddingsnomic-embed-text274 MBRAG embeddings
nim-deepseeknvidia/deepseek-v3.2 (NIM)CloudVRAM-saturated fallback
qwen3-coder-cloudqwen3-coder:480b-cloudCloudBest for Ralph scaffolding + tool-use

Adding a New Model

ollama pull <model-name>
# Edit ~/dryad-brain/litellm-config.yaml β€” never cat-append
nano ~/dryad-brain/litellm-config.yaml
cd ~/dryad-brain && docker compose restart dryad-litellm
curl http://localhost:4000/v1/models -H "Authorization: Bearer sk-dryad-master" | jq ".data[].id"