Connect to a local OpenAI-compatible endpoint and chat with tools.
Configure OpenAI-compatible /v1 endpoints, e.g. the llama.cpp server, vLLM, LM Studio, Ollama (via /v1), or the OpenAI API itself.
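All of these servers speak the same wire format, so a client only needs the base URL. A minimal sketch of building a `/v1/chat/completions` request body (the `localhost:8080` port and `local-model` name are assumptions, not values from this app):

```python
import json

# Hypothetical local endpoint; adjust host/port to whichever server you run.
BASE_URL = "http://localhost:8080/v1"

def chat_request(messages, model="local-model", tools=None):
    """Build an OpenAI-style /v1/chat/completions URL and JSON body."""
    body = {"model": model, "messages": messages}
    if tools:
        # OpenAI-style tool definitions, passed through unchanged
        body["tools"] = tools
    return f"{BASE_URL}/chat/completions", json.dumps(body)

url, payload = chat_request([{"role": "user", "content": "Hello"}])
```

POSTing `payload` to `url` with `Content-Type: application/json` is all an OpenAI-compatible server expects, which is why any of the backends listed above can be swapped in behind the same base URL.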
/v1
MCP servers run locally on this machine over stdio. They expose tools the model can call mid-response.
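Under the stdio transport, the client spawns the server process and exchanges newline-delimited JSON-RPC 2.0 messages over its stdin/stdout. A sketch of the `tools/call` request a client sends when the model invokes a tool (the `weather.lookup` tool name and its arguments are hypothetical examples, not tools this app ships):

```python
import json

def tool_call(request_id, name, arguments):
    """Build a JSON-RPC 2.0 "tools/call" request in MCP's wire format."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    })

# One message per line on the server's stdin; the matching result
# arrives on its stdout with the same "id".
msg = tool_call(1, "weather.lookup", {"city": "Berlin"})
```

Because the transport is just the child process's pipes, no ports or network configuration are involved; the app only needs the command used to launch each server.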