Install HTTP Server (Docker)¶
Run the HolmesGPT HTTP API server locally using Docker Compose — no Kubernetes required.
To deploy the HTTP server on Kubernetes, see the Helm Chart instead.
Prerequisites¶
- Docker and Docker Compose
- Supported AI Provider API key
Installation¶
-
Clone the repository (or just download
docker-compose.yaml): -
Set your API key:
-
Start the server:
-
Verify it's running:
The API is available at http://localhost:5050.
Configuration¶
Edit docker-compose.yaml to configure your setup:
- LLM provider: Uncomment the environment variables for your provider (Anthropic, Gemini, Azure, AWS Bedrock)
- Kubernetes access: The compose file mounts
~/.kube/configso Holmes can query your cluster - Cloud credentials: AWS and GCloud credential directories are mounted read-only
- Holmes config:
~/.holmesis mounted for custom configuration
Kubeconfig with localhost clusters
If your kubeconfig points to 127.0.0.1 or localhost (common with Docker Desktop, minikube, kind), the container automatically rewrites the Kubernetes API server address to host.docker.internal on startup so the cluster is reachable. Remote clusters (EKS, GKE, AKS, etc.) are not affected.
API Reference¶
See the HTTP API Reference for full documentation on available endpoints, request/response formats, and usage examples.
Next Steps¶
Customize Holmes Settings¶
The Docker Compose file mounts ~/.holmes into the container. Create ~/.holmes/config.yaml to customize Holmes behavior:
# Change the LLM model
model: "anthropic/claude-sonnet-4-5-20250929"
# Limit the number of tool-calling steps per investigation
max_steps: 100
# Enable a builtin integration (e.g. Confluence)
toolsets:
confluence:
enabled: true
config:
api_url: "https://yourcompany.atlassian.net"
user: "your-email@example.com"
api_key: "your-api-token"
For configuring additional data sources, see Toolset Configuration.
After editing, restart the container to apply changes:
For the full list of environment variables and options, see the Environment Variables reference.
Learn More¶
- HTTP API Reference — Full API documentation
- Helm Chart — Deploy the HTTP server on Kubernetes
- CLI Installation — Run HolmesGPT as a command-line tool instead