HolmesGPT Not Finding Any Issues? Here's Why.¶
1. Truncation: Too Much Data¶
Data overflow causes important information to be truncated. See #437 for summarization improvements.
Solution:
- Use specific namespaces and time ranges
- Target individual components instead of cluster-wide queries
2. Missing Data Access¶
HolmesGPT can't access logs, metrics, or traces from your observability stack.
Solution:
- Verify toolset configuration connects to Prometheus/Grafana/logs
- Test connectivity:
kubectl exec -it <holmes-pod> -- curl http://prometheus:9090/api/v1/query?query=up
3. Unclear Prompts¶
Vague questions produce poor results.
Bad:
- "Why is my pod not working?"
- "Check if anything is wrong with my cluster"
- "Something is broken in production and users are complaining"
- "My deployment keeps failing but I don't know why"
- "Can you debug this issue I'm having with my application?"
Good:
- "Why is payment-service pod restarting in production namespace?"
- "What caused memory spike in web-frontend deployment last hour?"
5. Model Issues¶
Older LLM models lack reasoning capability for complex problems.
Solution:
Recommended Models:
anthropic/claude-opus-4-1-20250805- Most powerful for complex investigations (recommended)anthropic/claude-sonnet-4-20250514- Superior reasoning with faster performancegpt-4.1- Good balance of speed/capability
See benchmark results for detailed model performance comparisons.
6. Extra inputs are not permitted Errors From the LLM Provider¶
Some providers reject messages that contain fields they don't recognize, producing errors like:
litellm.BadRequestError: OpenAIException - messages.1.provider_specific_fields: Extra inputs are not permitted
This happens when LiteLLM attaches provider-specific metadata (e.g. provider_specific_fields) to assistant messages and those messages are later sent back to a provider that doesn't accept the field.
Solution: Set LLM_EXTRA_STRIP_MESSAGE_FIELDS to a comma-separated list of fields to strip before sending:
Replace the value with whichever field is named in your error message. Multiple fields can be passed, e.g. "provider_specific_fields,reasoning_content".
Still stuck?¶
Join our Slack community or open a GitHub issue for help.