Skip to content

Self-Hosted Models v1

Generated: 2025-10-08 05:37 UTC

Total Duration: 5h 38m 14s

Iterations: 1

Judge (classifier) model: gpt-4o

About this Benchmark

HolmesGPT is continuously evaluated against real-world Kubernetes and cloud troubleshooting scenarios.

If you find scenarios that HolmesGPT does not perform well on, please consider adding them as evals to the benchmark.

Model Accuracy Comparison

Model Pass Fail Skip/Error Total Success Rate
gpt-4.1 59 30 16 105 🟡 66% (59/89)
novita/deepseek/deepseek-v3.1-terminus 73 14 18 105 🟡 84% (73/87)
novita/deepseek/deepseek-v3.2-exp 44 44 17 105 🟡 50% (44/88)
novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 0 89 16 105 🔴 0% (0/89)
novita/zai-org/glm-4.6 2 87 16 105 🟡 2% (2/89)
sonnet-4-20250514 74 13 18 105 🟡 85% (74/87)

Model Cost Comparison

Model Tests Avg Cost Min Cost Max Cost Total Cost
gpt-4.1 81 $0.13 $0.02 $0.57 $10.50
sonnet-4-20250514 84 $0.19 $0.06 $0.84 $16.12

Model Latency Comparison

Model Avg (s) Min (s) Max (s) P50 (s) P95 (s)
gpt-4.1 43.6 0.9 346.5 41.0 81.4
novita/deepseek/deepseek-v3.1-terminus 69.4 12.6 673.4 59.7 106.1
novita/deepseek/deepseek-v3.2-exp 80.0 12.3 635.0 53.9 189.4
novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 12.4 12.4 12.4 12.4 12.4
novita/zai-org/glm-4.6 19.4 6.4 34.1 19.5 26.2
sonnet-4-20250514 52.9 12.5 118.7 50.5 99.9

⚠️ Note: 90 test(s) excluded from latency calculations due to throttling/timeout errors (novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8: 88, sonnet-4-20250514: 2)

Performance by Tag

Success rate by test category and model:

Tag gpt-4.1 novita/deepseek/deepseek-v3.1-terminus novita/deepseek/deepseek-v3.2-exp novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 novita/zai-org/glm-4.6 sonnet-4-20250514 Warnings
chain-of-causation 🟡 25% (¼) 🟡 25% (¼) 🔴 0% (0/4) 🔴 0% (0/4) 🔴 0% (0/4) 🟡 50% (2/4) ⚠️ 24 skipped
context_window 🟡 50% (3/6) 🟡 67% (4/6) 🟡 50% (3/6) 🔴 0% (0/6) 🔴 0% (0/6) 🟡 50% (3/6) ⚠️ 6 skipped
counting 🟢 100% (4/4) 🟡 75% (¾) 🟡 75% (¾) 🔴 0% (0/4) 🔴 0% (0/4) 🟢 100% (4/4)
database 🔴 0% (0/1) 🟢 100% (1/1) 🔴 0% (0/1) 🔴 0% (0/1) 🔴 0% (0/1) 🟢 100% (1/1) ⚠️ 18 skipped
datadog 🔴 0% (0/4) 🟡 75% (¾) 🟡 75% (¾) 🔴 0% (0/4) 🔴 0% (0/4) 🟡 75% (¾)
datetime 🟡 40% (⅖) 🟡 75% (¾) 🟡 60% (⅗) 🔴 0% (0/5) 🟡 20% (⅕) 🟡 50% (2/4) ⚠️ 8 skipped
easy 🟡 86% (30/35) 🟡 97% (34/35) 🟡 66% (23/35) 🔴 0% (0/35) 🟡 3% (1/35) 🟡 97% (34/35) ⚠️ 6 skipped
hard 🟡 27% (3/11) 🟡 64% (7/11) 🟡 18% (2/11) 🔴 0% (0/11) 🔴 0% (0/11) 🟡 73% (8/11) ⚠️ 54 skipped
kafka ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚠️ 12 skipped
kubernetes 🟡 78% (32/41) 🟡 85% (35/41) 🟡 49% (20/41) 🔴 0% (0/41) 🔴 0% (0/41) 🟡 88% (36/41) ⚠️ 42 skipped
logs 🟡 67% (16/24) 🟡 83% (19/23) 🟡 46% (11/24) 🔴 0% (0/24) 🟡 4% (1/24) 🟡 78% (18/23) ⚠️ 56 skipped
medium 🟡 60% (26/43) 🟡 78% (32/41) 🟡 45% (19/42) 🔴 0% (0/43) 🟡 2% (1/43) 🟡 78% (32/41) ⚠️ 41 skipped
network 🟡 50% (2/4) 🟢 100% (4/4) 🟡 50% (2/4) 🔴 0% (0/4) 🔴 0% (0/4) 🟢 100% (4/4)
no-cicd ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚠️ 6 skipped
numerical 🟢 100% (1/1) 🟢 100% (1/1) 🔴 0% (0/1) 🔴 0% (0/1) 🔴 0% (0/1) 🟢 100% (1/1)
port-forward 🟡 50% (½) 🟡 50% (½) 🟡 50% (½) 🔴 0% (0/2) 🔴 0% (0/2) 🟡 50% (½) ⚠️ 42 skipped
prometheus 🟡 50% (½) 🟡 50% (½) 🟡 50% (½) 🔴 0% (0/2) 🔴 0% (0/2) 🟡 50% (½) ⚠️ 12 skipped
question-answer 🟢 100% (4/4) 🟡 75% (¾) 🟡 25% (¼) 🔴 0% (0/4) 🔴 0% (0/4) 🟢 100% (4/4)
runbooks 🟡 67% (4/6) 🟡 83% (⅚) 🟡 33% (2/6) 🔴 0% (0/6) 🔴 0% (0/6) 🟢 100% (6/6) ⚠️ 6 skipped
slackbot 🔴 0% (0/1) ⚪️ - ⚪️ - 🔴 0% (0/1) 🔴 0% (0/1) ⚪️ - ⚠️ 3 skipped
traces 🔴 0% (0/3) 🔴 0% (0/3) 🔴 0% (0/3) 🔴 0% (0/3) 🔴 0% (0/3) 🟡 33% (⅓) ⚠️ 12 skipped
transparency 🟡 64% (9/14) 🟡 71% (10/14) 🟡 43% (6/14) 🔴 0% (0/14) 🟡 7% (1/14) 🟡 64% (9/14) ⚠️ 6 skipped
Overall 🟡 66% (59/89) 🟡 84% (73/87) 🟡 50% (44/88) 🔴 0% (0/89) 🟡 2% (2/89) 🟡 85% (74/87) ⚠️ 101 skipped

Raw Results

Status of all evaluations across models. Color coding:

  • 🟢 Passing 100% (stable)
  • 🟡 Passing 1-99%
  • 🔴 Passing 0% (failing)
  • 🔧 Mock data failure (missing or invalid test data)
  • ⚠️ Setup failure (environment/infrastructure issue)
  • ⏱️ Timeout or rate limit error
  • ⏭️ Test skipped (e.g., known issue or precondition not met)
Eval ID gpt-4.1 novita/deepseek/deepseek-v3.1-terminus novita/deepseek/deepseek-v3.2-exp novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 novita/zai-org/glm-4.6 sonnet-4-20250514
01_how_many_pods 🔗 🟢 🔴 🟢 ⏱️ 🔴 🟢
02_what_is_wrong_with_pod 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
03_what_is_the_command_to_port_forward 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
04_related_k8s_events 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
05_image_version 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
08_sock_shop_frontend 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
09_crashpod 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
100a_historical_logs 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
100b_historical_logs_nonstandard_label 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
101_historical_logs_pod_deleted 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
103_logs_transparency_default_limit 🔗 🔴 🔴 🔴 ⏱️ 🔴 🔴
104a_postgres_root_issue 🔗 🔴 🟢 🔴 ⏱️ 🔴 🟢
104b_postgres_missing_index_pgstat 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
104c_postgres_minimal_missing_index 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
105_redis_wrong_data_structure 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
107_log_filter_http_status_code 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
108_logs_nearby_lines 🔗 🔴 🔴 🔴 ⏱️ 🔴 🟢
109_logs_transparency_not_found 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
10_image_pull_backoff 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
110_k8s_events_image_pull 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
111_disabled_datadog_traces 🔗 🔴 🔴 🔴 ⏱️ 🔴 🔴
111_pod_names_contain_service 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
112_find_pvcs_by_uuid 🔗 🔴 🔴 🔴 ⏱️ 🔴 🟢
114_checkout_latency_tracing_rebuild[0] 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
115_checkout_errors_tracing[0] 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
11_init_containers 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
121_new_relic_checkout_errors_tracing[0] 🔗 🔴 🔴 🔴 ⏱️ 🔴 🔴
122_new_relic_checkout_latency_tracing_rebuild[0] 🔗 🔴 🔴 🔴 ⏱️ 🔴 🟢
123_new_relic_checkout_errors_tracing[0] 🔗 🔴 🔴 🔴 ⏱️ 🔴 🔴
12_job_crashing 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
13a_pending_node_selector_basic 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
13b_pending_node_selector_detailed 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
14_pending_resources 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
156_kafka_opensearch_latency 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
159_prometheus_high_cardinality_cpu[0] 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
159_prometheus_high_cardinality_cpu[1] 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
159_prometheus_high_cardinality_cpu[2] 🔗 🔴 🔴 🔴 ⏱️ 🔴 🔴
15_failed_readiness_probe 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
16_failed_no_toolset_found 🔗 🔴 🔴 🟢 ⏱️ 🔴 🔴
17_oom_kill 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
19_detect_missing_app_details 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
20_long_log_file_search 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
21_job_fail_curl_no_svc_account 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
22_high_latency_dbi_down 🔗 ⚠️ ⚠️ ⚠️ ⚠️ ⚠️ ⚠️
23_app_error_in_current_logs 🔗 🟢 🟢 🟢 ⏱️ 🔴 🔴
24_misconfigured_pvc 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
24a_misconfigured_pvc_basic 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
24b_misconfigured_pvc_detailed 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
25_misconfigured_ingress_class 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
26_page_render_times 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
27a_multi_container_logs 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
27b_multi_container_logs 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
28_permissions_error 🔗 🔴 🟢 🔴 ⏱️ 🔴 🔴
33_cpu_metrics_discovery 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
39_failed_toolset 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
41_setup_argo 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
42_dns_issues_result_new_tools_no_runbook 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
42_dns_issues_steps_new_tools 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
43_current_datetime_from_prompt 🔗 🟢 🟢 🟢 ⏱️ 🟢 🟢
43_slack_deployment_logs 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
44_slack_statefulset_logs 🔗 🔴 🔧 🔧 ⏱️ 🔴 🔧
45_fetch_deployment_logs_simple 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
48_logs_since_thursday 🔗 🔴 🔧 🔴 ⏱️ 🔴 🔧
50_logs_since_specific_date 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
50a_logs_since_last_specific_month 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
51_logs_summarize_errors 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
52_logs_login_issues 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
53_logs_find_term 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
54_not_truncated_when_getting_pods 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
55_kafka_runbook 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
57_wrong_namespace 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
59_label_based_counting 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
60_count_less_than 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
61_exact_match_counting 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
62_fetch_error_logs_with_errors 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
63_fetch_error_logs_no_errors 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
64_keda_vs_hpa_confusion 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
65_health_check_followup 🔗 🟢 🟢 🔴 ⏱️ 🔴 🔴
71_connection_pool_starvation 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
73a_time_window_anomaly 🔗 🔴 🔴 🟢 ⏱️ 🔴 🔴
73b_time_window_anomaly 🔗 🔴 🟢 🔴 ⏱️ 🔴 🔴
76_service_discovery_issue 🔗 🟢 🟢 🔴 ⏱️ 🔴 ⏱️
77_liveness_probe_misconfiguration 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
78a_missing_cpu_limits 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
78b_cpu_quota_exceeded 🔗 🔴 🔴 🟢 ⏱️ 🔴 🟢
79_configmap_mount_issue 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
80_pvc_storage_class_mismatch 🔗 🔴 🟢 🔴 ⏱️ 🔴 🟢
81_service_account_permission_denied 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
82_pod_anti_affinity_conflict 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
83_secret_not_found 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
84_network_policy_blocking_traffic 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
85_hpa_not_scaling 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
86_configmap_like_but_secret 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
89_runbook_missing_cloudwatch 🔗 🟢 🔴 🔴 ⏱️ 🔴 🟢
90_runbook_basic_selection 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
91f_datadog_logs_historical_pod 🔗 🔴 🔴 🔴 🔴 🔴 🔴
93_calling_datadog[0] 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
93_calling_datadog[1] 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
93_calling_datadog[2] 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
93_events_since_specific_date 🔗 🔴 🟢 🟢 ⏱️ 🔴 🟢
94_runbook_transparency 🔗 🟢 🟢 🔴 ⏱️ 🔴 🟢
96_no_matching_runbook 🔗 🔴 🟢 🔴 ⏱️ 🔴 🟢
97_logs_clarification_needed 🔗 🟢 🟢 🟢 ⏱️ 🟢 🟢
98_logs_transparency_default_time 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
99_logs_transparency_custom_time 🔗 🟢 🟢 🟢 ⏱️ 🔴 🟢
SUMMARY 🟡 66% (59/89) 🟡 84% (73/87) 🟡 50% (44/88) 🔴 0% (0/89) 🟡 2% (2/89) 🟡 85% (74/87)

Detailed Raw Results

Eval ID gpt-4.1 novita/deepseek/deepseek-v3.1-terminus novita/deepseek/deepseek-v3.2-exp novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 novita/zai-org/glm-4.6 sonnet-4-20250514
01_how_many_pods 🔗 🟢 100% (1/1) / ⏱️ 41.5s / 💰 $0.05 🔴 0% (0/1) / ⏱️ 30.0s 🟢 100% (1/1) / ⏱️ 35.6s ⏱️ 0% (0/1) / ⏱️ 1395.5s 🔴 0% (0/1) / ⏱️ 17.9s 🟢 100% (1/1) / ⏱️ 27.9s / 💰 $0.08
02_what_is_wrong_with_pod 🔗 🟢 100% (1/1) / ⏱️ 37.2s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 59.8s 🔴 0% (0/1) / ⏱️ 24.5s ⏱️ 0% (0/1) / ⏱️ 1393.4s 🔴 0% (0/1) / ⏱️ 21.9s 🟢 100% (1/1) / ⏱️ 43.3s / 💰 $0.11
03_what_is_the_command_to_port_forward 🔗 🟢 100% (1/1) / ⏱️ 42.4s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 673.4s 🔴 0% (0/1) / ⏱️ 24.0s ⏱️ 0% (0/1) / ⏱️ 1465.4s 🔴 0% (0/1) / ⏱️ 19.0s 🟢 100% (1/1) / ⏱️ 38.0s / 💰 $0.12
04_related_k8s_events 🔗 🟢 100% (1/1) / ⏱️ 44.6s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 40.4s 🔴 0% (0/1) / ⏱️ 22.3s ⏱️ 0% (0/1) / ⏱️ 1392.1s 🔴 0% (0/1) / ⏱️ 17.8s 🟢 100% (1/1) / ⏱️ 38.9s / 💰 $0.09
05_image_version 🔗 🟢 100% (1/1) / ⏱️ 28.8s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 39.9s 🔴 0% (0/1) / ⏱️ 22.3s ⏱️ 0% (0/1) / ⏱️ 1391.5s 🔴 0% (0/1) / ⏱️ 17.0s 🟢 100% (1/1) / ⏱️ 36.4s / 💰 $0.09
08_sock_shop_frontend 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
09_crashpod 🔗 🟢 100% (1/1) / ⏱️ 35.3s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 57.9s 🟢 100% (1/1) / ⏱️ 67.9s ⏱️ 0% (0/1) / ⏱️ 1395.4s 🔴 0% (0/1) / ⏱️ 26.2s 🟢 100% (1/1) / ⏱️ 49.4s / 💰 $0.13
100a_historical_logs 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
100b_historical_logs_nonstandard_label 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
101_historical_logs_pod_deleted 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
103_logs_transparency_default_limit 🔗 🔴 0% (0/1) / ⏱️ 54.8s / 💰 $0.40 🔴 0% (0/1) / ⏱️ 68.7s 🔴 0% (0/1) / ⏱️ 23.8s ⏱️ 0% (0/1) / ⏱️ 1393.1s 🔴 0% (0/1) / ⏱️ 17.0s 🔴 0% (0/1) / ⏱️ 51.3s / 💰 $0.41
104a_postgres_root_issue 🔗 🔴 0% (0/1) / ⏱️ 58.1s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 78.3s 🔴 0% (0/1) / ⏱️ 27.1s ⏱️ 0% (0/1) / ⏱️ 1392.6s 🔴 0% (0/1) / ⏱️ 18.8s 🟢 100% (1/1) / ⏱️ 99.9s / 💰 $0.44
104b_postgres_missing_index_pgstat 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
104c_postgres_minimal_missing_index 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
105_redis_wrong_data_structure 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
107_log_filter_http_status_code 🔗 🟢 100% (1/1) / ⏱️ 46.2s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 126.5s 🔴 0% (0/1) / ⏱️ 31.2s ⏱️ 0% (0/1) / ⏱️ 1393.0s 🔴 0% (0/1) / ⏱️ 19.6s 🟢 100% (1/1) / ⏱️ 74.1s / 💰 $0.20
108_logs_nearby_lines 🔗 🔴 0% (0/1) / ⏱️ 41.6s / 💰 $0.14 🔴 0% (0/1) / ⏱️ 100.2s 🔴 0% (0/1) / ⏱️ 25.5s ⏱️ 0% (0/1) / ⏱️ 1397.4s 🔴 0% (0/1) / ⏱️ 22.8s 🟢 100% (1/1) / ⏱️ 83.5s / 💰 $0.26
109_logs_transparency_not_found 🔗 🟢 100% (1/1) / ⏱️ 72.1s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 58.8s 🔴 0% (0/1) / ⏱️ 23.7s ⏱️ 0% (0/1) / ⏱️ 1398.7s 🔴 0% (0/1) / ⏱️ 19.1s 🟢 100% (1/1) / ⏱️ 36.2s / 💰 $0.09
10_image_pull_backoff 🔗 🟢 100% (1/1) / ⏱️ 34.9s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 55.0s 🔴 0% (0/1) / ⏱️ 24.3s ⏱️ 0% (0/1) / ⏱️ 1466.6s 🔴 0% (0/1) / ⏱️ 19.9s 🟢 100% (1/1) / ⏱️ 58.9s / 💰 $0.12
110_k8s_events_image_pull 🔗 🟢 100% (1/1) / ⏱️ 27.8s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 56.4s 🔴 0% (0/1) / ⏱️ 47.0s ⏱️ 0% (0/1) / ⏱️ 1392.6s 🔴 0% (0/1) / ⏱️ 18.5s 🟢 100% (1/1) / ⏱️ 38.2s / 💰 $0.10
111_disabled_datadog_traces 🔗 🔴 0% (0/1) / ⏱️ 31.0s / 💰 $0.03 🔴 0% (0/1) / ⏱️ 74.7s 🔴 0% (0/1) / ⏱️ 208.5s ⏱️ 0% (0/1) / ⏱️ 1393.3s 🔴 0% (0/1) / ⏱️ 19.2s 🔴 0% (0/1) / ⏱️ 68.2s / 💰 $0.14
111_pod_names_contain_service 🔗 🟢 100% (1/1) / ⏱️ 66.5s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 83.6s 🟢 100% (1/1) / ⏱️ 99.4s ⏱️ 0% (0/1) / ⏱️ 1391.8s 🔴 0% (0/1) / ⏱️ 19.2s 🟢 100% (1/1) / ⏱️ 85.8s / 💰 $0.19
112_find_pvcs_by_uuid 🔗 🔴 0% (0/1) / ⏱️ 47.2s / 💰 $0.07 🔴 0% (0/1) / ⏱️ 54.4s 🔴 0% (0/1) / ⏱️ 111.5s ⏱️ 0% (0/1) / ⏱️ 1393.4s 🔴 0% (0/1) / ⏱️ 18.0s 🟢 100% (1/1) / ⏱️ 43.4s / 💰 $0.13
114_checkout_latency_tracing_rebuild[0] 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
115_checkout_errors_tracing[0] 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
11_init_containers 🔗 🟢 100% (1/1) / ⏱️ 40.8s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 68.4s 🟢 100% (1/1) / ⏱️ 91.3s ⏱️ 0% (0/1) / ⏱️ 1397.4s 🔴 0% (0/1) / ⏱️ 20.1s 🟢 100% (1/1) / ⏱️ 38.6s / 💰 $0.10
121_new_relic_checkout_errors_tracing[0] 🔗 🔴 0% (0/1) / ⏱️ 24.9s / 💰 $0.04 🔴 0% (0/1) / ⏱️ 42.4s 🔴 0% (0/1) / ⏱️ 170.1s ⏱️ 0% (0/1) / ⏱️ 1394.2s 🔴 0% (0/1) / ⏱️ 20.0s 🔴 0% (0/1) / ⏱️ 68.8s / 💰 $0.20
122_new_relic_checkout_latency_tracing_rebuild[0] 🔗 🔴 0% (0/1) / ⏱️ 39.1s / 💰 $0.10 🔴 0% (0/1) / ⏱️ 78.3s 🔴 0% (0/1) / ⏱️ 22.5s ⏱️ 0% (0/1) / ⏱️ 1392.7s 🔴 0% (0/1) / ⏱️ 19.0s 🟢 100% (1/1) / ⏱️ 90.3s / 💰 $0.47
123_new_relic_checkout_errors_tracing[0] 🔗 🔴 0% (0/1) / ⏱️ 20.0s / 💰 $0.03 🔴 0% (0/1) / ⏱️ 52.3s 🔴 0% (0/1) / ⏱️ 54.4s ⏱️ 0% (0/1) / ⏱️ 1392.7s 🔴 0% (0/1) / ⏱️ 18.1s 🔴 0% (0/1) / ⏱️ 98.5s / 💰 $0.33
12_job_crashing 🔗 🟢 100% (1/1) / ⏱️ 42.2s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 58.2s 🔴 0% (0/1) / ⏱️ 24.5s ⏱️ 0% (0/1) / ⏱️ 1395.1s 🔴 0% (0/1) / ⏱️ 19.6s 🟢 100% (1/1) / ⏱️ 54.2s / 💰 $0.14
13a_pending_node_selector_basic 🔗 🟢 100% (1/1) / ⏱️ 41.5s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 67.7s 🟢 100% (1/1) / ⏱️ 105.1s ⏱️ 0% (0/1) / ⏱️ 1393.6s 🔴 0% (0/1) / ⏱️ 20.0s 🟢 100% (1/1) / ⏱️ 55.5s / 💰 $0.13
13b_pending_node_selector_detailed 🔗 🟢 100% (1/1) / ⏱️ 37.4s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 64.9s 🔴 0% (0/1) / ⏱️ 23.0s ⏱️ 0% (0/1) / ⏱️ 1394.3s 🔴 0% (0/1) / ⏱️ 20.6s 🟢 100% (1/1) / ⏱️ 50.2s / 💰 $0.13
14_pending_resources 🔗 🟢 100% (1/1) / ⏱️ 41.4s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 62.0s 🟢 100% (1/1) / ⏱️ 117.1s ⏱️ 0% (0/1) / ⏱️ 1390.6s 🔴 0% (0/1) / ⏱️ 28.8s 🟢 100% (1/1) / ⏱️ 51.5s / 💰 $0.13
156_kafka_opensearch_latency 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
159_prometheus_high_cardinality_cpu[0] 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
159_prometheus_high_cardinality_cpu[1] 🔗 🟢 100% (1/1) / ⏱️ 36.8s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 74.3s 🟢 100% (1/1) / ⏱️ 132.5s ⏱️ 0% (0/1) / ⏱️ 1397.6s 🔴 0% (0/1) / ⏱️ 18.6s 🟢 100% (1/1) / ⏱️ 51.8s / 💰 $0.29
159_prometheus_high_cardinality_cpu[2] 🔗 🔴 0% (0/1) / ⏱️ 47.4s / 💰 $0.14 🔴 0% (0/1) / ⏱️ 40.7s 🔴 0% (0/1) / ⏱️ 68.1s ⏱️ 0% (0/1) / ⏱️ 1398.1s 🔴 0% (0/1) / ⏱️ 20.1s 🔴 0% (0/1) / ⏱️ 37.9s / 💰 $0.14
15_failed_readiness_probe 🔗 🟢 100% (1/1) / ⏱️ 31.5s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 70.2s 🟢 100% (1/1) / ⏱️ 123.3s ⏱️ 0% (0/1) / ⏱️ 1399.9s 🔴 0% (0/1) / ⏱️ 19.2s 🟢 100% (1/1) / ⏱️ 57.9s / 💰 $0.14
16_failed_no_toolset_found 🔗 🔴 0% (0/1) / ⏱️ 21.6s / 💰 $0.03 🔴 0% (0/1) / ⏱️ 60.4s 🟢 100% (1/1) / ⏱️ 46.5s ⏱️ 0% (0/1) / ⏱️ 1393.7s 🔴 0% (0/1) / ⏱️ 20.1s 🔴 0% (0/1) / ⏱️ 22.8s / 💰 $0.06
17_oom_kill 🔗 🟢 100% (1/1) / ⏱️ 43.2s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 53.2s 🟢 100% (1/1) / ⏱️ 97.3s ⏱️ 0% (0/1) / ⏱️ 1398.8s 🔴 0% (0/1) / ⏱️ 34.1s 🟢 100% (1/1) / ⏱️ 54.8s / 💰 $0.16
19_detect_missing_app_details 🔗 🟢 100% (1/1) / ⏱️ 83.0s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 102.0s 🟢 100% (1/1) / ⏱️ 126.1s ⏱️ 0% (0/1) / ⏱️ 1393.3s 🔴 0% (0/1) / ⏱️ 18.5s 🟢 100% (1/1) / ⏱️ 101.3s / 💰 $0.14
20_long_log_file_search 🔗 🟢 100% (1/1) / ⏱️ 43.8s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 103.5s 🟢 100% (1/1) / ⏱️ 81.3s ⏱️ 0% (0/1) / ⏱️ 1390.3s 🔴 0% (0/1) / ⏱️ 19.3s 🟢 100% (1/1) / ⏱️ 54.7s / 💰 $0.11
21_job_fail_curl_no_svc_account 🔗 🟢 100% (1/1) / ⏱️ 346.5s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 77.7s 🔴 0% (0/1) / ⏱️ 24.1s ⏱️ 0% (0/1) / ⏱️ 1429.3s 🔴 0% (0/1) / ⏱️ 19.6s 🟢 100% (1/1) / ⏱️ 54.5s / 💰 $0.18
22_high_latency_dbi_down 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
23_app_error_in_current_logs 🔗 🟢 100% (1/1) / ⏱️ 58.9s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 88.5s 🟢 100% (1/1) / ⏱️ 292.4s ⏱️ 0% (0/1) / ⏱️ 1393.5s 🔴 0% (0/1) / ⏱️ 22.0s 🔴 0% (0/1) / ⏱️ 36.7s
24_misconfigured_pvc 🔗 🟢 100% (1/1) / ⏱️ 46.4s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 84.8s 🟢 100% (1/1) / ⏱️ 96.2s ⏱️ 0% (0/1) / ⏱️ 1394.3s 🔴 0% (0/1) / ⏱️ 19.4s 🟢 100% (1/1) / ⏱️ 61.7s / 💰 $0.15
24a_misconfigured_pvc_basic 🔗 🟢 100% (1/1) / ⏱️ 55.9s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 74.0s 🟢 100% (1/1) / ⏱️ 168.4s ⏱️ 0% (0/1) / ⏱️ 1398.9s 🔴 0% (0/1) / ⏱️ 20.4s 🟢 100% (1/1) / ⏱️ 58.7s / 💰 $0.17
24b_misconfigured_pvc_detailed 🔗 🔴 0% (0/1) / ⏱️ 45.6s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 82.0s 🟢 100% (1/1) / ⏱️ 159.2s ⏱️ 0% (0/1) / ⏱️ 1434.6s 🔴 0% (0/1) / ⏱️ 19.7s 🟢 100% (1/1) / ⏱️ 63.3s / 💰 $0.15
25_misconfigured_ingress_class 🔗 🟢 100% (1/1) / ⏱️ 63.0s / 💰 $0.23 🟢 100% (1/1) / ⏱️ 106.1s 🔴 0% (0/1) / ⏱️ 189.4s ⏱️ 0% (0/1) / ⏱️ 1398.6s 🔴 0% (0/1) / ⏱️ 23.8s 🟢 100% (1/1) / ⏱️ 104.3s / 💰 $0.30
26_page_render_times 🔗 🟢 100% (1/1) / ⏱️ 35.3s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 63.2s 🔴 0% (0/1) / ⏱️ 46.5s ⏱️ 0% (0/1) / ⏱️ 1397.4s 🔴 0% (0/1) / ⏱️ 22.8s 🟢 100% (1/1) / ⏱️ 57.8s / 💰 $0.15
27a_multi_container_logs 🔗 🟢 100% (1/1) / ⏱️ 50.0s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 59.7s 🟢 100% (1/1) / ⏱️ 74.6s ⏱️ 0% (0/1) / ⏱️ 1394.1s 🔴 0% (0/1) / ⏱️ 18.9s 🟢 100% (1/1) / ⏱️ 50.5s / 💰 $0.11
27b_multi_container_logs 🔗 🟢 100% (1/1) / ⏱️ 37.8s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 39.0s 🔴 0% (0/1) / ⏱️ 24.2s ⏱️ 0% (0/1) / ⏱️ 1392.3s 🔴 0% (0/1) / ⏱️ 18.6s 🟢 100% (1/1) / ⏱️ 37.3s / 💰 $0.11
28_permissions_error 🔗 🔴 0% (0/1) / ⏱️ 22.0s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 31.2s 🔴 0% (0/1) / ⏱️ 37.2s ⏱️ 0% (0/1) / ⏱️ 1389.7s 🔴 0% (0/1) / ⏱️ 17.0s 🔴 0% (0/1) / ⏱️ 25.6s / 💰 $0.07
33_cpu_metrics_discovery 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
39_failed_toolset 🔗 🟢 100% (1/1) / ⏱️ 39.3s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 77.7s 🔴 0% (0/1) / ⏱️ 157.3s ⏱️ 0% (0/1) / ⏱️ 1400.6s 🔴 0% (0/1) / ⏱️ 29.5s 🟢 100% (1/1) / ⏱️ 48.6s / 💰 $0.10
41_setup_argo 🔗 🔴 0% (0/1) / ⏱️ 25.0s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 29.7s 🟢 100% (1/1) / ⏱️ 165.8s ⏱️ 0% (0/1) / ⏱️ 1395.3s 🔴 0% (0/1) / ⏱️ 19.2s 🟢 100% (1/1) / ⏱️ 20.3s / 💰 $0.06
42_dns_issues_result_new_tools_no_runbook 🔗 🔴 0% (0/1) / ⏱️ 97.8s / 💰 $0.57 🟢 100% (1/1) / ⏱️ 93.5s 🟢 100% (1/1) / ⏱️ 130.6s ⏱️ 0% (0/1) / ⏱️ 1388.7s 🔴 0% (0/1) / ⏱️ 18.4s 🟢 100% (1/1) / ⏱️ 92.7s / 💰 $0.22
42_dns_issues_steps_new_tools 🔗 🟢 100% (1/1) / ⏱️ 81.4s / 💰 $0.38 🟢 100% (1/1) / ⏱️ 144.0s 🔴 0% (0/1) / ⏱️ 29.7s ⏱️ 0% (0/1) / ⏱️ 1390.7s 🔴 0% (0/1) / ⏱️ 21.1s 🟢 100% (1/1) / ⏱️ 108.7s / 💰 $0.35
43_current_datetime_from_prompt 🔗 🟢 100% (1/1) / ⏱️ 32.0s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 24.3s 🟢 100% (1/1) / ⏱️ 19.7s ⏱️ 0% (0/1) / ⏱️ 1393.6s 🟢 100% (1/1) / ⏱️ 18.7s 🟢 100% (1/1) / ⏱️ 39.1s / 💰 $0.06
43_slack_deployment_logs 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
44_slack_statefulset_logs 🔗 🔴 0% (0/1) / ⏱️ 0.9s ⚪️ - ⚪️ - ⏱️ 0% (0/1) / ⏱️ 1346.7s 🔴 0% (0/1) / ⏱️ 9.0s ⚪️ -
45_fetch_deployment_logs_simple 🔗 🟢 100% (1/1) / ⏱️ 39.5s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 57.4s 🔴 0% (0/1) / ⏱️ 22.1s ⏱️ 0% (0/1) / ⏱️ 1394.5s 🔴 0% (0/1) / ⏱️ 19.8s 🟢 100% (1/1) / ⏱️ 42.7s / 💰 $0.10
48_logs_since_thursday 🔗 🔴 0% (0/1) / ⏱️ 0.9s ⚪️ - 🔴 0% (0/1) / ⏱️ 28.1s ⏱️ 0% (0/1) / ⏱️ 1347.4s 🔴 0% (0/1) / ⏱️ 6.4s ⚪️ -
50_logs_since_specific_date 🔗 🔴 0% (0/1) / ⏱️ 1.0s 🟢 100% (1/1) / ⏱️ 32.8s 🟢 100% (1/1) / ⏱️ 36.2s ⏱️ 0% (0/1) / ⏱️ 1345.5s 🔴 0% (0/1) / ⏱️ 23.3s 🟢 100% (1/1) / ⏱️ 26.9s / 💰 $0.13
50a_logs_since_last_specific_month 🔗 🟢 100% (1/1) / ⏱️ 30.1s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 38.3s 🟢 100% (1/1) / ⏱️ 71.2s ⏱️ 0% (0/1) / ⏱️ 1395.0s 🔴 0% (0/1) / ⏱️ 21.2s 🟢 100% (1/1) / ⏱️ 40.4s / 💰 $0.10
51_logs_summarize_errors 🔗 🟢 100% (1/1) / ⏱️ 37.3s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 46.3s 🟢 100% (1/1) / ⏱️ 45.3s ⏱️ 0% (0/1) / ⏱️ 1394.7s 🔴 0% (0/1) / ⏱️ 22.1s 🟢 100% (1/1) / ⏱️ 126.6s / 💰 $0.10
52_logs_login_issues 🔗 🟢 100% (1/1) / ⏱️ 45.2s / 💰 $0.55 🟢 100% (1/1) / ⏱️ 66.5s 🔴 0% (0/1) / ⏱️ 24.4s ⏱️ 0% (0/1) / ⏱️ 1391.3s 🔴 0% (0/1) / ⏱️ 18.7s 🟢 100% (1/1) / ⏱️ 56.6s / 💰 $0.79
53_logs_find_term 🔗 🟢 100% (1/1) / ⏱️ 30.3s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 46.9s 🟢 100% (1/1) / ⏱️ 53.9s ⏱️ 0% (0/1) / ⏱️ 1395.6s 🔴 0% (0/1) / ⏱️ 18.7s 🟢 100% (1/1) / ⏱️ 44.4s / 💰 $0.14
54_not_truncated_when_getting_pods 🔗 🟢 100% (1/1) / ⏱️ 39.0s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 55.7s 🟢 100% (1/1) / ⏱️ 140.6s ⏱️ 0% (0/1) / ⏱️ 1391.5s 🔴 0% (0/1) / ⏱️ 20.0s 🟢 100% (1/1) / ⏱️ 46.3s / 💰 $0.12
55_kafka_runbook 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
57_wrong_namespace 🔗 🔴 0% (0/1) / ⏱️ 32.9s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 57.6s 🟢 100% (1/1) / ⏱️ 58.3s ⏱️ 0% (0/1) / ⏱️ 1392.9s 🔴 0% (0/1) / ⏱️ 17.2s 🟢 100% (1/1) / ⏱️ 40.6s / 💰 $0.10
59_label_based_counting 🔗 🟢 100% (1/1) / ⏱️ 32.1s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 32.8s 🔴 0% (0/1) / ⏱️ 21.2s ⏱️ 0% (0/1) / ⏱️ 1392.1s 🔴 0% (0/1) / ⏱️ 18.1s 🟢 100% (1/1) / ⏱️ 30.6s / 💰 $0.08
60_count_less_than 🔗 🟢 100% (1/1) / ⏱️ 28.3s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 46.1s 🟢 100% (1/1) / ⏱️ 56.3s ⏱️ 0% (0/1) / ⏱️ 1397.0s 🔴 0% (0/1) / ⏱️ 19.5s 🟢 100% (1/1) / ⏱️ 36.1s / 💰 $0.09
61_exact_match_counting 🔗 🟢 100% (1/1) / ⏱️ 37.2s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 33.8s 🟢 100% (1/1) / ⏱️ 74.3s ⏱️ 0% (0/1) / ⏱️ 1393.3s 🔴 0% (0/1) / ⏱️ 18.2s 🟢 100% (1/1) / ⏱️ 28.4s / 💰 $0.08
62_fetch_error_logs_with_errors 🔗 🟢 100% (1/1) / ⏱️ 43.9s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 41.8s 🔴 0% (0/1) / ⏱️ 45.4s ⏱️ 0% (0/1) / ⏱️ 1392.0s 🔴 0% (0/1) / ⏱️ 17.8s 🟢 100% (1/1) / ⏱️ 34.3s / 💰 $0.09
63_fetch_error_logs_no_errors 🔗 🟢 100% (1/1) / ⏱️ 32.9s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 49.0s 🔴 0% (0/1) / ⏱️ 22.1s ⏱️ 0% (0/1) / ⏱️ 1398.8s 🔴 0% (0/1) / ⏱️ 17.5s 🟢 100% (1/1) / ⏱️ 44.2s / 💰 $0.11
64_keda_vs_hpa_confusion 🔗 🔴 0% (0/1) / ⏱️ 41.0s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 67.0s 🟢 100% (1/1) / ⏱️ 133.7s ⏱️ 0% (0/1) / ⏱️ 1394.2s 🔴 0% (0/1) / ⏱️ 18.3s 🟢 100% (1/1) / ⏱️ 77.6s / 💰 $0.22
65_health_check_followup 🔗 🟢 100% (1/1) / ⏱️ 51.3s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 61.0s 🔴 0% (0/1) / ⏱️ 27.2s ⏱️ 0% (0/1) / ⏱️ 1398.0s 🔴 0% (0/1) / ⏱️ 19.8s 🔴 0% (0/1) / ⏱️ 78.0s / 💰 $0.20
71_connection_pool_starvation 🔗 🟢 100% (1/1) / ⏱️ 43.0s / 💰 $0.55 🟢 100% (1/1) / ⏱️ 61.2s 🔴 0% (0/1) / ⏱️ 352.6s ⏱️ 0% (0/1) / ⏱️ 1397.0s 🔴 0% (0/1) / ⏱️ 19.5s 🟢 100% (1/1) / ⏱️ 69.9s / 💰 $0.84
73a_time_window_anomaly 🔗 🔴 0% (0/1) / ⏱️ 38.9s / 💰 $0.57 🔴 0% (0/1) / ⏱️ 58.3s 🟢 100% (1/1) / ⏱️ 65.3s ⏱️ 0% (0/1) / ⏱️ 1397.7s 🔴 0% (0/1) / ⏱️ 18.9s 🔴 0% (0/1) / ⏱️ 72.9s / 💰 $0.74
73b_time_window_anomaly 🔗 🔴 0% (0/1) / ⏱️ 42.9s / 💰 $0.57 🟢 100% (1/1) / ⏱️ 54.0s 🔴 0% (0/1) / ⏱️ 23.5s ⏱️ 0% (0/1) / ⏱️ 1397.3s 🔴 0% (0/1) / ⏱️ 22.5s 🔴 0% (0/1) / ⏱️ 57.0s / 💰 $0.78
76_service_discovery_issue 🔗 🟢 100% (1/1) / ⏱️ 52.3s / 💰 $0.26 🟢 100% (1/1) / ⏱️ 73.2s 🔴 0% (0/1) / ⏱️ 86.3s ⏱️ 0% (0/1) / ⏱️ 1427.9s 🔴 0% (0/1) / ⏱️ 19.9s ⏱️ 0% (0/1) / ⏱️ 763.4s
77_liveness_probe_misconfiguration 🔗 🟢 100% (1/1) / ⏱️ 45.9s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 61.2s 🟢 100% (1/1) / ⏱️ 104.2s ⏱️ 0% (0/1) / ⏱️ 1395.1s 🔴 0% (0/1) / ⏱️ 20.4s 🟢 100% (1/1) / ⏱️ 50.5s / 💰 $0.14
78a_missing_cpu_limits 🔗 🟢 100% (1/1) / ⏱️ 47.5s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 75.1s 🔴 0% (0/1) / ⏱️ 30.3s ⏱️ 0% (0/1) / ⏱️ 1397.5s 🔴 0% (0/1) / ⏱️ 20.5s 🟢 100% (1/1) / ⏱️ 53.8s / 💰 $0.14
78b_cpu_quota_exceeded 🔗 🔴 0% (0/1) / ⏱️ 43.2s / 💰 $0.08 🔴 0% (0/1) / ⏱️ 60.6s 🟢 100% (1/1) / ⏱️ 68.5s ⏱️ 0% (0/1) / ⏱️ 1393.9s 🔴 0% (0/1) / ⏱️ 18.6s 🟢 100% (1/1) / ⏱️ 52.8s / 💰 $0.12
79_configmap_mount_issue 🔗 🟢 100% (1/1) / ⏱️ 40.8s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 46.0s 🟢 100% (1/1) / ⏱️ 98.8s ⏱️ 0% (0/1) / ⏱️ 1395.8s 🔴 0% (0/1) / ⏱️ 19.4s 🟢 100% (1/1) / ⏱️ 47.6s / 💰 $0.11
80_pvc_storage_class_mismatch 🔗 🔴 0% (0/1) / ⏱️ 41.3s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 69.7s 🔴 0% (0/1) / ⏱️ 28.1s ⏱️ 0% (0/1) / ⏱️ 1406.2s 🔴 0% (0/1) / ⏱️ 29.7s 🟢 100% (1/1) / ⏱️ 46.2s / 💰 $0.12
81_service_account_permission_denied 🔗 🟢 100% (1/1) / ⏱️ 61.1s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 63.2s 🔴 0% (0/1) / ⏱️ 25.0s ⏱️ 0% (0/1) / ⏱️ 1391.6s 🔴 0% (0/1) / ⏱️ 19.5s 🟢 100% (1/1) / ⏱️ 56.2s / 💰 $0.20
82_pod_anti_affinity_conflict 🔗 🟢 100% (1/1) / ⏱️ 34.2s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 59.2s 🟢 100% (1/1) / ⏱️ 78.9s ⏱️ 0% (0/1) / ⏱️ 1393.5s 🔴 0% (0/1) / ⏱️ 21.0s 🟢 100% (1/1) / ⏱️ 55.5s / 💰 $0.15
83_secret_not_found 🔗 🟢 100% (1/1) / ⏱️ 44.6s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 68.8s 🔴 0% (0/1) / ⏱️ 23.6s ⏱️ 0% (0/1) / ⏱️ 1396.1s 🔴 0% (0/1) / ⏱️ 21.0s 🟢 100% (1/1) / ⏱️ 43.1s / 💰 $0.11
84_network_policy_blocking_traffic 🔗 🔴 0% (0/1) / ⏱️ 65.7s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 73.9s 🟢 100% (1/1) / ⏱️ 100.6s ⏱️ 0% (0/1) / ⏱️ 1396.9s 🔴 0% (0/1) / ⏱️ 20.8s 🟢 100% (1/1) / ⏱️ 71.7s / 💰 $0.22
85_hpa_not_scaling 🔗 🟢 100% (1/1) / ⏱️ 59.9s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 67.8s 🟢 100% (1/1) / ⏱️ 132.1s ⏱️ 0% (0/1) / ⏱️ 1393.8s 🔴 0% (0/1) / ⏱️ 19.0s 🟢 100% (1/1) / ⏱️ 54.3s / 💰 $0.17
86_configmap_like_but_secret 🔗 🟢 100% (1/1) / ⏱️ 39.3s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 56.3s 🟢 100% (1/1) / ⏱️ 81.1s ⏱️ 0% (0/1) / ⏱️ 1395.1s 🔴 0% (0/1) / ⏱️ 20.1s 🟢 100% (1/1) / ⏱️ 50.8s / 💰 $0.12
89_runbook_missing_cloudwatch 🔗 🟢 100% (1/1) / ⏱️ 26.9s / 💰 $0.05 🔴 0% (0/1) / ⏱️ 35.3s 🔴 0% (0/1) / ⏱️ 23.7s ⏱️ 0% (0/1) / ⏱️ 1385.0s 🔴 0% (0/1) / ⏱️ 15.2s 🟢 100% (1/1) / ⏱️ 40.1s / 💰 $0.09
90_runbook_basic_selection 🔗 🟢 100% (1/1) / ⏱️ 81.5s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 261.4s 🟢 100% (1/1) / ⏱️ 177.6s ⏱️ 0% (0/1) / ⏱️ 1391.7s 🔴 0% (0/1) / ⏱️ 23.8s 🟢 100% (1/1) / ⏱️ 118.7s / 💰 $0.35
91f_datadog_logs_historical_pod 🔗 🔴 0% (0/1) / ⏱️ 12.5s 🔴 0% (0/1) / ⏱️ 12.6s 🔴 0% (0/1) / ⏱️ 12.3s 🔴 0% (0/1) / ⏱️ 12.4s 🔴 0% (0/1) / ⏱️ 13.3s 🔴 0% (0/1) / ⏱️ 12.5s
93_calling_datadog[0] 🔗 🔴 0% (0/1) / ⏱️ 1.0s 🟢 100% (1/1) / ⏱️ 21.1s 🟢 100% (1/1) / ⏱️ 18.7s ⏱️ 0% (0/1) / ⏱️ 1347.3s 🔴 0% (0/1) / ⏱️ 21.1s 🟢 100% (1/1) / ⏱️ 29.5s / 💰 $0.16
93_calling_datadog[1] 🔗 🔴 0% (0/1) / ⏱️ 1.1s 🟢 100% (1/1) / ⏱️ 44.2s 🟢 100% (1/1) / ⏱️ 13.0s ⏱️ 0% (0/1) / ⏱️ 1347.2s 🔴 0% (0/1) / ⏱️ 8.2s 🟢 100% (1/1) / ⏱️ 12.6s / 💰 $0.16
93_calling_datadog[2] 🔗 🔴 0% (0/1) / ⏱️ 1.0s 🟢 100% (1/1) / ⏱️ 23.9s 🟢 100% (1/1) / ⏱️ 104.7s ⏱️ 0% (0/1) / ⏱️ 1347.5s 🔴 0% (0/1) / ⏱️ 8.4s 🟢 100% (1/1) / ⏱️ 12.5s / 💰 $0.16
93_events_since_specific_date 🔗 🔴 0% (0/1) / ⏱️ 1.0s 🟢 100% (1/1) / ⏱️ 25.9s 🟢 100% (1/1) / ⏱️ 20.7s ⏱️ 0% (0/1) / ⏱️ 1346.5s 🔴 0% (0/1) / ⏱️ 6.4s 🟢 100% (1/1) / ⏱️ 17.2s / 💰 $0.11
94_runbook_transparency 🔗 🟢 100% (1/1) / ⏱️ 80.9s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 103.5s 🔴 0% (0/1) / ⏱️ 129.4s ⏱️ 0% (0/1) / ⏱️ 1396.6s 🔴 0% (0/1) / ⏱️ 21.8s 🟢 100% (1/1) / ⏱️ 90.6s / 💰 $0.24
96_no_matching_runbook 🔗 🔴 0% (0/1) / ⏱️ 44.6s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 80.3s 🔴 0% (0/1) / ⏱️ 635.0s ⏱️ 0% (0/1) / ⏱️ 1403.3s 🔴 0% (0/1) / ⏱️ 20.9s 🟢 100% (1/1) / ⏱️ 92.7s / 💰 $0.60
97_logs_clarification_needed 🔗 🟢 100% (1/1) / ⏱️ 19.3s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 24.7s 🟢 100% (1/1) / ⏱️ 20.4s ⏱️ 0% (0/1) / ⏱️ 1399.3s 🟢 100% (1/1) / ⏱️ 17.8s 🟢 100% (1/1) / ⏱️ 45.2s / 💰 $0.19
98_logs_transparency_default_time 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
99_logs_transparency_custom_time 🔗 🟢 100% (1/1) / ⏱️ 44.9s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 58.0s 🟢 100% (1/1) / ⏱️ 51.1s ⏱️ 0% (0/1) / ⏱️ 1396.9s 🔴 0% (0/1) / ⏱️ 19.6s 🟢 100% (1/1) / ⏱️ 49.3s / 💰 $0.10

Results are automatically generated and updated weekly. View full traces and detailed analysis in Braintrust experiment: local-benchmark-20251007-235918.