Skip to content

September 30, 2025

Generated: 2025-09-30 08:59 UTC

Total Duration: 1h 36m 3s

Iterations: 1

Judge (classifier) model: gpt-4o

About this Benchmark

HolmesGPT is continuously evaluated against real-world Kubernetes and cloud troubleshooting scenarios.

If you find scenarios that HolmesGPT does not perform well on, please consider adding them as evals to the benchmark.

Model Accuracy Comparison

Model Pass Fail Skip/Error Total Success Rate
gpt-4o 65 29 11 105 🟡 69% (65/94)
gpt-4.1 70 24 11 105 🟡 74% (70/94)
gpt-5 74 19 12 105 🟡 80% (74/93)
sonnet-4-20250514 91 3 11 105 🟡 97% (91/94)
sonnet-4-5-20250929 87 7 11 105 🟡 93% (87/94)

Model Cost Comparison

Model Tests Avg Cost Min Cost Max Cost Total Cost
gpt-4o 94 $0.13 $0.03 $0.43 $12.59
gpt-4.1 94 $0.11 $0.02 $0.46 $9.99
gpt-5 93 $0.13 $0.02 $0.47 $12.12
sonnet-4-20250514 94 $0.17 $0.06 $0.58 $15.66
sonnet-4-5-20250929 92 $0.16 $0.06 $0.58 $14.88

Model Latency Comparison

Model Avg (s) Min (s) Max (s) P50 (s) P95 (s)
gpt-4o 36.7 9.4 85.6 36.1 56.5
gpt-4.1 51.9 11.7 641.0 43.3 79.0
gpt-5 170.2 24.3 697.1 144.3 391.2
sonnet-4-20250514 73.2 11.6 654.9 55.7 160.2
sonnet-4-5-20250929 69.5 10.3 694.5 53.5 152.7

Performance by Tag

Success rate by test category and model:

Tag gpt-4o gpt-4.1 gpt-5 sonnet-4-20250514 sonnet-4-5-20250929 Warnings
chain-of-causation 🔴 0% (0/6) 🔴 0% (0/6) 🟡 33% (2/6) 🟢 100% (6/6) 🟢 100% (6/6) ⚠️ 10 skipped
context_window 🟡 86% (6/7) 🟡 43% (3/7) 🟢 100% (7/7) 🟢 100% (7/7) 🟡 86% (6/7)
counting 🟢 100% (4/4) 🟢 100% (4/4) 🟢 100% (4/4) 🟢 100% (4/4) 🟢 100% (4/4)
database 🔴 0% (0/1) 🟢 100% (1/1) 🟢 100% (1/1) 🟢 100% (1/1) 🟢 100% (1/1) ⚠️ 15 skipped
datadog 🟡 75% (¾) 🟢 100% (4/4) 🟡 75% (¾) 🟢 100% (4/4) 🟢 100% (4/4)
datetime 🟢 100% (4/4) 🟡 50% (2/4) 🟢 100% (4/4) 🟢 100% (4/4) 🟢 100% (4/4) ⚠️ 10 skipped
easy 🟡 97% (35/36) 🟢 100% (36/36) 🟡 83% (30/36) 🟢 100% (36/36) 🟡 97% (35/36)
hard 🟡 14% (2/14) 🟡 36% (5/14) 🟡 50% (7/14) 🟢 100% (14/14) 🟡 93% (13/14) ⚠️ 30 skipped
kafka ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚠️ 10 skipped
kubernetes 🟡 60% (28/47) 🟡 70% (33/47) 🟡 72% (34/47) 🟡 98% (46/47) 🟡 91% (43/47) ⚠️ 5 skipped
logs 🟡 69% (18/26) 🟡 69% (18/26) 🟡 85% (22/26) 🟡 92% (24/26) 🟡 88% (23/26) ⚠️ 35 skipped
medium 🟡 64% (28/44) 🟡 66% (29/44) 🟡 86% (37/43) 🟡 93% (41/44) 🟡 89% (39/44) ⚠️ 26 skipped
network 🟡 75% (¾) 🟡 25% (¼) 🟢 100% (4/4) 🟢 100% (4/4) 🟡 75% (¾)
numerical 🟢 100% (1/1) 🟢 100% (1/1) 🟢 100% (1/1) 🟢 100% (1/1) 🟢 100% (1/1)
port-forward 🟡 44% (4/9) 🟡 44% (4/9) 🟡 78% (7/9) 🟡 89% (8/9) 🟡 67% (6/9)
prometheus 🟡 75% (¾) 🟡 75% (¾) 🟢 100% (4/4) 🟢 100% (4/4) 🟡 75% (¾)
question-answer 🟢 100% (4/4) 🟢 100% (4/4) 🟢 100% (4/4) 🟢 100% (4/4) 🟢 100% (4/4)
runbooks 🟡 83% (⅚) 🟡 67% (4/6) 🟡 83% (⅚) 🟢 100% (6/6) 🟡 83% (⅚) ⚠️ 5 skipped
slackbot ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚠️ 5 skipped
traces 🔴 0% (0/5) 🔴 0% (0/5) 🟡 20% (⅕) 🟢 100% (5/5) 🟢 100% (5/5)
transparency 🟡 71% (10/14) 🟡 86% (12/14) 🟡 93% (13/14) 🟡 86% (12/14) 🟡 93% (13/14) ⚠️ 5 skipped
Overall 🟡 69% (65/94) 🟡 74% (70/94) 🟡 80% (74/93) 🟡 97% (91/94) 🟡 93% (87/94) ⚠️ 56 skipped

Raw Results

Status of all evaluations across models. Color coding:

  • 🟢 Passing 100% (stable)
  • 🟡 Passing 1-99%
  • 🔴 Passing 0% (failing)
  • 🔧 Mock data failure (missing or invalid test data)
  • ⚠️ Setup failure (environment/infrastructure issue)
  • ⏱️ Timeout or rate limit error
  • ⏭️ Test skipped (e.g., known issue or precondition not met)
Eval ID gpt-4o gpt-4.1 gpt-5 sonnet-4-20250514 sonnet-4-5-20250929
01_how_many_pods 🔗 🟢 🟢 🟢 🟢 🟢
02_what_is_wrong_with_pod 🔗 🟢 🟢 🟢 🟢 🟢
03_what_is_the_command_to_port_forward 🔗 🟢 🟢 🟢 🟢 🟢
04_related_k8s_events 🔗 🟢 🟢 🟢 🟢 🟢
05_image_version 🔗 🟢 🟢 🔴 🟢 🟢
09_crashpod 🔗 🟢 🟢 🟢 🟢 🟢
100a_historical_logs 🔗 🟢 🟢 🟢 🟢 🟢
100b_historical_logs_nonstandard_label 🔗 🔴 🔴 🔴 🔴 🔴
101_historical_logs_pod_deleted 🔗 🔴 🔴 🟢 🟢 🔴
103_logs_transparency_default_limit 🔗 🔴 🔴 🟢 🔴 🟢
104a_postgres_root_issue 🔗 🔴 🟢 🟢 🟢 🟢
107_log_filter_http_status_code 🔗 🟢 🟢 🟢 🟢 🟢
108_logs_nearby_lines 🔗 🔴 🔴 🔴 🟢 🔴
109_logs_transparency_not_found 🔗 🔴 🟢 🟢 🟢 🟢
10_image_pull_backoff 🔗 🟢 🟢 🟢 🟢 🟢
110_k8s_events_image_pull 🔗 🟢 🟢 🟢 🟢 🟢
111_disabled_datadog_traces 🔗 🔴 🟢 🟢 🔴 🟢
111_pod_names_contain_service 🔗 🟢 🟢 🟢 🟢 🟢
112_find_pvcs_by_uuid 🔗 🔴 🔴 🟢 🟢 🟢
114_checkout_latency_tracing_rebuild[0] 🔗 🔴 🔴 🔴 🟢 🟢
115_checkout_errors_tracing[0] 🔗 🔴 🔴 🟢 🟢 🟢
11_init_containers 🔗 🟢 🟢 🟢 🟢 🟢
121_new_relic_checkout_errors_tracing[0] 🔗 🔴 🔴 🔴 🟢 🟢
122_new_relic_checkout_latency_tracing_rebuild[0] 🔗 🔴 🔴 🔴 🟢 🟢
123_new_relic_checkout_errors_tracing[0] 🔗 🔴 🔴 🔴 🟢 🟢
12_job_crashing 🔗 🔴 🟢 🟢 🟢 🟢
13a_pending_node_selector_basic 🔗 🟢 🟢 🔴 🟢 🟢
13b_pending_node_selector_detailed 🔗 🔴 🟢 🔴 🟢 🟢
14_pending_resources 🔗 🟢 🟢 🔴 🟢 🟢
159_prometheus_high_cardinality_cpu[0] 🔗 🟢 🟢 🟢 🟢 🟢
159_prometheus_high_cardinality_cpu[1] 🔗 🟢 🟢 🟢 🟢 🟢
159_prometheus_high_cardinality_cpu[2] 🔗 🔴 🔴 🟢 🟢 🔴
15_failed_readiness_probe 🔗 🟢 🟢 🟢 🟢 🟢
16_failed_no_toolset_found 🔗 🔴 🔴 🔴 🟢 🔴
17_oom_kill 🔗 🟢 🟢 🔴 🟢 🟢
19_detect_missing_app_details 🔗 🟢 🟢 🟢 🟢 🟢
20_long_log_file_search 🔗 🟢 🟢 🟢 🟢 🟢
21_job_fail_curl_no_svc_account 🔗 🟢 🟢 🔴 🟢 🟢
23_app_error_in_current_logs 🔗 🟢 🟢 🟢 🟢 🟢
24_misconfigured_pvc 🔗 🟢 🟢 🔴 🟢 🟢
24a_misconfigured_pvc_basic 🔗 🔴 🟢 🔴 🟢 🟢
24b_misconfigured_pvc_detailed 🔗 🔴 🟢 🔴 🟢 🟢
25_misconfigured_ingress_class 🔗 🔴 🔴 🟢 🟢 🟢
26_page_render_times 🔗 🟢 🟢 🟢 🟢 🟢
27a_multi_container_logs 🔗 🟢 🟢 🟢 🟢 🟢
27b_multi_container_logs 🔗 🟢 🟢 🟢 🟢 🟢
28_permissions_error 🔗 🟢 🟢 🟢 🟢 🟢
33_cpu_metrics_discovery 🔗 🟢 🟢 🟢 🟢 🟢
39_failed_toolset 🔗 🟢 🟢 🟢 🟢 🟢
41_setup_argo 🔗 🟢 🟢 🟢 🟢 🟢
42_dns_issues_result_new_tools_no_runbook 🔗 🟢 🔴 🟢 🟢 🟢
42_dns_issues_steps_new_tools 🔗 🟢 🟢 🟢 🟢 ⏱️
43_current_datetime_from_prompt 🔗 🟢 🟢 🟢 🟢 🟢
45_fetch_deployment_logs_simple 🔗 🟢 🟢 🟢 🟢 🟢
50_logs_since_specific_date 🔗 🟢 🟢 🟢 🟢 🟢
50a_logs_since_last_specific_month 🔗 🟢 🟢 🟢 🟢 🟢
51_logs_summarize_errors 🔗 🟢 🟢 🟢 🟢 🟢
52_logs_login_issues 🔗 🔴 🟢 🔴 🟢 🟢
53_logs_find_term 🔗 🟢 🟢 🟢 🟢 🟢
54_not_truncated_when_getting_pods 🔗 🟢 🟢 🟢 🟢 🟢
57_wrong_namespace 🔗 🔴 🔴 🟢 🟢 🟢
59_label_based_counting 🔗 🟢 🟢 🟢 🟢 🟢
60_count_less_than 🔗 🟢 🟢 🟢 🟢 🟢
61_exact_match_counting 🔗 🟢 🟢 🟢 🟢 🟢
62_fetch_error_logs_with_errors 🔗 🟢 🟢 🟢 🟢 🟢
63_fetch_error_logs_no_errors 🔗 🟢 🟢 🟢 🟢 🟢
64_keda_vs_hpa_confusion 🔗 🟢 🔴 🟢 🟢 🟢
65_health_check_followup 🔗 🟢 🟢 🟢 🟢 🟢
71_connection_pool_starvation 🔗 🟢 🔴 🟢 🟢 🟢
73a_time_window_anomaly 🔗 🟢 🔴 🟢 🟢 🟢
73b_time_window_anomaly 🔗 🟢 🔴 🟢 🟢 🟢
76_service_discovery_issue 🔗 🟢 🟢 🟢 🟢 ⏱️
77_liveness_probe_misconfiguration 🔗 🟢 🟢 🟢 🟢 🟢
78a_missing_cpu_limits 🔗 🔴 🔴 🟢 🟢 🟢
78b_cpu_quota_exceeded 🔗 🔴 🔴 🟢 🟢 🟢
79_configmap_mount_issue 🔗 🟢 🟢 🟢 🟢 🟢
80_pvc_storage_class_mismatch 🔗 🔴 🔴 🟢 🟢 🟢
81_service_account_permission_denied 🔗 🟢 🟢 🟢 🟢 🟢
82_pod_anti_affinity_conflict 🔗 🔴 🟢 🟢 🟢 🟢
83_secret_not_found 🔗 🟢 🟢 🟢 🟢 🟢
84_network_policy_blocking_traffic 🔗 🟢 🔴 🟢 🟢 🟢
85_hpa_not_scaling 🔗 🔴 🟢 🟢 🟢 🟢
86_configmap_like_but_secret 🔗 🟢 🟢 🟢 🟢 🟢
89_runbook_missing_cloudwatch 🔗 🟢 🟢 🟢 🟢 🟢
90_runbook_basic_selection 🔗 🟢 🟢 🟢 🟢 🟢
91f_datadog_logs_historical_pod 🔗 🔴 🟢 🔴 🟢 🟢
93_calling_datadog[0] 🔗 🟢 🟢 🟢 🟢 🟢
93_calling_datadog[1] 🔗 🟢 🟢 🟢 🟢 🟢
93_calling_datadog[2] 🔗 🟢 🟢 🟢 🟢 🟢
94_runbook_transparency 🔗 🟢 🟢 🟢 🟢 🟢
96_no_matching_runbook 🔗 🔴 🔴 🔴 🟢 🟢
97_logs_clarification_needed 🔗 🟢 🟢 🟢 🟢 🟢
99_logs_transparency_custom_time 🔗 🟢 🟢 🟢 🟢 🟢
93_events_since_specific_date 🔗 🟢 🟢 🔧 🟢 🟢
44_slack_statefulset_logs 🔗 🔧 🔧 🔧 🔧 🔧
48_logs_since_thursday 🔗 🔧 🔧 🔧 🔧 🔧
22_high_latency_dbi_down 🔗 ⚠️ ⚠️ ⚠️ ⚠️ ⚠️
08_sock_shop_frontend 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
104b_postgres_missing_index_pgstat 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
104c_postgres_minimal_missing_index 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
105_redis_wrong_data_structure 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
156_kafka_opensearch_latency 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
43_slack_deployment_logs 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
55_kafka_runbook 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
98_logs_transparency_default_time 🔗 ⏭️ ⏭️ ⏭️ ⏭️ ⏭️
SUMMARY 🟡 69% (65/94) 🟡 74% (70/94) 🟡 80% (74/93) 🟡 97% (91/94) 🟡 93% (87/94)

Detailed Raw Results

Eval ID gpt-4o gpt-4.1 gpt-5 sonnet-4-20250514 sonnet-4-5-20250929
01_how_many_pods 🔗 🟢 100% (1/1) / ⏱️ 27.3s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 30.7s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 50.6s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 26.7s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 27.1s / 💰 $0.08
02_what_is_wrong_with_pod 🔗 🟢 100% (1/1) / ⏱️ 27.0s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 40.0s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 137.4s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 43.1s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 34.7s / 💰 $0.09
03_what_is_the_command_to_port_forward 🔗 🟢 100% (1/1) / ⏱️ 27.5s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 41.1s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 231.3s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 35.4s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 36.3s / 💰 $0.09
04_related_k8s_events 🔗 🟢 100% (1/1) / ⏱️ 33.5s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 37.2s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 64.7s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 36.1s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 36.1s / 💰 $0.09
05_image_version 🔗 🟢 100% (1/1) / ⏱️ 39.1s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 34.7s / 💰 $0.07 🔴 0% (0/1) / ⏱️ 29.8s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 36.7s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 32.0s / 💰 $0.09
09_crashpod 🔗 🟢 100% (1/1) / ⏱️ 42.2s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 39.2s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 137.1s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 160.2s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 55.0s / 💰 $0.14
100a_historical_logs 🔗 🟢 100% (1/1) / ⏱️ 46.8s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 45.0s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 462.7s / 💰 $0.31 🟢 100% (1/1) / ⏱️ 78.2s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 154.9s / 💰 $0.23
100b_historical_logs_nonstandard_label 🔗 🔴 0% (0/1) / ⏱️ 38.8s / 💰 $0.16 🔴 0% (0/1) / ⏱️ 39.7s / 💰 $0.08 🔴 0% (0/1) / ⏱️ 398.3s / 💰 $0.29 🔴 0% (0/1) / ⏱️ 136.4s / 💰 $0.27 🔴 0% (0/1) / ⏱️ 88.3s / 💰 $0.19
101_historical_logs_pod_deleted 🔗 🔴 0% (0/1) / ⏱️ 35.9s / 💰 $0.13 🔴 0% (0/1) / ⏱️ 73.8s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 333.2s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 140.9s / 💰 $0.20 🔴 0% (0/1) / ⏱️ 71.9s / 💰 $0.15
103_logs_transparency_default_limit 🔗 🔴 0% (0/1) / ⏱️ 36.7s / 💰 $0.14 🔴 0% (0/1) / ⏱️ 80.7s / 💰 $0.29 🟢 100% (1/1) / ⏱️ 98.6s / 💰 $0.11 🔴 0% (0/1) / ⏱️ 67.0s / 💰 $0.41 🟢 100% (1/1) / ⏱️ 50.8s / 💰 $0.12
104a_postgres_root_issue 🔗 🔴 0% (0/1) / ⏱️ 39.5s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 68.6s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 190.6s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 71.7s / 💰 $0.19 🟢 100% (1/1) / ⏱️ 60.3s / 💰 $0.20
107_log_filter_http_status_code 🔗 🟢 100% (1/1) / ⏱️ 45.5s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 45.8s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 235.8s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 69.1s / 💰 $0.19 🟢 100% (1/1) / ⏱️ 72.7s / 💰 $0.27
108_logs_nearby_lines 🔗 🔴 0% (0/1) / ⏱️ 36.5s / 💰 $0.15 🔴 0% (0/1) / ⏱️ 62.5s / 💰 $0.11 🔴 0% (0/1) / ⏱️ 293.6s / 💰 $0.24 🟢 100% (1/1) / ⏱️ 72.8s / 💰 $0.21 🔴 0% (0/1) / ⏱️ 88.1s / 💰 $0.29
109_logs_transparency_not_found 🔗 🔴 0% (0/1) / ⏱️ 85.6s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 34.3s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 140.1s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 347.6s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 51.1s / 💰 $0.10
10_image_pull_backoff 🔗 🟢 100% (1/1) / ⏱️ 38.3s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 47.0s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 185.1s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 50.2s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 50.3s / 💰 $0.13
110_k8s_events_image_pull 🔗 🟢 100% (1/1) / ⏱️ 31.9s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 37.7s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 102.5s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 40.0s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 44.7s / 💰 $0.10
111_disabled_datadog_traces 🔗 🔴 0% (0/1) / ⏱️ 16.9s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 29.5s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 181.3s / 💰 $0.10 🔴 0% (0/1) / ⏱️ 93.1s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 23.0s / 💰 $0.06
111_pod_names_contain_service 🔗 🟢 100% (1/1) / ⏱️ 42.2s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 66.6s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 253.0s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 72.3s / 💰 $0.22 🟢 100% (1/1) / ⏱️ 63.9s / 💰 $0.21
112_find_pvcs_by_uuid 🔗 🔴 0% (0/1) / ⏱️ 38.6s / 💰 $0.16 🔴 0% (0/1) / ⏱️ 50.8s / 💰 $0.28 🟢 100% (1/1) / ⏱️ 143.2s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 43.5s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 42.5s / 💰 $0.11
114_checkout_latency_tracing_rebuild[0] 🔗 🔴 0% (0/1) / ⏱️ 38.7s / 💰 $0.17 🔴 0% (0/1) / ⏱️ 59.3s / 💰 $0.20 🔴 0% (0/1) / ⏱️ 273.2s / 💰 $0.25 🟢 100% (1/1) / ⏱️ 99.8s / 💰 $0.33 🟢 100% (1/1) / ⏱️ 80.0s / 💰 $0.32
115_checkout_errors_tracing[0] 🔗 🔴 0% (0/1) / ⏱️ 37.4s / 💰 $0.20 🔴 0% (0/1) / ⏱️ 56.2s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 218.9s / 💰 $0.29 🟢 100% (1/1) / ⏱️ 101.9s / 💰 $0.33 🟢 100% (1/1) / ⏱️ 142.6s / 💰 $0.49
11_init_containers 🔗 🟢 100% (1/1) / ⏱️ 31.2s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 67.3s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 127.5s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 50.4s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 40.5s / 💰 $0.11
121_new_relic_checkout_errors_tracing[0] 🔗 🔴 0% (0/1) / ⏱️ 29.5s / 💰 $0.11 🔴 0% (0/1) / ⏱️ 43.0s / 💰 $0.09 🔴 0% (0/1) / ⏱️ 433.5s / 💰 $0.24 🟢 100% (1/1) / ⏱️ 256.7s / 💰 $0.29 🟢 100% (1/1) / ⏱️ 98.2s / 💰 $0.31
122_new_relic_checkout_latency_tracing_rebuild[0] 🔗 🔴 0% (0/1) / ⏱️ 38.3s / 💰 $0.19 🔴 0% (0/1) / ⏱️ 51.3s / 💰 $0.15 🔴 0% (0/1) / ⏱️ 697.1s / 💰 $0.47 🟢 100% (1/1) / ⏱️ 87.7s / 💰 $0.42 🟢 100% (1/1) / ⏱️ 152.7s / 💰 $0.51
123_new_relic_checkout_errors_tracing[0] 🔗 🔴 0% (0/1) / ⏱️ 47.9s / 💰 $0.26 🔴 0% (0/1) / ⏱️ 39.3s / 💰 $0.09 🔴 0% (0/1) / ⏱️ 341.5s / 💰 $0.26 🟢 100% (1/1) / ⏱️ 102.7s / 💰 $0.32 🟢 100% (1/1) / ⏱️ 133.6s / 💰 $0.58
12_job_crashing 🔗 🔴 0% (0/1) / ⏱️ 27.7s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 35.1s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 116.1s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 55.7s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 57.8s / 💰 $0.18
13a_pending_node_selector_basic 🔗 🟢 100% (1/1) / ⏱️ 31.7s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 42.6s / 💰 $0.10 🔴 0% (0/1) / ⏱️ 27.6s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 49.6s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 52.5s / 💰 $0.13
13b_pending_node_selector_detailed 🔗 🔴 0% (0/1) / ⏱️ 35.1s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 58.2s / 💰 $0.09 🔴 0% (0/1) / ⏱️ 24.4s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 53.6s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 55.4s / 💰 $0.15
14_pending_resources 🔗 🟢 100% (1/1) / ⏱️ 31.5s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 44.9s / 💰 $0.09 🔴 0% (0/1) / ⏱️ 30.9s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 124.7s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 49.2s / 💰 $0.12
159_prometheus_high_cardinality_cpu[0] 🔗 🟢 100% (1/1) / ⏱️ 30.5s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 48.4s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 192.6s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 64.2s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 50.3s / 💰 $0.17
159_prometheus_high_cardinality_cpu[1] 🔗 🟢 100% (1/1) / ⏱️ 41.3s / 💰 $0.24 🟢 100% (1/1) / ⏱️ 42.4s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 143.2s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 56.7s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 41.4s / 💰 $0.17
159_prometheus_high_cardinality_cpu[2] 🔗 🔴 0% (0/1) / ⏱️ 29.2s / 💰 $0.09 🔴 0% (0/1) / ⏱️ 41.3s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 134.7s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 68.5s / 💰 $0.23 🔴 0% (0/1) / ⏱️ 43.0s / 💰 $0.17
15_failed_readiness_probe 🔗 🟢 100% (1/1) / ⏱️ 39.0s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 37.5s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 203.4s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 72.6s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 46.1s / 💰 $0.12
16_failed_no_toolset_found 🔗 🔴 0% (0/1) / ⏱️ 54.1s / 💰 $0.14 🔴 0% (0/1) / ⏱️ 25.2s / 💰 $0.04 🔴 0% (0/1) / ⏱️ 47.6s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 25.5s / 💰 $0.06 🔴 0% (0/1) / ⏱️ 23.8s / 💰 $0.06
17_oom_kill 🔗 🟢 100% (1/1) / ⏱️ 34.5s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 61.6s / 💰 $0.09 🔴 0% (0/1) / ⏱️ 35.5s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 56.5s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 42.4s / 💰 $0.11
19_detect_missing_app_details 🔗 🟢 100% (1/1) / ⏱️ 48.2s / 💰 $0.43 🟢 100% (1/1) / ⏱️ 39.3s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 242.9s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 96.4s / 💰 $0.25 🟢 100% (1/1) / ⏱️ 54.5s / 💰 $0.11
20_long_log_file_search 🔗 🟢 100% (1/1) / ⏱️ 41.1s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 46.0s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 91.8s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 55.6s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 70.1s / 💰 $0.11
21_job_fail_curl_no_svc_account 🔗 🟢 100% (1/1) / ⏱️ 50.1s / 💰 $0.27 🟢 100% (1/1) / ⏱️ 641.0s / 💰 $0.11 🔴 0% (0/1) / ⏱️ 58.1s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 56.9s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 54.0s / 💰 $0.24
23_app_error_in_current_logs 🔗 🟢 100% (1/1) / ⏱️ 48.1s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 64.6s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 206.2s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 76.0s / 💰 $0.58 🟢 100% (1/1) / ⏱️ 73.5s / 💰 $0.35
24_misconfigured_pvc 🔗 🟢 100% (1/1) / ⏱️ 36.0s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 52.8s / 💰 $0.12 🔴 0% (0/1) / ⏱️ 24.3s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 70.9s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 61.6s / 💰 $0.16
24a_misconfigured_pvc_basic 🔗 🔴 0% (0/1) / ⏱️ 30.9s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 47.7s / 💰 $0.12 🔴 0% (0/1) / ⏱️ 30.9s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 55.3s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 66.7s / 💰 $0.16
24b_misconfigured_pvc_detailed 🔗 🔴 0% (0/1) / ⏱️ 44.2s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 74.3s / 💰 $0.18 🔴 0% (0/1) / ⏱️ 26.1s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 58.7s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 156.8s / 💰 $0.16
25_misconfigured_ingress_class 🔗 🔴 0% (0/1) / ⏱️ 51.2s / 💰 $0.11 🔴 0% (0/1) / ⏱️ 51.1s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 285.5s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 83.6s / 💰 $0.23 🟢 100% (1/1) / ⏱️ 78.5s / 💰 $0.30
26_page_render_times 🔗 🟢 100% (1/1) / ⏱️ 29.7s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 48.6s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 241.1s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 54.3s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 55.2s / 💰 $0.16
27a_multi_container_logs 🔗 🟢 100% (1/1) / ⏱️ 32.8s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 48.5s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 105.7s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 49.8s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 37.8s / 💰 $0.12
27b_multi_container_logs 🔗 🟢 100% (1/1) / ⏱️ 30.5s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 48.2s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 132.3s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 44.1s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 37.9s / 💰 $0.11
28_permissions_error 🔗 🟢 100% (1/1) / ⏱️ 19.7s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 25.0s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 92.0s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 27.6s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 23.9s / 💰 $0.07
33_cpu_metrics_discovery 🔗 🟢 100% (1/1) / ⏱️ 26.3s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 35.5s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 121.8s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 79.7s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 45.4s / 💰 $0.13
39_failed_toolset 🔗 🟢 100% (1/1) / ⏱️ 23.3s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 41.7s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 203.3s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 47.6s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 52.3s / 💰 $0.11
41_setup_argo 🔗 🟢 100% (1/1) / ⏱️ 21.3s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 18.5s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 170.6s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 20.0s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 21.2s / 💰 $0.06
42_dns_issues_result_new_tools_no_runbook 🔗 🟢 100% (1/1) / ⏱️ 42.7s / 💰 $0.15 🔴 0% (0/1) / ⏱️ 68.1s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 267.0s / 💰 $0.25 🟢 100% (1/1) / ⏱️ 84.0s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 119.2s / 💰 $0.40
42_dns_issues_steps_new_tools 🔗 🟢 100% (1/1) / ⏱️ 49.8s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 55.2s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 391.2s / 💰 $0.24 🟢 100% (1/1) / ⏱️ 94.6s / 💰 $0.27 ⏱️ 0% (0/1) / ⏱️ 694.5s
43_current_datetime_from_prompt 🔗 🟢 100% (1/1) / ⏱️ 17.2s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 38.3s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 65.1s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 18.8s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 17.7s / 💰 $0.06
45_fetch_deployment_logs_simple 🔗 🟢 100% (1/1) / ⏱️ 31.7s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 51.7s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 103.1s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 37.6s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 43.1s / 💰 $0.11
50_logs_since_specific_date 🔗 🟢 100% (1/1) / ⏱️ 13.9s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 18.5s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 144.3s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 32.2s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 25.1s / 💰 $0.10
50a_logs_since_last_specific_month 🔗 🟢 100% (1/1) / ⏱️ 28.6s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 34.0s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 113.9s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 50.0s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 33.8s / 💰 $0.08
51_logs_summarize_errors 🔗 🟢 100% (1/1) / ⏱️ 31.8s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 31.3s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 105.5s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 40.7s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 44.0s / 💰 $0.10
52_logs_login_issues 🔗 🔴 0% (0/1) / ⏱️ 39.7s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 104.7s / 💰 $0.44 🔴 0% (0/1) / ⏱️ 47.5s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 65.8s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 59.9s / 💰 $0.11
53_logs_find_term 🔗 🟢 100% (1/1) / ⏱️ 32.3s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 53.1s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 73.0s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 41.2s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 42.5s / 💰 $0.14
54_not_truncated_when_getting_pods 🔗 🟢 100% (1/1) / ⏱️ 33.7s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 39.0s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 237.9s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 55.1s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 66.3s / 💰 $0.13
57_wrong_namespace 🔗 🔴 0% (0/1) / ⏱️ 30.9s / 💰 $0.10 🔴 0% (0/1) / ⏱️ 43.3s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 129.6s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 45.7s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 43.5s / 💰 $0.10
59_label_based_counting 🔗 🟢 100% (1/1) / ⏱️ 27.3s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 31.2s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 98.3s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 27.3s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 26.9s / 💰 $0.08
60_count_less_than 🔗 🟢 100% (1/1) / ⏱️ 35.3s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 28.5s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 167.5s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 35.0s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 38.2s / 💰 $0.09
61_exact_match_counting 🔗 🟢 100% (1/1) / ⏱️ 40.3s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 31.9s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 65.3s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 26.6s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 26.3s / 💰 $0.08
62_fetch_error_logs_with_errors 🔗 🟢 100% (1/1) / ⏱️ 31.7s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 42.8s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 121.2s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 38.1s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 39.2s / 💰 $0.09
63_fetch_error_logs_no_errors 🔗 🟢 100% (1/1) / ⏱️ 30.6s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 33.6s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 79.7s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 35.8s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 35.4s / 💰 $0.09
64_keda_vs_hpa_confusion 🔗 🟢 100% (1/1) / ⏱️ 63.2s / 💰 $0.22 🔴 0% (0/1) / ⏱️ 54.1s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 184.6s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 117.7s / 💰 $0.19 🟢 100% (1/1) / ⏱️ 66.7s / 💰 $0.17
65_health_check_followup 🔗 🟢 100% (1/1) / ⏱️ 44.5s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 49.3s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 263.4s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 69.0s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 70.5s / 💰 $0.26
71_connection_pool_starvation 🔗 🟢 100% (1/1) / ⏱️ 38.7s / 💰 $0.13 🔴 0% (0/1) / ⏱️ 58.8s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 161.8s / 💰 $0.19 🟢 100% (1/1) / ⏱️ 57.2s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 56.8s / 💰 $0.17
73a_time_window_anomaly 🔗 🟢 100% (1/1) / ⏱️ 42.0s / 💰 $0.17 🔴 0% (0/1) / ⏱️ 34.0s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 157.5s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 63.5s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 64.9s / 💰 $0.18
73b_time_window_anomaly 🔗 🟢 100% (1/1) / ⏱️ 44.2s / 💰 $0.17 🔴 0% (0/1) / ⏱️ 29.7s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 91.4s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 57.3s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 62.6s / 💰 $0.14
76_service_discovery_issue 🔗 🟢 100% (1/1) / ⏱️ 40.5s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 60.8s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 190.0s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 654.9s / 💰 $0.14 ⏱️ 0% (0/1) / ⏱️ 648.9s
77_liveness_probe_misconfiguration 🔗 🟢 100% (1/1) / ⏱️ 40.1s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 41.7s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 185.6s / 💰 $0.19 🟢 100% (1/1) / ⏱️ 48.8s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 53.5s / 💰 $0.13
78a_missing_cpu_limits 🔗 🔴 0% (0/1) / ⏱️ 25.9s / 💰 $0.07 🔴 0% (0/1) / ⏱️ 30.9s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 217.1s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 54.7s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 58.8s / 💰 $0.14
78b_cpu_quota_exceeded 🔗 🔴 0% (0/1) / ⏱️ 51.1s / 💰 $0.24 🔴 0% (0/1) / ⏱️ 44.6s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 81.1s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 53.5s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 52.6s / 💰 $0.14
79_configmap_mount_issue 🔗 🟢 100% (1/1) / ⏱️ 31.2s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 43.4s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 193.0s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 46.3s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 63.1s / 💰 $0.12
80_pvc_storage_class_mismatch 🔗 🔴 0% (0/1) / ⏱️ 32.4s / 💰 $0.11 🔴 0% (0/1) / ⏱️ 49.5s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 95.6s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 81.8s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 57.5s / 💰 $0.15
81_service_account_permission_denied 🔗 🟢 100% (1/1) / ⏱️ 38.1s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 58.2s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 260.8s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 99.1s / 💰 $0.23 🟢 100% (1/1) / ⏱️ 71.0s / 💰 $0.17
82_pod_anti_affinity_conflict 🔗 🔴 0% (0/1) / ⏱️ 37.4s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 45.9s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 201.0s / 💰 $0.23 🟢 100% (1/1) / ⏱️ 61.9s / 💰 $0.16 🟢 100% (1/1) / ⏱️ 63.0s / 💰 $0.17
83_secret_not_found 🔗 🟢 100% (1/1) / ⏱️ 36.1s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 50.2s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 125.2s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 44.2s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 49.3s / 💰 $0.13
84_network_policy_blocking_traffic 🔗 🟢 100% (1/1) / ⏱️ 38.2s / 💰 $0.17 🔴 0% (0/1) / ⏱️ 79.0s / 💰 $0.18 🟢 100% (1/1) / ⏱️ 238.0s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 99.6s / 💰 $0.21 🟢 100% (1/1) / ⏱️ 59.7s / 💰 $0.15
85_hpa_not_scaling 🔗 🔴 0% (0/1) / ⏱️ 34.6s / 💰 $0.11 🟢 100% (1/1) / ⏱️ 42.1s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 327.4s / 💰 $0.25 🟢 100% (1/1) / ⏱️ 58.7s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 56.9s / 💰 $0.17
86_configmap_like_but_secret 🔗 🟢 100% (1/1) / ⏱️ 48.2s / 💰 $0.22 🟢 100% (1/1) / ⏱️ 45.2s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 334.3s / 💰 $0.22 🟢 100% (1/1) / ⏱️ 50.8s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 61.6s / 💰 $0.16
89_runbook_missing_cloudwatch 🔗 🟢 100% (1/1) / ⏱️ 30.0s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 27.4s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 179.6s / 💰 $0.13 🟢 100% (1/1) / ⏱️ 40.7s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 46.0s / 💰 $0.10
90_runbook_basic_selection 🔗 🟢 100% (1/1) / ⏱️ 52.0s / 💰 $0.26 🟢 100% (1/1) / ⏱️ 72.9s / 💰 $0.09 🟢 100% (1/1) / ⏱️ 275.4s / 💰 $0.20 🟢 100% (1/1) / ⏱️ 202.4s / 💰 $0.51 🟢 100% (1/1) / ⏱️ 127.9s / 💰 $0.32
91f_datadog_logs_historical_pod 🔗 🔴 0% (0/1) / ⏱️ 29.0s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 40.1s / 💰 $0.09 🔴 0% (0/1) / ⏱️ 159.1s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 80.2s / 💰 $0.22 🟢 100% (1/1) / ⏱️ 67.5s / 💰 $0.15
93_calling_datadog[0] 🔗 🟢 100% (1/1) / ⏱️ 61.7s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 11.7s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 41.3s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 13.4s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 10.3s / 💰 $0.15
93_calling_datadog[1] 🔗 🟢 100% (1/1) / ⏱️ 56.5s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 12.2s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 89.5s / 💰 $0.08 🟢 100% (1/1) / ⏱️ 11.8s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 10.4s / 💰 $0.15
93_calling_datadog[2] 🔗 🟢 100% (1/1) / ⏱️ 70.1s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 13.2s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 42.1s / 💰 $0.06 🟢 100% (1/1) / ⏱️ 11.6s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 11.1s / 💰 $0.15
94_runbook_transparency 🔗 🟢 100% (1/1) / ⏱️ 45.4s / 💰 $0.17 🟢 100% (1/1) / ⏱️ 63.4s / 💰 $0.15 🟢 100% (1/1) / ⏱️ 368.3s / 💰 $0.28 🟢 100% (1/1) / ⏱️ 119.5s / 💰 $0.29 🟢 100% (1/1) / ⏱️ 79.7s / 💰 $0.20
96_no_matching_runbook 🔗 🔴 0% (0/1) / ⏱️ 54.3s / 💰 $0.32 🔴 0% (0/1) / ⏱️ 158.5s / 💰 $0.46 🔴 0% (0/1) / ⏱️ 326.2s / 💰 $0.28 🟢 100% (1/1) / ⏱️ 106.8s / 💰 $0.38 🟢 100% (1/1) / ⏱️ 122.3s / 💰 $0.35
97_logs_clarification_needed 🔗 🟢 100% (1/1) / ⏱️ 15.5s / 💰 $0.03 🟢 100% (1/1) / ⏱️ 21.5s / 💰 $0.04 🟢 100% (1/1) / ⏱️ 27.1s / 💰 $0.02 🟢 100% (1/1) / ⏱️ 50.5s / 💰 $0.14 🟢 100% (1/1) / ⏱️ 19.7s / 💰 $0.06
99_logs_transparency_custom_time 🔗 🟢 100% (1/1) / ⏱️ 32.8s / 💰 $0.12 🟢 100% (1/1) / ⏱️ 39.6s / 💰 $0.07 🟢 100% (1/1) / ⏱️ 88.3s / 💰 $0.05 🟢 100% (1/1) / ⏱️ 51.1s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 56.5s / 💰 $0.10
93_events_since_specific_date 🔗 🟢 100% (1/1) / ⏱️ 9.4s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 16.3s / 💰 $0.07 ⚪️ - 🟢 100% (1/1) / ⏱️ 17.8s / 💰 $0.10 🟢 100% (1/1) / ⏱️ 14.6s / 💰 $0.09
44_slack_statefulset_logs 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
48_logs_since_thursday 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
22_high_latency_dbi_down 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
08_sock_shop_frontend 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
104b_postgres_missing_index_pgstat 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
104c_postgres_minimal_missing_index 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
105_redis_wrong_data_structure 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
156_kafka_opensearch_latency 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
43_slack_deployment_logs 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
55_kafka_runbook 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -
98_logs_transparency_default_time 🔗 ⚪️ - ⚪️ - ⚪️ - ⚪️ - ⚪️ -

Results are automatically generated and updated weekly. View full traces and detailed analysis in Braintrust experiment: local-benchmark-20250930-072258.