AWS (MCP)¶
The AWS MCP server provides comprehensive access to AWS services through a secure, read-only interface. It enables Holmes to investigate AWS infrastructure issues, analyze CloudTrail events, examine security configurations, troubleshoot service-specific problems, answer cost-related questions, analyze ELB issues, and much more.
Overview¶
The AWS MCP server is deployed as a separate pod in your cluster when using the Holmes or Robusta Helm charts. For CLI users, you'll need to deploy the MCP server manually and configure Holmes to connect to it.
Configuration¶
For CLI usage, you need to deploy the AWS MCP server first, then configure Holmes to connect to it. Below is an example of how to deploy it in your cluster.
Step 1: Deploy the AWS MCP Server¶
Create a file named aws-mcp-deployment.yaml:
```yaml
apiVersion: v1
kind: Namespace
metadata:
  name: holmes-mcp
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: aws-mcp-sa
  namespace: holmes-mcp
  annotations:
    # For EKS IRSA, add your role ARN:
    # eks.amazonaws.com/role-arn: arn:aws:iam::ACCOUNT_ID:role/YOUR_ROLE_NAME
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: aws-mcp-server
  namespace: holmes-mcp
spec:
  replicas: 1
  selector:
    matchLabels:
      app: aws-mcp-server
  template:
    metadata:
      labels:
        app: aws-mcp-server
    spec:
      serviceAccountName: aws-mcp-sa
      containers:
        - name: aws-mcp
          image: us-central1-docker.pkg.dev/genuine-flight-317411/devel/aws-api-mcp-server:1.0.1
          imagePullPolicy: Always
          ports:
            - containerPort: 8000
              name: http
          env:
            - name: AWS_REGION
              value: "us-east-1" # Change to your region
            - name: AWS_DEFAULT_REGION
              value: "us-east-1" # Change to your region
            - name: READ_OPERATIONS_ONLY
              value: "true"
            # If not using IRSA, provide credentials via environment variables:
            # - name: AWS_ACCESS_KEY_ID
            #   value: "your-access-key"
            # - name: AWS_SECRET_ACCESS_KEY
            #   value: "your-secret-key"
          resources:
            requests:
              memory: "512Mi"
              cpu: "250m"
            limits:
              memory: "1Gi"
              cpu: "500m"
          readinessProbe:
            tcpSocket:
              port: 8000
            initialDelaySeconds: 20
            periodSeconds: 10
          livenessProbe:
            tcpSocket:
              port: 8000
            initialDelaySeconds: 30
            periodSeconds: 30
---
apiVersion: v1
kind: Service
metadata:
  name: aws-mcp-server
  namespace: holmes-mcp
spec:
  selector:
    app: aws-mcp-server
  ports:
    - port: 8000
      targetPort: 8000
      protocol: TCP
      name: http
```
Deploy it to your cluster:
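```shell
# Apply the manifest created in Step 1
kubectl apply -f aws-mcp-deployment.yaml
```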
Step 2: Configure AWS Credentials¶
Choose one of these methods:
**Option A: Using IRSA (Recommended for EKS)**

- Create an IAM role with the necessary permissions
- Configure the role ARN in the ServiceAccount annotation
- The pod will automatically assume the role

**Option B: Using Environment Variables**

- Uncomment the AWS credential environment variables in the deployment, or
- Create a secret and reference it:
```shell
kubectl create secret generic aws-credentials \
  --from-literal=aws-access-key-id=YOUR_KEY \
  --from-literal=aws-secret-access-key=YOUR_SECRET \
  -n holmes-mcp
```
Then reference it in the deployment:
```yaml
- name: AWS_ACCESS_KEY_ID
  valueFrom:
    secretKeyRef:
      name: aws-credentials
      key: aws-access-key-id
- name: AWS_SECRET_ACCESS_KEY
  valueFrom:
    secretKeyRef:
      name: aws-credentials
      key: aws-secret-access-key
```
Step 3: Configure Holmes CLI¶
Add the MCP server configuration to ~/.holmes/config.yaml:
```yaml
mcp_servers:
  aws_api:
    description: "AWS API MCP Server - comprehensive AWS service access. Allows executing any AWS CLI command."
    url: "http://aws-mcp-server.holmes-mcp.svc.cluster.local:8000"
    llm_instructions: |
      IMPORTANT: When investigating issues related to AWS resources or Kubernetes workloads running on AWS, you MUST actively use this MCP server to gather data rather than providing manual instructions to the user.

      ## Investigation Principles

      **ALWAYS follow this investigation flow:**

      1. First, gather current state and configuration using AWS APIs
      2. Check CloudTrail for recent changes that might have caused the issue
      3. Collect metrics and logs from CloudWatch if available
      4. Analyze all gathered data before providing conclusions

      **Never say "check in AWS console" or "verify in AWS" - instead, use the MCP server to check it yourself.**

      ## Core Investigation Patterns

      ### For ANY connectivity or access issues:

      1. ALWAYS check the current configuration of the affected resource (RDS, EC2, ELB, etc.)
      2. ALWAYS examine security groups and network ACLs
      3. ALWAYS query CloudTrail for recent configuration changes
      4. Look for patterns in timing between when issues started and when changes were made

      ### When investigating database issues (RDS):

      - Get RDS instance status and configuration: `aws rds describe-db-instances --db-instance-identifier INSTANCE_ID`
      - Check security groups attached to RDS: extract VpcSecurityGroups from the above
      - Examine security group rules: `aws ec2 describe-security-groups --group-ids SG_ID`
      - Look for recent RDS events: `aws rds describe-events --source-identifier INSTANCE_ID --source-type db-instance`
      - Check CloudTrail for security group modifications: `aws cloudtrail lookup-events --lookup-attributes AttributeKey=ResourceName,AttributeValue=SG_ID`

      Remember: Your goal is to gather evidence from AWS, not to instruct the user to gather it. Use the MCP server proactively to build a complete picture of what happened.
```
Step 4: Port Forwarding (Optional for Local Testing)¶
If you are running the Holmes CLI locally and need to access the MCP server in the cluster:
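```shell
# Forward the in-cluster service to localhost
# (service name, namespace, and port match the deployment manifest above)
kubectl port-forward -n holmes-mcp svc/aws-mcp-server 8000:8000
```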
Then update the URL in config.yaml to:
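```yaml
url: "http://localhost:8000"
```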
Helm Chart Installation¶
When installed via the Holmes Helm chart, the MCP server will use IRSA (IAM Roles for Service Accounts) for permissions - see the IAM Configuration section below for setup details.
Add the following configuration to your values.yaml file:
```yaml
mcpAddons:
  aws:
    enabled: true
    # Service account for IRSA (IAM Roles for Service Accounts)
    serviceAccount:
      create: true
      annotations:
        # Add your EKS IRSA role ARN here
        eks.amazonaws.com/role-arn: "arn:aws:iam::ACCOUNT_ID:role/YOUR_ROLE_NAME"
    # AWS configuration
    config:
      region: "us-east-1" # Your AWS region
```
For additional configuration options (resources, network policy, node selectors, etc.), see the full chart values.
Then deploy or upgrade your Holmes installation:
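```shell
# Release and chart names below are placeholders - substitute
# the ones used by your Holmes installation
helm upgrade --install holmes <holmes-chart> -f values.yaml
```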
Robusta Helm Chart Installation¶
When installed via the Robusta Helm chart, the MCP server will use IRSA (IAM Roles for Service Accounts) for permissions - see the IAM Configuration section below for setup details.
Add the following configuration to your generated_values.yaml:
```yaml
globalConfig:
  # Your existing Robusta configuration

# Add the Holmes MCP addon configuration
holmes:
  mcpAddons:
    aws:
      enabled: true
      # Service account for IRSA (IAM Roles for Service Accounts)
      serviceAccount:
        create: true
        annotations:
          # Add your EKS IRSA role ARN here
          eks.amazonaws.com/role-arn: "arn:aws:iam::ACCOUNT_ID:role/YOUR_ROLE_NAME"
      # AWS configuration
      config:
        region: "us-east-1" # Your AWS region
```
For additional configuration options (resources, network policy, node selectors, etc.), see the full chart values.
Then deploy or upgrade your Robusta installation:
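```shell
# Standard Robusta upgrade command - adjust the release name and
# any extra --set flags to match your installation
helm upgrade robusta robusta/robusta --values=generated_values.yaml
```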
IAM Configuration¶
EKS with IRSA (Recommended)¶
For EKS clusters, use IAM Roles for Service Accounts (IRSA) for secure, fine-grained permissions:
- Create an IAM policy with the required permissions (see example below)
- Create an IAM role and attach the policy
- Associate the role with the service account using the annotation in values.yaml
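The steps above can be sketched with `eksctl` (cluster name and account ID are placeholders, and the service-account name matches the manual deployment example earlier - the Helm chart creates its own service account, so adjust accordingly):

```shell
# Creates the IAM role with an IRSA trust policy, attaches the policy,
# and annotates the Kubernetes service account with the role ARN
eksctl create iamserviceaccount \
  --cluster YOUR_CLUSTER \
  --namespace holmes-mcp \
  --name aws-mcp-sa \
  --attach-policy-arn arn:aws:iam::ACCOUNT_ID:policy/HolmesMCPReadOnly \
  --approve
```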
IAM Policy¶
The AWS MCP server requires comprehensive read-only permissions across AWS services. We provide a complete IAM policy that covers:
- Core Observability: CloudWatch, Logs, Events
- Compute & Networking: EC2, ELB, Auto Scaling, VPC
- Containers: EKS, ECS, ECR
- Security: IAM, CloudTrail, GuardDuty, Security Hub
- Databases: RDS, ElastiCache, DocumentDB, Neptune
- Cost Management: Cost Explorer, Budgets, Organizations
- Storage: S3, EBS, EFS, Backup
- Serverless: Lambda, Step Functions, API Gateway, SNS, SQS
- And more...
You can find the complete IAM policy in:
- GitHub: aws-mcp-iam-policy.json
Quick Setup¶
For EKS IRSA setup, we provide helper scripts to create the policy and IAM role. To create them manually:
```shell
# Download the policy
curl -O https://raw.githubusercontent.com/robusta-dev/holmes-mcp-integrations/master/servers/aws/aws-mcp-iam-policy.json

# Create the IAM policy
aws iam create-policy \
  --policy-name HolmesMCPReadOnly \
  --policy-document file://aws-mcp-iam-policy.json

# Attach it to your role (for IRSA)
aws iam attach-role-policy \
  --role-name YOUR_ROLE_NAME \
  --policy-arn arn:aws:iam::ACCOUNT_ID:policy/HolmesMCPReadOnly
```
Multi-Account Setup¶
For scenarios where you need to access multiple AWS accounts from your EKS clusters, the AWS MCP server supports multi-account access using cross-account IAM roles and EKS token projection.
When to Use Multi-Account Setup¶
- You have multiple AWS accounts (dev, staging, prod, etc.)
- You want pods in any cluster to access resources in target accounts
- You need centralized IAM role management across accounts
- You're using AWS Organizations or multi-account architectures
How It Works¶
When multi-account mode is enabled, the MCP server:
- Uses EKS token projection instead of IRSA (IAM Roles for Service Accounts)
- Mounts an `accounts.yaml` configuration file that defines target accounts and their IAM roles
- Uses `assume_role_with_web_identity` to assume roles in target accounts
- Allows the LLM to specify which account to use via the `--profile` flag
IAM Setup with setup-multi-account-iam.sh¶
For multi-account setup, we provide a helper script that automates the creation of cross-account OIDC providers and IAM roles.
What the Script Does¶
For each target account, the script:
- **Creates OIDC providers**: sets up OIDC providers for each cluster in the target account
- **Creates an IAM role**: creates a role with a trust policy allowing `assume_role_with_web_identity` from all configured clusters
- **Attaches permissions**: applies the read-only permissions policy to the role
This enables pods running in any of your clusters to assume roles in target accounts and access AWS resources there.
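The trust policy the script creates in each target account looks roughly like the following sketch (one federated principal and condition per cluster OIDC provider; the account ID, issuer ID, namespace, and service-account name are illustrative values taken from the example configuration below):

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": {
        "Federated": "arn:aws:iam::222222222222:oidc-provider/oidc.eks.us-east-1.amazonaws.com/id/AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA"
      },
      "Action": "sts:AssumeRoleWithWebIdentity",
      "Condition": {
        "StringEquals": {
          "oidc.eks.us-east-1.amazonaws.com/id/AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA:sub": "system:serviceaccount:default:multi-account-mcp-sa"
        }
      }
    }
  ]
}
```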
Configuration File¶
Create a YAML config file; an example configuration file is available for download (see Download the Script below).
Example configuration:
```yaml
clusters:
  - name: prod-cluster
    region: us-east-1
    account_id: "111111111111"
    oidc_issuer_id: AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
    oidc_issuer_url: https://oidc.eks.us-east-1.amazonaws.com/id/AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
  - name: staging-cluster
    region: us-west-2
    account_id: "111111111111"
    oidc_issuer_id: BBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB
    oidc_issuer_url: https://oidc.eks.us-west-2.amazonaws.com/id/BBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB

kubernetes:
  namespace: default
  service_account: multi-account-mcp-sa

iam:
  role_name: EKSMultiAccountMCPRole
  policy_name: MCPReadOnlyPolicy
  session_duration: 3600

target_accounts:
  - profile: dev
    account_id: "111111111111"
    description: "Development account"
  - profile: prod
    account_id: "222222222222"
    description: "Production account"
```
Getting OIDC Issuer Information¶
For each cluster, you need the OIDC issuer ID and URL:
```shell
# Get the OIDC issuer URL
aws eks describe-cluster --name <cluster-name> --query "cluster.identity.oidc.issuer" --output text

# Extract the issuer ID from the URL
# URL format: https://oidc.eks.<region>.amazonaws.com/id/<ISSUER_ID>
```
Running the Setup¶
```shell
# Basic usage (uses the default config: multi-cluster-config.yaml)
./setup-multi-account-iam.sh setup

# With a custom config file
./setup-multi-account-iam.sh setup my-config.yaml

# With a custom permissions file
./setup-multi-account-iam.sh setup my-config.yaml ./aws-mcp-iam-policy.json

# Verify the setup
./setup-multi-account-iam.sh verify my-config.yaml

# Teardown (removes all created resources)
./setup-multi-account-iam.sh teardown my-config.yaml
```
Download the Script¶
```shell
# Download the setup script
curl -O https://raw.githubusercontent.com/robusta-dev/holmes-mcp-integrations/master/servers/aws/setup-multi-account-iam.sh
chmod +x setup-multi-account-iam.sh

# Download the example configuration file
curl -O https://raw.githubusercontent.com/robusta-dev/holmes-mcp-integrations/master/servers/aws/multi-cluster-config-example.yaml
```
Helm Chart Configuration¶
Once the IAM roles are set up, configure the Helm chart to enable multi-account mode:
Add the following configuration to your values.yaml file:
```yaml
mcpAddons:
  aws:
    enabled: true
    # AWS configuration
    config:
      region: "us-east-1" # Your AWS region
      readOnlyMode: true
    # Multi-account configuration
    multiAccount:
      enabled: true
      profiles:
        dev:
          account_id: "111111111111"
          role_arn: "arn:aws:iam::111111111111:role/EKSMultiAccountMCPRole"
          region: "us-east-1" # optional, defaults to the region specified in config
        prod:
          account_id: "222222222222"
          role_arn: "arn:aws:iam::222222222222:role/EKSMultiAccountMCPRole"
          region: "us-east-1" # optional, defaults to the region specified in config
      llm_account_descriptions: |
        You must use the --profile flag to specify the account to use.
        Example: --profile dev - this is the development account and contains the development resources
        Example: --profile prod - this is the production account and contains the production resources
    # Note: when multiAccount.enabled is true, IRSA annotations are not used.
    # The service account will use EKS token projection instead.
    serviceAccount:
      create: true
      # annotations are ignored when multiAccount is enabled
```
For Robusta installations, add the following configuration to your generated_values.yaml:
```yaml
holmes:
  mcpAddons:
    aws:
      enabled: true
      # AWS configuration
      config:
        region: "us-east-1" # Your AWS region
        readOnlyMode: true
      # Multi-account configuration
      multiAccount:
        enabled: true
        profiles:
          dev:
            account_id: "111111111111"
            role_arn: "arn:aws:iam::111111111111:role/EKSMultiAccountMCPRole"
            region: "us-east-1" # optional, defaults to the region specified in config
          prod:
            account_id: "222222222222"
            role_arn: "arn:aws:iam::222222222222:role/EKSMultiAccountMCPRole"
            region: "us-east-1" # optional, defaults to the region specified in config
        llm_account_descriptions: |
          You must use the --profile flag to specify the account to use.
          Example: --profile dev - this is the development account and contains the development resources
          Example: --profile prod - this is the production account and contains the production resources
      # Note: when multiAccount.enabled is true, IRSA annotations are not used.
      # The service account will use EKS token projection instead.
      serviceAccount:
        create: true
        # annotations are ignored when multiAccount is enabled
```
Important Notes¶
- When `multiAccount.enabled` is `true`, the Helm chart automatically:
    - Creates an `accounts.yaml` ConfigMap with your account profiles
    - Mounts the `accounts.yaml` file at `/etc/aws/accounts.yaml` in the pod
    - Mounts the EKS service account token at `/var/run/secrets/eks.amazonaws.com/serviceaccount`
    - Skips IRSA annotations on the service account (since multi-account uses token projection)
- The `llm_account_descriptions` field is automatically appended to the LLM instructions to help guide the AI on how to use the different accounts
- Each profile can optionally specify a `region` that overrides the default region from `config.region`
- The MCP server will use the EKS token to assume roles in target accounts using `assume_role_with_web_identity`
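As an illustrative sketch of the mechanism described above (not the server's actual code; the function and variable names here are hypothetical), resolving a `--profile` flag against the mounted profile entries might look like this:

```python
# Hypothetical sketch: map a --profile name to the role ARN and region
# used for the assume-role-with-web-identity call.

DEFAULT_REGION = "us-east-1"  # corresponds to config.region in the chart values

def resolve_profile(profiles: dict, name: str) -> tuple[str, str]:
    """Return (role_arn, region) for the given profile name."""
    if name not in profiles:
        raise KeyError(f"unknown profile {name!r}; known: {sorted(profiles)}")
    entry = profiles[name]
    # Each profile may override the default region from config.region
    return entry["role_arn"], entry.get("region", DEFAULT_REGION)

# Profile data mirroring the accounts.yaml example above
profiles = {
    "dev": {"account_id": "111111111111",
            "role_arn": "arn:aws:iam::111111111111:role/EKSMultiAccountMCPRole"},
    "prod": {"account_id": "222222222222",
             "role_arn": "arn:aws:iam::222222222222:role/EKSMultiAccountMCPRole",
             "region": "us-west-2"},
}

print(resolve_profile(profiles, "prod"))
# → ('arn:aws:iam::222222222222:role/EKSMultiAccountMCPRole', 'us-west-2')
```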
Capabilities¶
The AWS MCP server provides access to all AWS services through the AWS CLI. Common investigation patterns include:
CloudTrail Investigation¶
- Query recent API calls and configuration changes
- Find who made specific changes
- Correlate changes with issue timelines
- Audit security events
EC2 and Networking¶
- Describe instances, security groups, VPCs
- Check network ACLs and route tables
- Investigate connectivity issues
- Review instance metadata and status
RDS Database Issues¶
- Check database instance status and configuration
- Review security groups and network access
- Analyze performance metrics
- Look up recent events and modifications
EKS/Container Issues¶
- Describe cluster configuration
- Check node group status
- Query CloudWatch Container Insights
- Review pod logs and metrics
Load Balancers¶
- Check target health
- Review listener configurations
- Investigate traffic patterns
- Analyze access logs
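For example, target health and listener configuration can be pulled directly (the ARNs are placeholders):

```shell
# Check target health for a target group
aws elbv2 describe-target-health --target-group-arn TARGET_GROUP_ARN

# Review listener configuration for a load balancer
aws elbv2 describe-listeners --load-balancer-arn LOAD_BALANCER_ARN
```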
Cost and Usage¶
- Query cost and usage reports
- Analyze spending trends
- Identify expensive resources
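For example, a monthly cost breakdown by service can be queried via Cost Explorer (dates are placeholders):

```shell
# Monthly unblended cost, grouped by service
aws ce get-cost-and-usage \
  --time-period Start=2024-01-01,End=2024-02-01 \
  --granularity MONTHLY \
  --metrics UnblendedCost \
  --group-by Type=DIMENSION,Key=SERVICE
```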