Monitor Amazon Web Services (AWS) with Beats

In this tutorial, you’ll learn how to monitor your AWS infrastructure using Elastic Observability: Logs and Infrastructure metrics.

What you’ll learn

You’ll learn how to:

Create and configure an S3 bucket
Create and configure an SQS queue.
Install and configure Filebeat and Metricbeat to collect Logs and Infrastructure metrics
Collect logs from S3
Collect metrics from Amazon CloudWatch

Create an Elastic Cloud Hosted deployment or Elastic Observability Serverless project. Both include an Elasticsearch cluster for storing and searching your data and Kibana for visualizing and managing your data.

With this tutorial, we assume that your logs and your infrastructure data are already shipped to CloudWatch. We are going to show you how you can stream your data from CloudWatch to Elasticsearch. If you don’t know how to put your AWS logs and infrastructure data in CloudWatch, Amazon provides a lot of documentation around this specific topic:

Collect your logs and infrastructure data from specific AWS services
Export your logs to an S3 bucket

Step 1: Create an S3 Bucket

To centralize your logs in Elasticsearch, you need to have an S3 bucket. Filebeat, the agent you’ll use to collect logs, has an input for S3.

In the AWS S3 console, click on Create bucket. Give the bucket a name and specify the region in which you want it deployed.

Step 2: Create an SQS Queue

You should now have an S3 bucket in which you can export your logs, but you will also need an SQS queue. To avoid significant lagging with polling all log files from each S3 bucket, we will use Amazon Simple Queue Service (SQS). This will provide us with an Amazon S3 notification when a new S3 object is created. The Filebeat S3 input checks SQS for new messages regarding the new object created in S3 and uses the information in these messages to retrieve logs from S3 buckets. With this setup, periodic polling from each S3 bucket is not needed. Instead, the Filebeat S3 input guarantees near real-time data collection from S3 buckets with both speed and reliability.

Create an SQS queue and configure our S3 bucket to send a message to the SQS queue whenever new logs are present in the S3 bucket. Go to the SQS console

Note

Make sure that the queue is created in the same region as the S3 bucket.

Create a standard SQS queue and edit the access policy by using a JSON object to define an advanced access policy:

Note

Replace <sqs-arn> with the ARN of the SQS queue, <s3-bucket-arn> with the ARN of the S3 bucket you just created, the <source-account> with your source account.

						
					{
  "Version": "2012-10-17",
  "Id": "example-ID",
  "Statement": [
    {
      "Sid": "example-statement-ID",
      "Effect": "Allow",
      "Principal": {
        "AWS": "*"
      },
      "Action": "SQS:SendMessage",
      "Resource": "<sqs-arn>",
      "Condition": {
        "StringEquals": {
          "aws:SourceAccount": "<source-account>"
        },
        "ArnLike": {
          "aws:SourceArn": "<s3-bucket-arn>"
        }
      }
    }
  ]
}
		
	

Step 3: Event Notification

Now that your queue is created, go to the properties of the S3 bucket you created and click Create event notification.

Specify that you want to send a notification on every object creation event.

Set the destination as the SQS queue you just created.

Step 4: Install and configure Filebeat

To monitor AWS using the Elastic Stack, you need two main components: an Elastic deployment to store and analyze the data and an agent to collect and ship the data.

Install Filebeat

Download and install Filebeat.

DEB

		curl -L -O https://artifacts.elastic.co/downloads/beats/filebeat/filebeat-9.4.4-amd64.deb
sudo dpkg -i filebeat-9.4.4-amd64.deb

RPM

		curl -L -O https://artifacts.elastic.co/downloads/beats/filebeat/filebeat-9.4.4-x86_64.rpm
sudo rpm -vi filebeat-9.4.4-x86_64.rpm

MacOS

		curl -L -O https://artifacts.elastic.co/downloads/beats/filebeat/filebeat-9.4.4-darwin-x86_64.tar.gz
tar xzvf filebeat-9.4.4-darwin-x86_64.tar.gz

Linux

		curl -L -O https://artifacts.elastic.co/downloads/beats/filebeat/filebeat-9.4.4-linux-x86_64.tar.gz
tar xzvf filebeat-9.4.4-linux-x86_64.tar.gz

Windows

Download the Filebeat Windows zip file.
Extract the contents of the zip file into C:\Program Files.
Rename the filebeat-[version]-windows-x86_64 directory to Filebeat.
Open a PowerShell prompt as an Administrator (right-click the PowerShell icon and select Run As Administrator).
From the PowerShell prompt, run the following commands to install Filebeat as a Windows service:

		PS > cd 'C:\Program Files\Filebeat'
PS C:\Program Files\Filebeat> .\install-service-filebeat.ps1

Note

If script execution is disabled on your system, you need to set the execution policy for the current session to allow the script to run. For example: PowerShell.exe -ExecutionPolicy UnRestricted -File .\install-service-filebeat.ps1.

Set up assets

Filebeat comes with predefined assets for parsing, indexing, and visualizing your data. Run the following command to load these assets. It can take a few minutes.

		./filebeat setup -e -E 'cloud.id=YOUR_DEPLOYMENT_CLOUD_ID' -E 'cloud.auth=elastic:YOUR_SUPER_SECRET_PASS'
		
	

Substitute your Cloud ID and an administrator’s username:password in this command. To find your Cloud ID, click on your deployment.

Filebeat comes with predefined assets for parsing, indexing, and visualizing your data. Run the following command to load these assets. It can take a few minutes.

		./filebeat setup -e -E 'output.elasticsearch.hosts=["https://hostname:port"]' -E 'output.elasticsearch.api_key=YOUR_API_KEY'
		
	

Substitute your Elasticsearch endpoint and an API key in this command. To find your endpoint URL, select Manage next to your project, then find the Elasticsearch endpoint under Application endpoints, cluster and component IDs. Alternatively, open your project, select the help icon, then select Connection details.

Important

Setting up Filebeat is an admin-level task that requires extra privileges. As a best practice, use an administrator role to set up and a more restrictive role for event publishing (which you will do next).

Configure Filebeat output

Next, you are going to configure Filebeat output to Elasticsearch.

Use the Filebeat keystore to store secure settings. Store your connection details in the keystore.

		./filebeat keystore create
echo -n "<Your Deployment Cloud ID>" | ./filebeat keystore add CLOUD_ID --stdin

		./filebeat keystore create
echo -n "<Your Elasticsearch endpoint URL>" | ./filebeat keystore add ES_HOST --stdin

To store logs in Elasticsearch with minimal permissions, create an API key to send data from Filebeat to Elasticsearch. Log into Kibana (you can do so from the Cloud Console without typing in any permissions) and find Dev Tools in the global search field. Send the following request:

						POST /_security/api_key
					{
  "name": "filebeat-monitor-gcp",
  "role_descriptors": {
    "filebeat_writer": {
      "cluster": [
        "monitor",
        "read_ilm",
        "cluster:admin/ingest/pipeline/get",
        "cluster:admin/ingest/pipeline/put"
      ],
      "index": [
        {
          "names": ["filebeat-*"],
          "privileges": ["view_index_metadata", "create_doc"]
        }
      ]
    }
  }
}
		
	

Filebeat needs extra cluster permissions to publish logs, which differs from the Metricbeat configuration. You can find more details here.

The response contains an api_key and an id field, which can be stored in the Filebeat keystore in the following format: id:api_key.
```
echo -n "IhrJJHMB4JmIUAPLuM35:1GbfxhkMT8COBB4JWY3pvQ" | ./filebeat keystore add ES_API_KEY --stdin
		
```
Note

Make sure you specify the -n parameter; otherwise, you will have painful debugging sessions due to adding a newline at the end of your API key.
To see if both settings have been stored, run the following command:
```
./filebeat keystore list
		
```

To configure Filebeat to output to Elasticsearch, edit the filebeat.yml configuration file. Add the following lines to the end of the file.

		cloud.id: ${CLOUD_ID}
output.elasticsearch:
  api_key: ${ES_API_KEY}
		
	

		output.elasticsearch:
  hosts: ["${ES_HOST}"]
  api_key: ${ES_API_KEY}
		
	

Finally, test if the configuration is working. If it is not working, verify that you used the right credentials and, if necessary, add them again.
```
./filebeat test output
		
```

Step 5: Configure the AWS Module

Now that the output is working, you can set up the Filebeat AWS module which will automatically create the AWS input. This module checks SQS for new messages regarding the new object created in the S3 bucket and uses the information in these messages to retrieve logs from S3 buckets. With this setup, periodic polling from each S3 bucket is not needed.

There are many different filesets available: cloudtrail, vpcflow, ec2, cloudwatch, elb and s3access. In this tutorial, we are going to show you a few examples using the ec2 and the s3access filesets.

The ec2 fileset is used to ship and process logs stored in CloudWatch, and export them to an S3 bucket. The s3access fileset is used when S3 access logs need to be collected. It provides detailed records for the requests that are made to a bucket. Server access logs are useful for many applications. For example, access log information can be useful in security and access audits. It can also help you learn about your customer base and understand your Amazon S3 bill.

Let’s enable the AWS module in Filebeat.

./filebeat modules enable aws

Edit the modules.d/aws.yml file with the following configurations.

		- module: aws
  cloudtrail:
    enabled: false
  cloudwatch:
    enabled: false
  ec2:
    enabled: true
    var.credential_profile_name: fb-aws
    var.queue_url: https://sqs.eu-central-1.amazonaws.com/836370109380/howtoguide-tutorial
  elb:
    enabled: false
  s3access:
    enabled: false
  vpcflow:
    enabled: false
		
	

Enables the ec2 fileset.
This is the AWS profile defined following the AWS standard.
Add the URL to the queue containing notifications around the bucket containing the EC2 logs

Make sure that the AWS user used to collect the logs from S3 has at least the following permissions attached to it:

		{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Action": [
              "s3:GetObject",
              "sqs:ReceiveMessage",
              "sqs:ChangeMessageVisibility",
              "sqs:DeleteMessage"
            ],
            "Effect": "Allow",
            "Resource": "*"
        }
    ]
}
		
	

You can now upload your logs to the S3 bucket. If you are using CloudWatch, make sure to edit the policy of your bucket as shown in step 3 of the AWS user guide. This will help you avoid permissions issues.

Start Filebeat to collect the logs.

./filebeat -e

Here’s what we’ve achieved so far:

Now, let’s configure the s3access fileset. The goal here is to be able to monitor how people access the bucket we created. To do this, we’ll create another bucket and another queue. The new architecture will look like this:

Architecture with Access Logging Enabled

Create a new S3 bucket and SQS queue. Ensure that the event notifications on the new bucket are enabled, and that it’s sending notifications to the new queue.

Now go back to the first bucket, and go to Properties > Server access logging. Specify that you want to ship the access logs to the bucket you most recently created.

Copy the URL of the queue you created. Edit the modules.d/aws.ymlfile with the following configurations.

		- module: aws
  cloudtrail:
    enabled: false
  cloudwatch:
    enabled: false
  ec2:
    enabled: true
    var.credential_profile_name: fb-aws
    var.queue_url: https://sqs.eu-central-1.amazonaws.com/836370109380/howtoguide-tutorial
  elb:
    enabled: false
  s3access:
    enabled: true
    var.credential_profile_name: fb-aws
    var.queue_url: https://sqs.eu-central-1.amazonaws.com/836370109380/access-log
  vpcflow:
    enabled: false
		
	

Enables the ec2 fileset.
This is the AWS profile defined following the AWS standard.
Add the URL to the queue containing notifications around the bucket containing the EC2 logs
Add the URL to the queue containing notifications around the bucket containing the S3 access logs

Once you have edited the config file, you need to restart Filebeat. To stop Filebeat, you can press CTRL + C in the terminal. Now let’s restart Filebeat by running the following command:

./filebeat -e

Step 6: Visualize Logs

Now that the logs are being shipped to Elasticsearch we can visualize them in Kibana. To see the raw logs, find Discover in the main menu or use the global search field.

The filesets we used in the previous steps also come with pre-built dashboards that you can use to visualize the data. In Kibana, find Dashboards in the main menu or use the global search field. Search for S3 and select the dashboard called: [Filebeat AWS] S3 Server Access Log Overview:

This gives you an overview of how your S3 buckets are being accessed.

Step 7: Collect Infrastructure metrics

To monitor your AWS infrastructure you will need to first make sure your infrastructure data are being shipped to CloudWatch. To ship the data to Elasticsearch we are going to use the AWS module from Metricbeat. This module periodically fetches monitoring metrics from AWS CloudWatch using GetMetricData API for AWS services.

Important

Extra AWS charges on CloudWatch API requests will be generated by this module. See AWS API requests for more details.

Step 8: Install and configure Metricbeat

In a new terminal window, run the following commands.

Install Metricbeat

Download and install Metricbeat.

DEB

		curl -L -O https://artifacts.elastic.co/downloads/beats/metricbeat/metricbeat-9.4.4-amd64.deb
sudo dpkg -i metricbeat-9.4.4-amd64.deb

RPM

		curl -L -O https://artifacts.elastic.co/downloads/beats/metricbeat/metricbeat-9.4.4-x86_64.rpm
sudo rpm -vi metricbeat-9.4.4-x86_64.rpm

MacOS

		curl -L -O https://artifacts.elastic.co/downloads/beats/metricbeat/metricbeat-9.4.4-darwin-x86_64.tar.gz
tar xzvf metricbeat-9.4.4-darwin-x86_64.tar.gz

Linux

		curl -L -O https://artifacts.elastic.co/downloads/beats/metricbeat/metricbeat-9.4.4-linux-x86_64.tar.gz
tar xzvf metricbeat-9.4.4-linux-x86_64.tar.gz

Windows

Download the Metricbeat Windows zip file.
Extract the contents of the zip file into C:\Program Files.
Rename the metricbeat-[version]-windows-x86_64 directory to Metricbeat.
Open a PowerShell prompt as an Administrator (right-click the PowerShell icon and select Run As Administrator).
From the PowerShell prompt, run the following commands to install Metricbeat as a Windows service:

		PS > cd 'C:\Program Files\Metricbeat'
PS C:\Program Files\Metricbeat> .\install-service-metricbeat.ps1

Note

Set up assets

Metricbeat comes with predefined assets for parsing, indexing, and visualizing your data. Run the following command to load these assets. It can take a few minutes.

		./metricbeat setup -e -E 'cloud.id=YOUR_DEPLOYMENT_CLOUD_ID' -E 'cloud.auth=elastic:YOUR_SUPER_SECRET_PASS'
		
	

Substitute your Cloud ID and an administrator’s username:password in this command. To find your Cloud ID, click on your deployment.

Metricbeat comes with predefined assets for parsing, indexing, and visualizing your data. Run the following command to load these assets. It can take a few minutes.

		./metricbeat setup -e -E 'output.elasticsearch.hosts=["https://hostname:port"]' -E 'output.elasticsearch.api_key=YOUR_API_KEY'
		
	

Substitute your Elasticsearch endpoint and an API key in this command. To find your endpoint URL, select Manage next to your project, then find the Elasticsearch endpoint under Application endpoints, cluster and component IDs. Alternatively, open your project, select the help icon, then select Connection details.

Important

Setting up Metricbeat is an admin-level task that requires extra privileges. As a best practice, use an administrator role to set up, and a more restrictive role for event publishing (which you will do next).

Configure Metricbeat output

Next, you are going to configure Metricbeat output to Elasticsearch.

Use the Metricbeat keystore to store secure settings. Store your connection details in the keystore.

		./metricbeat keystore create
echo -n "<Your Deployment Cloud ID>" | ./metricbeat keystore add CLOUD_ID --stdin

		./metricbeat keystore create
echo -n "<Your Elasticsearch endpoint URL>" | ./metricbeat keystore add ES_HOST --stdin

To store metrics in Elasticsearch with minimal permissions, create an API key to send data from Metricbeat to Elasticsearch. Log into Kibana (you can do so from the Cloud Console without typing in any permissions) and find Dev Tools in the global search field. From the Console, send the following request:

						POST /_security/api_key
					{
  "name": "metricbeat-monitor",
  "role_descriptors": {
    "metricbeat_writer": {
      "cluster": ["monitor", "read_ilm"],
      "index": [
        {
          "names": ["metricbeat-*"],
          "privileges": ["view_index_metadata", "create_doc"]
        }
      ]
    }
  }
}
		
	

The response contains an api_key and an id field, which can be stored in the Metricbeat keystore in the following format: id:api_key.
```
echo -n "IhrJJHMB4JmIUAPLuM35:1GbfxhkMT8COBB4JWY3pvQ" | ./metricbeat keystore add ES_API_KEY --stdin
		
```
Note

Make sure you specify the -n parameter; otherwise, you will have painful debugging sessions due to adding a newline at the end of your API key.
To see if both settings have been stored, run the following command:
```
./metricbeat keystore list
		
```

To configure Metricbeat to output to Elasticsearch, edit the metricbeat.yml configuration file. Add the following lines to the end of the file.

		cloud.id: ${CLOUD_ID}
output.elasticsearch:
  api_key: ${ES_API_KEY}
		
	

		output.elasticsearch:
  hosts: ["${ES_HOST}"]
  api_key: ${ES_API_KEY}
		
	

Finally, test if the configuration is working. If it is not working, verify if you used the right credentials and add them again.
```
./metricbeat test output
		
```

Now that the output is working, you are going to set up the AWS module.

Step 9: Configure the AWS module

To collect metrics from your AWS infrastructure, we’ll use the Metricbeat AWS module. This module contains many metricsets: billing, cloudwatch, dynamodb, ebs, ec2, elb, lambda, and many more. Each metricset is created to help you stream and process your data. In this tutorial, we’re going to show you a few examples using the ec2 and the billing metricsets.

Let’s enable the AWS module in Metricbeat.
```
./metricbeat modules enable aws
		
```

Edit the modules.d/aws.yml file with the following configurations.

		- module: aws
  period: 24h
  metricsets:
    - billing
  credential_profile_name: mb-aws
  cost_explorer_config:
    group_by_dimension_keys:
      - "AZ"
      - "INSTANCE_TYPE"
      - "SERVICE"
    group_by_tag_keys:
      - "aws:createdBy"
- module: aws
  period: 300s
  metricsets:
    - ec2
  credential_profile_name: mb-aws
		
	

Defines the module that is going to be used.
Defines the period at which the metrics are going to be collected
Defines the metricset that is going to be used.
This is the AWS profile defined following the AWS standard.

Make sure that the AWS user used to collect the metrics from CloudWatch has at least the following permissions attached to it:

		{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Action": [
                "ec2:DescribeInstances",
                "ec2:DescribeRegions",
                "cloudwatch:GetMetricData",
                "cloudwatch:ListMetrics",
                "sts:GetCallerIdentity",
                "iam:ListAccountAliases",
                "tag:getResources",
                "ce:GetCostAndUsage"
            ],
            "Effect": "Allow",
            "Resource": "*"
        }
    ]
}
		
	

You can now start Metricbeat:

./metricbeat -e

Step 10: Visualize metrics

Now that the metrics are being streamed to Elasticsearch we can visualize them in Kibana. To open Infrastructure inventory, find Infrastructure in the main menu or use the global search field. Make sure to show the AWS source and the EC2 Instances:

The metricsets we used in the previous steps also comes with pre-built dashboard that you can use to visualize the data. In Kibana, find Dashboards in the main menu or use the global search field. Search for EC2 and select the dashboard called: [Metricbeat AWS] EC2 Overview:

If you want to track your billings on AWS, you can also check the [Metricbeat AWS] Billing Overview dashboard: