Note: This page describes the ECS Fargate integration. For EKS Fargate, see the documentation for Datadog's EKS Fargate integration.
Get metrics from all your containers running in ECS Fargate:
- CPU/Memory usage & limit metrics
- Monitor your applications running on Fargate using Datadog integrations or custom metrics.
The Datadog Agent retrieves metrics for the task definition's containers with the ECS task metadata endpoint. According to the ECS Documentation on that endpoint:
This endpoint returns Docker stats JSON for all of the containers associated with the task. For more information about each of the returned stats, see ContainerStats in the Docker API documentation.
The Task Metadata endpoint is only available from within the task definition itself, which is why the Datadog Agent needs to be run as an additional container within the task definition.
The only configuration required to enable this metrics collection is to set an environment variable ECS_FARGATE
to "true"
in the task definition.
The following steps cover setup of the Datadog Container Agent within AWS ECS Fargate. Note: Datadog Agent version 6.1.1 or higher is needed to take full advantage of the Fargate integration.
Tasks that do not have the Datadog Agent still report metrics with Cloudwatch, however the Agent is needed for Autodiscovery, detailed container metrics, tracing, and more. Additionally, Cloudwatch metrics are less granular, and have more latency in reporting than metrics shipped directly through the Datadog Agent.
To monitor your ECS Fargate tasks with Datadog, run the Agent as a container in same task definition as your application. To collect metrics with Datadog, each task definition should include a Datadog Agent container in addition to the application containers. Follow these setup steps:
- Create an ECS Fargate task
- Create or Modify your IAM Policy
- Run the task as a replica service
The primary unit of work in Fargate is the task, which is configured in the task definition. A task definition is comparable to a pod in Kubernetes. A task definition must contain one or more containers. In order to run the Datadog Agent, create your task definition to run your application container(s), as well as the Datadog Agent container.
The instructions below show you how to configure the task using the AWS CLI tools or the Amazon Web Console.
- Log in to your AWS Web Console and navigate to the ECS section.
- Click on Task Definitions in the left menu, then click the Create new Task Definition button.
- Select Fargate as the launch type, then click the Next step button.
- Enter a Task Definition Name, such as
my-app-and-datadog
. - Select a task execution IAM role. See permission requirements in the Create or Modify your IAM Policy section below.
- Choose Task memory and Task CPU based on your needs.
- Click the Add container button.
- For Container name enter
datadog-agent
. - For Image enter
datadog/agent:latest
. - For Memory Limits enter
256
soft limit. - Scroll down to the Advanced container configuration section and enter
10
in CPU units. - For Env Variables, add the Key
DD_API_KEY
and enter your Datadog API Key as the value. If you feel more comfortable storing secrets in s3, see the ECS Configuration guide. - Add another environment variable using the Key
ECS_FARGATE
and the valuetrue
. Click Add to add the container. - Add another environment variable using the Key
DD_SITE
and the value {{< region-param key="dd_site" code="true" >}}. This defaults todatadoghq.com
if you don't set it. - (Windows Only) Select
C:\
as the working directory. - Add your other containers such as your app. For details on collecting integration metrics, see Integration Setup for ECS Fargate.
- Click Create to create the task definition.
- Download datadog-agent-ecs-fargate. Note: If you are using Internet Explorer, this may download as gzip file, which contains the JSON file mentioned below.**
- Update the JSON with a
TASK_NAME
, your Datadog API Key, and the appropriateDD_SITE
({{< region-param key="dd_site" code="true" >}}). Note: The environment variableECS_FARGATE
is already set to"true"
. - Add your other containers such as your app. For details on collecting integration metrics, see Integration Setup for ECS Fargate.
- Execute the following command to register the ECS task definition:
aws ecs register-task-definition --cli-input-json file://<PATH_TO_FILE>/datadog-agent-ecs-fargate.json
You can use AWS CloudFormation templating to configure your Fargate containers. Use the AWS::ECS::TaskDefinition
resource within your CloudFormation template to set the Amazon ECS task and specify FARGATE
as the required launch type for that task. You can then set the Datadog
option to configure log management, like in the example below:
Resources:
ECSTDNJH3:
Type: 'AWS::ECS::TaskDefinition'
Properties:
NetworkMode: awsvpc
RequiresCompatibilities:
- FARGATE
Cpu: 256
Memory: 1GB
ContainerDefinitions:
- Name: tomcat-test
Image: 'tomcat:jdk8-adoptopenjdk-openj9'
LogConfiguration:
LogDriver: awsfirelens
Options:
Name: datadog
Host: http-intake.logs.datadoghq.com
TLS: 'on'
dd_service: test-service
dd_source: test-source
provider: ecs
apikey: <API_KEY>
MemoryReservation: 500
- Name: log_router
Image: 'amazon/aws-for-fluent-bit:stable'
Essential: true
FirelensConfiguration:
Type: fluentbit
Options:
enable-ecs-log-metadata: true
MemoryReservation: 50
Note: Use a TaskDefinition secret to avoid exposing the apikey
in plain text.
For more information on CloudFormation templating and syntax, see the AWS CloudFormation documentation.
Add the following permissions to your Datadog IAM policy to collect ECS Fargate metrics. For more information, see the ECS policies on the AWS website.
AWS Permission | Description |
---|---|
ecs:ListClusters |
List available clusters. |
ecs:ListContainerInstances |
List instances of a cluster. |
ecs:DescribeContainerInstances |
Describe instances to add metrics on resources and tasks running. |
The only option in ECS Fargate is to run the task as a Replica Service. The Datadog Agent runs in the same task definition as your application and integration containers.
Run the following commands using the AWS CLI tools.
Note: Fargate version 1.1.0 or greater is required, so the command below specifies the platform version.
If needed, create a cluster:
aws ecs create-cluster --cluster-name "<CLUSTER_NAME>"
Run the task as a service for your cluster:
aws ecs run-task --cluster <CLUSTER_NAME> \
--network-configuration "awsvpcConfiguration={subnets=["<PRIVATE_SUBNET>"],securityGroups=["<SECURITY_GROUP>"]}" \
--task-definition arn:aws:ecs:us-east-1:<AWS_ACCOUNT_NUMBER>:task-definition/<TASK_NAME>:1 \
--region <AWS_REGION> --launch-type FARGATE --platform-version 1.1.0
- Log in to your AWS Web Console and navigate to the ECS section. If needed, create a cluster with the Networking only cluster template.
- Choose the cluster to run the Datadog Agent on.
- On the Services tab, click the Create button.
- For Launch type, choose FARGATE.
- For Task Definition, select the task created in the previous steps.
- Enter a Service name.
- For Number of tasks enter
1
, then click the Next step button. - Select the Cluster VPC, Subnets, and Security Groups.
- Load balancing and Service discovery are optional based on your preference.
- Click the Next step button.
- Auto Scaling is optional based on your preference.
- Click the Next step button, then click the Create service button.
After the Datadog Agent is setup as described above, the ecs_fargate check collects metrics with autodiscovery enabled. Add Docker labels to your other containers in the same task to collect additional metrics.
For details on collecting integration metrics, see Integration Setup for ECS Fargate.
Metrics are collected with DogStatsD through UDP port 8125.
To send custom metrics by listening to DogStatsD packets from other containers, set the environment variable DD_DOGSTATSD_NON_LOCAL_TRAFFIC
to true
within the Datadog Agent container.
For environment variables available with the Docker Agent container, see the Docker Agent page. Note: Some variables are not be available for Fargate.
Environment Variable | Description |
---|---|
DD_DOCKER_LABELS_AS_TAGS |
Extract docker container labels |
DD_DOCKER_ENV_AS_TAGS |
Extract docker container environment variables |
DD_KUBERNETES_POD_LABELS_AS_TAGS |
Extract pod labels |
DD_CHECKS_TAG_CARDINALITY |
Add tags to check metrics |
DD_DOGSTATSD_TAG_CARDINALITY |
Add tags to custom metrics |
For global tagging, it is recommended to use DD_DOCKER_LABELS_AS_TAGS
. With this method, the Agent pulls in tags from your Docker container labels. This requires you to add the appropriate labels to your other Docker containers. Labels can be added directly in the task definition.
Format for the Agent container:
{
"name": "DD_DOCKER_LABELS_AS_TAGS",
"value": "{\"<LABEL_NAME_TO_COLLECT>\":\"<TAG_KEY_FOR_DATADOG>\"}"
}
Example for the Agent container:
{
"name": "DD_DOCKER_LABELS_AS_TAGS",
"value": "{\"com.docker.compose.service\":\"service_name\"}"
}
Note: You should not use DD_HOSTNAME
since there is no concept of a host to the user in Fargate. DD_TAGS
is traditionally used to assign host tags, but as of Datadog Agent version 6.13.0 you can also use the environment variable to set global tags on your integration metrics.
In addition to the metrics collected by the Datadog Agent, Datadog has a CloudWatch based ECS integration. This integration collects the Amazon ECS CloudWatch Metrics.
As noted there, Fargate tasks also report metrics in this way:
The metrics made available will depend on the launch type of the tasks and services in your clusters. If you are using the Fargate launch type for your services then CPU and memory utilization metrics are provided to assist in the monitoring of your services.
Since this method does not use the Datadog Agent, you need to configure the AWS integration by checking ECS on the integration tile. Then, Datadog pulls these CloudWatch metrics (namespaced aws.ecs.*
in Datadog) on your behalf. See the Data Collected section of the documentation.
If these are the only metrics you need, you could rely on this integration for collection using CloudWatch metrics. Note: CloudWatch data is less granular (1-5 min depending on the type of monitoring you have enabled) and delayed in reporting to Datadog. This is because the data collection from CloudWatch must adhere to AWS API limits, instead of pushing it to Datadog with the Agent.
Datadog's default CloudWatch crawler polls metrics once every 10 minutes. If you need a faster crawl schedule, contact Datadog support for availability. Note: There are cost increases involved on the AWS side as CloudWatch bills for API calls.
You can monitor Fargate logs by using the AWS FireLens integration built on Datadog's Fluentbit output plugin to send logs to Datadog, or by using the awslogs
log driver and a Lambda function to route logs to Datadog. Datadog recommends using AWS FireLens because you can configure Fluent Bit directly in your Fargate tasks.
Configure the AWS FireLens integration built on Datadog's Fluent Bit output plugin to connect your FireLens monitored log data to Datadog Logs.
-
Enable Fluent Bit in the FireLens log router container in your Fargate task. For more information about enabling FireLens, see the dedicated AWS Firelens docs. For more information about Fargate container definitions, see the AWS docs on Container Definitions. AWS recommends that you use the regional Docker image. Here is an example snippet of a task definition where the Fluent Bit image is configured:
{ "essential": true, "image": "amazon/aws-for-fluent-bit:stable", "name": "log_router", "firelensConfiguration": { "type": "fluentbit", "options": { "enable-ecs-log-metadata": "true" } } }
If your containers are publishing serialized JSON logs over stdout, you should use this extra firelens configuration to get them correctly parsed within Datadog:
{ "essential": true, "image": "amazon/aws-for-fluent-bit:stable", "name": "log_router", "firelensConfiguration": { "type": "fluentbit", "options": { "enable-ecs-log-metadata": "true", "config-file-type": "file", "config-file-value": "/fluent-bit/configs/parse-json.conf" } } }
This converts serialized JSON from the
log:
field into top-level fields. See the AWS sample Parsing container stdout logs that are serialized JSON for more details. -
Next, in the same Fargate task, define a log configuration with AWS FireLens as the log driver, and with data being output to Fluent Bit. Here is an example snippet of a task definition where the FireLens is the log driver, and it is outputting data to Fluent Bit:
{ "logConfiguration": { "logDriver": "awsfirelens", "options": { "Name": "datadog", "apikey": "<DATADOG_API_KEY>", "Host": "http-intake.logs.datadoghq.com", "dd_service": "firelens-test", "dd_source": "redis", "dd_message_key": "log", "dd_tags": "project:fluentbit", "TLS": "on", "provider": "ecs" } } }
Note: If your organization is in Datadog EU site, use
http-intake.logs.datadoghq.eu
for theHost
option instead. The full list of available parameters is described in the Datadog Fluentbit documentation. -
Whenever a Fargate task runs, Fluent Bit sends the container logs to your Datadog monitoring with information about all of the containers managed by your Fargate tasks. You can see the raw logs on the Log Explorer page, build monitors for the logs, and use the Live Container view.
Monitor Fargate logs by using the awslogs
log driver and a Lambda function to route logs to Datadog.
-
Define the Fargate AwsLogDriver in your task. Consult the AWS Fargate developer guide for instructions.
-
Fargate task definitions only support the awslogs log driver for the log configuration. This configures your Fargate tasks to send log information to Amazon CloudWatch Logs. The following shows a snippet of a task definition where the awslogs log driver is configured:
{ "logConfiguration": { "logDriver": "awslogs", "options": { "awslogs-group": "/ecs/fargate-task-definition", "awslogs-region": "us-east-1", "awslogs-stream-prefix": "ecs" } } }
For more information about using the awslogs log driver in your task definitions to send container logs to CloudWatch Logs, see Using the awslogs Log Driver. This driver collects logs generated by the container and sends them to CloudWatch directly.
-
Finally, use a Lambda function to collect logs from CloudWatch and send them to Datadog.
-
Follow the instructions above to add the Datadog Agent container to your task definition with the additional environment variable
DD_APM_ENABLED
set totrue
and set up a host port that uses 8126 with tcp protocol under port mappings. Set theDD_SITE
variable to {{< region-param key="dd_site" code="true" >}}. It defaults todatadoghq.com
if you don't set it. -
Instrument your application based on your setup.
-
Ensure your application is running in the same task definition as the Datadog Agent container.
This integration works on Linux and Windows, but some metrics are OS dependent. All the metrics exposed when running on Windows are also exposed on Linux, but there are some metrics that are only available on Linux.
See metadata.csv for a list of metrics provided by this integration. It also specifies which ones are Linux-only.
The ECS Fargate check does not include any events.
See service_checks.json for a list of service checks provided by this integration.
Need help? Contact Datadog support.
- Blog post: Monitor AWS Fargate applications with Datadog
- FAQ: Integration Setup for ECS Fargate
- Blog post: Monitor your Fargate container logs with FireLens and Datadog
- Blog post: Key metrics for monitoring AWS Fargate
- Blog post: How to collect metrics and logs from AWS Fargate workloads
- Blog post: AWS Fargate monitoring with Datadog
- Blog post: Graviton2-powered AWS Fargate deployments
- Blog post: Monitor AWS Fargate for Windows containerized apps