","heroCtaTwoUrl":"https://aws.amazon.com/big-data/datalakes-and-analytics/data-integration/"},"metadata":{"tags":[{"id":"GLOBAL#product#glue","name":"AWS Glue","namespaceId":"GLOBAL#product","description":"AWS Glue","metadata":{}}]}}]},"metadata":{"auth":{},"testAttributes":{}},"context":{"page":{"locale":null,"site":null,"pageUrl":"https://aws.amazon.com/glue/","targetName":null,"pageSlotId":null,"organizationId":null,"availableLocales":null},"environment":{"stage":"prod","region":"us-east-1"},"sdkVersion":"1.0.115"},"refMap":{"manifest.js":"0a0328ab4e","rt-hero.rtl.css":"02176eb808","rt-hero.css":"4d75859a95","rt-hero.css.js":"388ff790be","rt-hero.js":"f64f492ef9","rt-hero.rtl.css.js":"e7802c7128"},"settings":{"templateMappings":{"hasSubnav":"hasSubnav","heading":"headline","subheading":"subheading","button1CTA":"heroCtaOne","button1URL":"heroCtaOneUrl","button2CTA":"heroCtaTwo","button2URL":"heroCtaTwoUrl","breadcrumbs":"breadcrumbs","freeTierContent":"freeTierContent","freeTierURL":"freeTierURL","dark":"dark"}}}
AWS Glue
Discover, prepare, and integrate all your data at any scaleBenefits of AWS Glue
How it works
AWS Glue is a serverless data integration service that makes it easier to discover, prepare, move, and integrate data from multiple sources for analytics, machine learning (ML), and application development.
-
Data integration engine options
-
Event-driven ETL
-
AWS Glue Data Catalog
-
No-code ETL jobs
-
Manage and monitor data quality
-
Data preparation
-
Data integration engine options
-
Choose your preferred data integration engine in AWS Glue to support your users and workloads.
-
Event-driven ETL
-
AWS Glue can run your extract, transform, and load (ETL) jobs as new data arrives. For example, you can configure AWS Glue to initiate your ETL jobs to run as soon as new data becomes available in Amazon Simple Storage Service (S3).
-
AWS Glue Data Catalog
-
You can use the Data Catalog to quickly discover and search multiple AWS datasets without moving the data. Once the data is cataloged, it is immediately available for search and query using Amazon Athena, Amazon EMR, and Amazon Redshift Spectrum.
-
No-code ETL jobs
-
AWS Glue Studio makes it easier to visually create, run, and monitor AWS Glue ETL jobs. You can build ETL jobs that move and transform data using a drag-and-drop editor, and AWS Glue automatically generates the code.
-
Manage and monitor data quality
-
AWS Glue Data Quality automates data quality rule creation, management, and monitoring to help ensure high quality data across your data lakes and pipelines.
-
Data preparation
-
With AWS Glue DataBrew, you can explore and experiment with data directly from your data lake, data warehouses, and databases, including Amazon S3, Amazon Redshift, AWS Lake Formation, Amazon Aurora, and Amazon Relational Database Service (RDS). You can choose from over 250 prebuilt transformations in DataBrew to automate data preparation tasks such as filtering anomalies, standardizing formats, and correcting invalid values.
Additionally, AWS Glue Studio offers a data preparation tool that allows you to prepare data with an interactive, point-and-click visual interface without writing code.
Use Cases
Simplify ETL pipeline development
Interactively explore, experiment on, and process data
Discover data efficiently
Support various processing frameworks and workloads
What's new
Get started with Glue
Did you find what you were looking for today?
Let us know so we can improve the quality of the content on our pages.