AWS Glue

A serverless data integration service.

Visit Website →

Overview

AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it simple and cost-effective to categorize your data, clean it, enrich it, and move it reliably between various data stores and data streams. AWS Glue consists of a central metadata repository known as the AWS Glue Data Catalog, an ETL engine that automatically generates Python or Scala code, and a flexible scheduler that handles dependency resolution, job monitoring, and retries.

✨ Key Features

  • Serverless ETL
  • Automatic schema discovery (crawlers)
  • Integrated data catalog
  • Visual and code-based job authoring
  • Job scheduling and orchestration

🎯 Key Differentiators

  • Serverless and fully managed
  • Deep integration with the AWS data ecosystem
  • Automatic schema discovery

Unique Value: Provides a simple and cost-effective way to build and run ETL jobs in the AWS cloud without managing any infrastructure.

🎯 Use Cases (4)

ETL/ELT pipelines Data preparation for analytics Building a data lake Streaming data integration

✅ Best For

  • Building serverless ETL jobs to process data in Amazon S3
  • Creating and managing a data catalog for a data lake on AWS

💡 Check With Vendor

Verify these considerations match your specific requirements:

  • Complex, multi-cloud or hybrid-cloud orchestration.
  • Workflows that are not primarily data integration tasks.

🏆 Alternatives

Azure Data Factory Google Cloud Data Fusion Talend Informatica

Offers seamless integration with AWS data stores, but is less flexible for multi-cloud scenarios and may be less user-friendly for complex transformations than tools with graphical interfaces.

💻 Platforms

Web API

🔌 Integrations

Amazon S3 Amazon Redshift Amazon RDS Amazon DynamoDB JDBC-accessible databases

🛟 Support Options

  • ✓ Email Support
  • ✓ Live Chat
  • ✓ Phone Support
  • ✓ Dedicated Support (AWS Support Plans tier)

🔒 Compliance & Security

✓ SOC 2 ✓ HIPAA ✓ BAA Available ✓ GDPR ✓ ISO 27001 ✓ SSO ✓ SOC 1, 2, 3 ✓ HIPAA ✓ GDPR ✓ ISO/IEC 27001, 27017, 27018 ✓ PCI DSS Level 1

💰 Pricing

Contact for pricing
Free Tier Available

Free tier: Free tier for the Data Catalog and crawlers.

Visit AWS Glue Website →