Big Data

Here are the key AWS Big Data components grouped by their primary functions:

Storage:

  • S3 (Simple Storage Service) for object storage

  • Amazon EFS (Elastic File System)

  • Amazon EBS (Elastic Block Store)

  • Amazon FSx for managed file systems

Processing & Analytics:

  • EMR (Elastic MapReduce) for Hadoop ecosystem

  • Amazon Redshift for data warehousing

  • Amazon Athena for S3 querying

  • AWS Glue for ETL

  • Amazon Kinesis for real-time data processing

Data Movement:

  • AWS Data Pipeline for workflow orchestration

  • AWS Transfer Family for file transfers

  • Database Migration Service (DMS)

  • Kinesis Firehose for data delivery

Databases:

  • RDS (Relational Database Service)

  • DynamoDB for NoSQL

  • DocumentDB for MongoDB workloads

  • Neptune for graph databases

  • Timestream for time-series data

Visualization:

  • QuickSight for BI and visualization

Security & Governance:

  • Lake Formation for data lake setup

  • Macie for data security

  • AWS KMS for key management

Query & Search:

  • OpenSearch (formerly Elasticsearch)

  • Amazon Redshift Spectrum

  • Amazon Athena

Last updated

Was this helpful?