Redshift
Amazon Redshift Overview
A cloud-based, fully managed data warehouse solution
Built on modified PostgreSQL architecture
Designed for OLAP (Online Analytical Processing) workloads
Can store up to 16 PB of data
Key Features
High Performance
Delivers up to 10x performance compared to traditional data warehouses
Uses columnar storage (column-based vs row-based)
Enables efficient parallel query execution
Durability & Availability
Supports Multi-AZ deployment across 2 Availability Zones
Automated snapshots stored in S3 (S3 bucket management handled by AWS)
Data Processing
Best practice: Use large batch inserts for optimal performance
Redshift Spectrum allows direct querying of S3 data without loading into Redshift tables
Enhanced VPC Routing forces COPY and UNLOAD operations through your VPC
Last updated
Was this helpful?