Data Pipeline
common data flow:
Operational Database (Source) ↓
ETL/ELT (Extract, Transform, Load) ↓
Data Warehouse and/or Data Lake (Storage/Processing) ↓
BI Tools (Analysis/Visualization)
Key points:
ETL/ELT happens before data reaches the Data Lake
Data Lake is typically a destination, not an intermediate step
You generally don't do ETL after the Data Lake phase
Common patterns:
ETL → Data Warehouse → BI Tools
ELT → Data Lake → BI Tools
ELT → Data Lake → Data Warehouse → BI Tools
The choice between ETL and ELT depends on your needs:
ETL: Transform before loading (traditional approach)
ELT: Load raw data first, transform later (modern approach, common with Data Lakes)
Last updated
Was this helpful?