ETL Process

Technology

Process for extracting, transforming and loading data

ETL Process

The ETL process (Extract, Transform, Load) is a fundamental process in data provisioning. In the extraction phase, data is pulled from various sources. In the transformation phase, it is cleaned and converted into the required format. In the loading phase, the processed data is written to the target system. The goal is data that is clean, consolidated and ready for analysis.
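The three phases can be sketched in a few lines of Python. This is only a minimal illustration using the standard library: the sample data, the `payments` table, and the rule of dropping rows without an amount are assumptions made for this example, not part of any specific ETL tool.

```python
import csv
import io
import sqlite3

# Hypothetical source data; a real job would read from a file, database, or API.
RAW_CSV = """id,name,amount
1, Alice ,10.50
2,Bob,
3,Carol,7.25
"""


def extract(text):
    """Extract: read raw rows from a CSV source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim whitespace, convert types, drop rows without an amount."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # cleaning rule assumed for this sketch
        cleaned.append((int(row["id"]), row["name"].strip(), float(amount)))
    return cleaned


def load(rows, conn):
    """Load: write the processed rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS payments "
        "(id INTEGER PRIMARY KEY, name TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO payments VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(text, conn):
    """Run the three phases in order."""
    load(transform(extract(text)), conn)


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # in-memory target system for the demo
    run_etl(RAW_CSV, conn)
    print(conn.execute("SELECT * FROM payments ORDER BY id").fetchall())
```

Keeping each phase as its own function makes it possible to test the cleaning rules without touching the source or the target system.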

Architecture

flowchart TD
    A[Data Sources] --> B[Extraction]
    B --> C[Transformation]
    C --> D[Validation]
    D --> E[Loading]
    E --> F[Target System]

    subgraph Data Sources
        A1[Relational Databases]
        A2[NoSQL Databases]
        A3[CSV/Excel Files]
        A4[APIs]
        A5[Log Files]
    end

    subgraph Transformation
        C1[Data Cleaning]
        C2[Format Conversion]
        C3[Enrichment]
        C4[Aggregation]
    end

    subgraph Target System
        F1[Data Warehouse]
        F2[Data Lake]
        F3[OLAP Database]
    end
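The Validation stage between Transformation and Loading could look roughly like the following sketch. The concrete checks (required fields, duplicate `id`, negative `amount`) and the `ValidationResult` type are hypothetical and chosen only to illustrate the idea of separating loadable rows from rejected ones.

```python
from dataclasses import dataclass, field


@dataclass
class ValidationResult:
    valid: list = field(default_factory=list)
    rejected: list = field(default_factory=list)  # (row, reason) pairs


def validate(rows, required=("id", "amount")):
    """Split transformed rows into loadable rows and rejects with a reason."""
    result = ValidationResult()
    seen_ids = set()
    for row in rows:
        missing = [key for key in required if row.get(key) is None]
        if missing:
            result.rejected.append((row, f"missing: {', '.join(missing)}"))
        elif row["id"] in seen_ids:
            result.rejected.append((row, "duplicate id"))
        elif row["amount"] < 0:
            result.rejected.append((row, "negative amount"))
        else:
            seen_ids.add(row["id"])
            result.valid.append(row)
    return result
```

Only `result.valid` would be passed on to the loading step. Keeping the rejected rows together with a reason makes it easier to trace data quality problems back to a source.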

In Context

  • Typically used together with Data Warehousing and Business Intelligence
  • Related to: ELT Process, Data Pipeline, Data Integration
  • Example use case: Consolidation of sales data from various systems for company-wide reports
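The sales consolidation use case can be illustrated with a small sketch. The two source systems, their field names, and the figures are invented for this example. The point is only that each source is mapped to a common schema before the data is aggregated for reporting.

```python
from collections import defaultdict

# Hypothetical exports from two source systems with different schemas.
SHOP_ORDERS = [
    {"order_id": "S-1", "region": "north", "total_eur": "120.00"},
    {"order_id": "S-2", "region": "south", "total_eur": "80.50"},
]
POS_SALES = [
    {"receipt": 9001, "Region": "North", "cents": 4500},
    {"receipt": 9002, "Region": "South", "cents": 1950},
]


def from_shop(row):
    """Map a web shop order to the common schema."""
    return {"region": row["region"].lower(), "revenue": float(row["total_eur"])}


def from_pos(row):
    """Map a point-of-sale receipt to the common schema (cents to euros)."""
    return {"region": row["Region"].lower(), "revenue": row["cents"] / 100}


def consolidate(shop_rows, pos_rows):
    """Unify both sources and aggregate revenue per region."""
    unified = [from_shop(r) for r in shop_rows] + [from_pos(r) for r in pos_rows]
    totals = defaultdict(float)
    for row in unified:
        totals[row["region"]] += row["revenue"]
    return {region: round(total, 2) for region, total in totals.items()}


if __name__ == "__main__":
    print(consolidate(SHOP_ORDERS, POS_SALES))
```

In a real setup the aggregated result would be loaded into a data warehouse table rather than printed.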
Source: AI Generated