A comprehensive reference for understanding modern data architecture terminology
What it means: Data structure is applied when you query the data, not when you store it.
What it means: Data structure is enforced when data is loaded into the system.
What it means: Data is stored column by column (like Excel columns).
What it means: Traditional approach - clean and structure data before storing it.
What it means: Modern approach - store raw data first, clean it when needed.
What it means: Processing data continuously as it arrives.
What it means: Central repository storing all types of raw data at low cost.
What it means: Structured repository optimized for business reporting and analytics.
What it means: Combines flexibility of data lakes with reliability of data warehouses.
What it means: Decentralized approach where business domains own their data as products.
What it means: Enhanced data lake format with database-like reliability features.
What it means: Open format that works with multiple analytics engines.
What it means: Database properties ensuring data reliability (Atomicity, Consistency, Isolation, Durability).
What it means: Ability to query data as it existed at any point in the past.
This glossary serves as a reference for business stakeholders to understand and participate in data architecture discussions.
🏠 Back to Home