Correct option is D
A data warehouse is a centralized system designed for the storage, integration, and analysis of structured data collected from multiple sources. Unlike operational databases, which handle real-time transactions, a data warehouse is optimized for analytical processing, historical data storage, and business intelligence.
Key characteristics of a data warehouse include:
Subject-Oriented – Data is categorized based on subjects like sales, finance, or inventory, rather than daily operations.
Integrated – It consolidates data from multiple sources (e.g., ERP, CRM, and other databases) into a unified format.
Time-Variant – Data warehouses maintain historical records to enable trend analysis and forecasting.
Non-Volatile – Once stored, data is not frequently updated or deleted, ensuring consistency for analytical queries.
Usage of a Data Warehouse:
Business Intelligence (BI) tools use data warehouses for reporting and decision-making.
They support Online Analytical Processing (OLAP), allowing multidimensional analysis.
Industries like banking, healthcare, and retail use data warehouses for customer behavior analysis and strategic planning.
The most well-known data warehouse technologies include Amazon Redshift, Google BigQuery, Snowflake, and Microsoft Azure Synapse Analytics.
Information Booster:
Data Warehouse vs. Database: A database is optimized for daily transactions, whereas a data warehouse is optimized for large-scale analytical queries.
ETL (Extract, Transform, Load): A key process in data warehousing that extracts data from various sources, transforms it into a consistent format, and loads it into the warehouse.
OLTP vs. OLAP:
Types of Data Warehouses:
OLTP (Online Transaction Processing): Supports real-time transactions (e.g., banking apps).
OLAP (Online Analytical Processing): Supports complex queries and multidimensional analysis.
Advantages: Improves decision-making, ensures data consistency, enhances query performance, and supports data-driven strategies.
Enterprise Data Warehouse (EDW): A centralized repository for the entire organization.
Data Marts: A smaller, department-specific subset of a data warehouse.
Operational Data Store (ODS): A hybrid that supports real-time operations and analytics.
Additional Knowledge:
(a) A system used for transaction processing
Transaction Processing Systems (TPS) are designed for real-time data entry, retrieval, and modification.
Examples: ATM withdrawals, e-commerce transactions, and airline reservations.
Data warehouses do not handle real-time transactions; they store historical data for analysis.
(b) A tool for generating real-time analysis
Real-time analysis is performed using streaming analytics tools like Apache Kafka, Apache Flink, or Google Dataflow.
A data warehouse supports batch processing and scheduled reports, not real-time analytics.
For real-time data processing, organizations use NoSQL databases, event-driven architectures, or in-memory computing (e.g., SAP HANA).
(c) A database for unstructured data storage
Unstructured data (e.g., images, videos, and social media posts) is stored in Data Lakes or NoSQL databases like MongoDB, Hadoop, and Amazon S3.
A data warehouse primarily handles structured and semi-structured data in predefined schemas.