Correct option is D
Data scrubbing, also known as data cleaning, is typically performed
before data analysis or reporting to ensure that the data is accurate, consistent, and complete. This process involves removing or correcting inaccurate, incomplete, or irrelevant data, thus improving the quality of the dataset.
Example:
· For a sales database, data scrubbing would involve handling missing entries in sales figures or correcting invalid dates before generating sales reports.
Important Key Points:
1.
Data scrubbing is a critical step in the
ETL (Extract, Transform, Load) pipeline and ensures high-quality data for decision-making.
2. It involves processes like:
· Removing duplicates
· Handling missing values
· Standardizing formats
3. Poor-quality data can lead to incorrect analysis, affecting business outcomes.
Knowledge Booster:
·
Before data entry: Incorrect; data validation, not scrubbing, occurs at this stage.
·
Before data backup: Incorrect; data is typically backed up as-is, and scrubbing happens after retrieval.
·
During data storage: Incorrect; data storage focuses on structuring and indexing, not cleaning.
·
After query execution: Incorrect; data scrubbing occurs before queries are run to ensure accurate results.