Data Science Portfolio
Personal Projects and Curiosity-Driven Explorations
Data Cleaning Strategic
data government
data cleansing
What data cleaning is?
Used mainly when dealing with data stored in database, the terms data validation, data cleaning, or data scrubbing refers to the process of detecting, correcting, replacing, modifying or removing messy data from a record set, table, or database.
Why we need data cleaning process?
First of all, Data is dirty. Live with it. Data cleaning consists primarily in implementing error prevention strategies before they occur. However, error-prevention strategies can reduce but not eliminate common errors.
Links
- Source code on my github