Data Science Pipeline Flashcard #3: What is Data Cleaning?

Thank you to our friends at the Northeast Big Data Innovation Hub for creating a series of Data Science Flashcards video.

Learn about the Data Science Pipeline with the National Student Data Corps! The third step in the Data Science Pipeline is Data Cleaning. Data Cleaning is a part of data pre-processing that removes the inconsistencies and prepares and validates the data before the main analysis can be performed.