Data Cleaning 101 in SQL - #4.2 A Practical Tutorial for Data Deduplication

I am glad to have you for the fourth part of my complete guide on data cleaning. #1: Tidying Messy Data #2: Dealing with Missing Data #3.1: A discussion on the Nature of Outliers #3.2: The Origin of Outliers & Detection Techniques #4.1: Where does Data Duplication come from? #4.2: A Practical Tutorial for Data Deduplication <– You are here Can you spot the impostor? - Picture by the author...

September 16, 2023 · 5 min · Brian Tran

Data Cleaning 101 in SQL - #1 Tidying Messy Data

I am glad to have you for the first part of my complete guide on data cleaning. #1: Tidying Messy Data <– You are here #2: Dealing with Missing Data #3.1: A discussion on the Nature of Outliers #3.2: The Origin of Outliers & Detection Techniques #4.1: Where does Data Duplication come from? #4.2: A Practical Tutorial for Data Deduplication Photo by Vadim Sherbakov on Unsplash Data cleaning has always been a nightmare that every single analyst has to walk through (like through fire and flames)....

July 16, 2023 · 15 min · Brian Tran

Data Cleaning 101 in SQL - #2 Dealing with Missing Data

I am glad to have you for the second part of my complete guide on data cleaning. #1: Tidying Messy Data #2: Dealing with Missing Data <– You are here #3.1: A discussion on the Nature of Outliers #3.2: The Origin of Outliers & Detection Techniques #4.1: Where does Data Duplication come from? #4.2: A Practical Tutorial for Data Deduplication When you are missing someone, time seems to move slower, and when I’m falling in love with someone, time seems to be moving faster....

July 16, 2023 · 10 min · Brian Tran