Data Cleaning 101 in SQL — # 3.1 A discussion on the Nature of Outliers

I am glad to have you for the third part of my complete guide on data cleaning. #1: Tidying Messy Data #2: Dealing with Missing Data #3.1: A discussion on the Nature of Outliers <– You are here #3.2: The Origin of Outliers & Detection Techniques #4.1: Where does Data Duplication come from? #4.2: A Practical Tutorial for Data Deduplication I like weird people. The black sheep, the odd ducks, the rejects, the eccentric, the loners, the lost and forgotten. More often than not, these people have the most beautiful souls. — Unknown ...

September 25, 2025 · 8 min · Brian Tran

Data Cleaning 101 in SQL - #4.2 A Practical Tutorial for Data Deduplication

I am glad to have you for the fourth part of my complete guide on data cleaning. #1: Tidying Messy Data #2: Dealing with Missing Data #3.1: A discussion on the Nature of Outliers #3.2: The Origin of Outliers & Detection Techniques #4.1: Where does Data Duplication come from? #4.2: A Practical Tutorial for Data Deduplication <– You are here Can you spot the impostor? - Picture by the author ...

September 16, 2023 · 5 min · Brian Tran

Data Cleaning 101 in SQL - #1 Tidying Messy Data

I am glad to have you for the first part of my complete guide on data cleaning. #1: Tidying Messy Data <– You are here #2: Dealing with Missing Data #3.1: A discussion on the Nature of Outliers #3.2: The Origin of Outliers & Detection Techniques #4.1: Where does Data Duplication come from? #4.2: A Practical Tutorial for Data Deduplication Photo by Vadim Sherbakov on Unsplash Data cleaning has always been a nightmare that every single analyst has to walk through (like through fire and flames). Although it is a hard pill to swallow, we cannot simply skip this step since the avoidance of data cleaning only brings more miseries to the analyst. ...

July 16, 2023 · 15 min · Brian Tran

Data Cleaning 101 in SQL - #2 Dealing with Missing Data

I am glad to have you for the second part of my complete guide on data cleaning. #1: Tidying Messy Data #2: Dealing with Missing Data <– You are here #3.1: A discussion on the Nature of Outliers #3.2: The Origin of Outliers & Detection Techniques #4.1: Where does Data Duplication come from? #4.2: A Practical Tutorial for Data Deduplication When you are missing someone, time seems to move slower, and when I’m falling in love with someone, time seems to be moving faster. — Taylor Swift ...

July 16, 2023 · 10 min · Brian Tran