A Beginner’s Guide to Data Cleaning Using Pandas

Most real-world datasets are messy. Before you can run any analysis or build a model, you need to deal with missing values, strange outliers, inconsistent formatting, and incorrect data types. This post walks through the basics of data cleaning using pandas, one of the most popular Python libraries for data manipulation. We’ll use a dataset of Nairobi property listings as our example. It contains information like location, price, number of bedrooms, and date posted. Let’s get started. ...

July 1, 2025 · 2 min · Brian Njenga Mwaura

About Me

Hi, I’m Brian Njenga Mwaura, a data analyst and statistician with a strong foundation in R, Python, and AWS. 🎓 BSc in Statistics ☁️ AWS Cloud Certified 🧠 Skilled in data cleaning, modeling, and visualization 📈 Special interest in financial markets (Forex) 🤖 Automating tasks and uncovering insights from complex datasets This blog and portfolio showcase projects I’ve built across: Data analysis Web scraping AWS cloud automation Let’s connect and turn data into decisions! ...

June 19, 2025 · 1 min · Brian Njenga Mwaura