Data Validation: The Key to Building Sturdy Systems
The key to building sturdier, more reliable systems? Use schemas to validate data...
Parquet → CSV: Four Python Libraries Compared
Among Polars, DuckDB, PyArrow, and Pandas, which one delivers the fastest Parquet-to-CSV conversions?
Combining CSVs With Slightly Different Schemas
Learn three ways to merge multiple CSV files that don’t share the exact same schema in Python...
Recoding (Column) Values in Python
Data recoding is a dreaded task, but the results are well worth the effort. In this post, I share several methods for quickly recoding column values using the Polars and Pandas libraries in Python...
Renaming Columns in Python
This post shares several methods for renaming DataFrame columns using the Polars and Pandas libraries in Python...
Design with a discerning, editing eye
A keen editing eye can enhance your designs, enabling effective communication and visual appeal. Learn how to use this skill by creating a customized ggplot2 theme in R...