*but were too afraid to ask like: the mysteries of SettingWithCopyWarning, the use of inplace=True, when to use .copy(), when to use .apply(), NaNs, BlockManagers and more! You use pandas every day, but do you really know how it works? Let’s pull back the curtain together!
Pandas, the data wrangling workhorse, will celebrate its 18th year of existence in 2026. You rely on it daily for analysis, but are you truly confident in your code?
This session is dedicated to the unwritten rules and hidden mechanics that separate a confident user from one who constantly battles warnings and unexpected outputs. We will confront the infamous SettingWithCopyWarning that haunts chained operations, clarify the critical differences in deep vs. shallow copies and the true cost of using inplace=True. We’ll also demystify the complex handling of missing data (NaNs) and much more!
Crucially, we will look to the future. Pandas is engaged in a DataFrame library race with newer, high-performance libraries like Polars and Duckdb. The latest advancements—pandas 2.0 and the forthcoming 3.0, with features like Copy-on-Write and Apache Arrow integration—are the direct response, promising a future of dramatically improved speed, memory efficiency, and data types.
Join me to master the crucial concepts of the past and prepare for the performance gains and new behaviors of the future, ensuring your skills stay ahead of the curve. Stop guessing and start mastering pandas!
I’m Francesco, but everybody calls me Frengo. I am a Machine Learning Engineer with a passion for building innovative solutions. I work at Prima, the insurance company, where I try to bring a blend of humor and empathy, making collaboration a joy rather than a chore. I play the guitar and I sing in - not one - but two choirs. I also like cooking… and eating.