Workshop: Processing Large Data with Pandas
Pandas has become an incredibly popular Python package for processing data. We’ve had brief mentions of it before, but this month we have a proper workshop by a subject matter expert, Dayton’s own Evelyn Boettcher!
In this tutorial, we will:
- Learn how Python uses memory with Pandas
- Learn how to reduce a Pandas DataFrame's memory footprint
- Learn what data types are
- Speed up reading in CSV files by using categories
- Reduce the memory footprint by 90%
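If you'd like a taste before the meeting, here is a minimal sketch of the general idea. It is not taken from the workshop materials: the sample data, the column names (`city`, `count`), and the dtype choices are all made up for illustration, and the savings you see will depend on your data and your Pandas version.

```python
import io

import numpy as np
import pandas as pd

# Hypothetical sample data: 10,000 rows with a low-cardinality text column
# and a small-integer column. None of this comes from the workshop itself.
rng = np.random.default_rng(0)
n_rows = 10_000
csv_text = pd.DataFrame(
    {
        "city": rng.choice(["Dayton", "Columbus", "Cincinnati"], size=n_rows),
        "count": rng.integers(0, 100, size=n_rows),
    }
).to_csv(index=False)

# Default read: Pandas picks general-purpose types (strings and 64-bit ints).
df_default = pd.read_csv(io.StringIO(csv_text))

# Declare dtypes up front: "category" for repeated text,
# and a narrow integer type for values known to fit in it (0-99 fits int8).
df_small = pd.read_csv(
    io.StringIO(csv_text),
    dtype={"city": "category", "count": "int8"},
)

# deep=True includes the memory held by the string values themselves.
bytes_default = int(df_default.memory_usage(deep=True).sum())
bytes_small = int(df_small.memory_usage(deep=True).sum())

print(f"default dtypes: {bytes_default:,} bytes")
print(f"tuned dtypes:   {bytes_small:,} bytes")
print(f"reduction:      {1 - bytes_small / bytes_default:.0%}")
```

Run `df_default.dtypes` and `df_small.dtypes` side by side to see which column types changed.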
This is a hands-on workshop - bring a laptop if you can!
https://github.com/ejboettcher/Talk-ProcessingLargeDatawithPandas
New meeting location!
Innovation Hub - not Brixx!
We’ve begun to meet in the Innovation Hub, a gorgeous new facility that’s part of the renovated Dayton Arcade complex. Enter through the doors that face the Wright Stop Plaza bus hub.
Street parking is free in the evening. I usually park on Ludlow Street.
Or, if for any reason coming downtown doesn’t work for you (for instance, you’ve converted yourself to purely digital format and now exist as a set of cloud-hosted algorithms), we’ll be online as well!
Join us at 7 PM EDT on the PyFri Discord channel, discord.gg/9SgTh3T, and click on the General voice chat link. You may need to install the Discord desktop app rather than just using the web interface.