Clean up your CSV data files with easy Python coding

by Katie Kodes | at MinneBar 13 | 12:10 – 1:00 in Stephen Leacock | View Schedule

Slides: https://pypancsv.github.io/pypancsv

If you've ever moved table-style data from Point A to Point B, you may have tried exporting it to CSV, cleaning the CSV file in Excel, and re-importing the results.

When Excel seems "just not quite enough" for that middle step, consider Python, a programming language that has exploded in popularity among non-programmers dealing with complex data problems.

See how short pieces of Python code can perform advanced filter-and-replace operations, summarize data, combine multiple CSV files (VLOOKUP across multiple columns? no problem!), and speed up processing of extra-large CSV files.

We'll cover:

  • When it's faster to code Python, and when to click buttons in Excel

  • Some "programming 101" to help you read the session's demo code

  • Demos

  • Training and practice resources

This session is aimed at:

  • "Non-programmers" who are are pretty darned handy with Excel formulas, if you do say so yourselves

  • Non-Python programmers who want to see what CSV processing looks like in Python

Beginner

Katie Kodes

Once told, "I've always imagined your brain is shaped like an old-fashioned library card catalog," Katie is thrilled by any chance to help others find -- and maintain -- order in their data.

Katie blogs about SQL, Python, Salesforce, and other ways to deal with data at https://www.katiekodes.com/