PDS II (IDS 541)#

Practical Data Science II is a flipped-classroom, exercise and project-focused course. Building on the computational thinking skills developed in Practical Data Science I, this course introduces students to a range of computational inquiry methods, including network analysis, geospatial analysis, advanced plotting, and natural language processing (NLP). Throughout, the focus will be on developing hands-on experience implementing these methods with messy real-world data to ensure students are prepared to deploy these tools to answer the questions they care about. Requirements: Practical Data Science I.

Syllabus#

A preliminary syllabus can be found here.

Office Hours#

Adriane: Signup with this link

Nick (Zoom or in Gross 231): Thursday 9 am - 10:00 am. Please email me in advance if you plan to attend. I’ll usually be there regardless, but knowing people are coming is useful if conflicts arise.

Diego (Zoom or Gross Hall 2nd Floor, 230K): Monday 12:00 - 1:00 pm

Meron (Zoom or Gross Hall 2nd Floor, 230N): Wednesday 12:00 - 1:00 pm.

Books#

Class Schedule (Preliminary)#

Date

Topic

Do Before Class

In-Class Exercise

Mon Jan 12

  • Pandas: Reshaping

Wed Jan 14

  • Speed and Performance in Python

  • Pandas: Categoricals

Mon Jan 19

  • MLK DAY NO CLASS

Wed Jan 21

  • Big Data: What is it, how do I work with it?

Mon Jan 26

SNOW

Wed Jan 28

  • Parallelism

  • Distributed Computing

(Note reading includes a 45 minute video to watch)

Mon Feb 2

SNOW AGAIN

Wed Feb 4

  • More Dask

Mon Feb 9

-GIS: Vector Data

Wed Feb 11

-GIS: Vector Data

Mon Feb 16

-GIS: Vector Data

Wed Feb 18

-GIS: Vector Data

(Yes, this is just the docs. But I wrote a lot of them so it’s not cheating!)

Mon Feb 23

  • GIS: Rasters

Wed Feb 25

  • GIS: Rasters

Mon Mar 2

  • GIS: Rasters

Wed Mar 4

  • GIS: Mixing Vector and Raster

Mon Mar 9

NO CLASS

SPRING BREAK

Wed Mar 11

NO CLASS

SPRING BREAK

Mon Mar 16

  • Machine Learning

  • Géron, Chpt 1: The Machine Learning Landscape (stop before “Batch and Online Learning,” then read the “Testing and Validating” section)

  • Géron, Chpt 2: End-to-End Machine Learning Project

Wed Mar 18

  • Machine Learning

Mon Mar 23

  • Machine Learning

Wed Mar 25

  • Machine Learning

  • Supervised ML

Mon Mar 30

  • Machine Learning

  • SciKit-Learn

Wed Apr 1

  • Git and github

Mon Apr 6

  • Git and github

Wed Apr 8

  • Network Data

  • Intro to graph-tool

Mon Apr 13

  • Network Data

  • Community Detection

  • Shortest Path

Wed Apr 15

LAST CLASS