Class Plan

Surface Water Drinking Water

Data management (Stata) and document scraping exercises.

Curriculum Modules

Data Management in Stata
  • Author: Benjamin Jacobs
  • Tool: Stata
  • Audience: Undergraduate (Introductory)
  • Estimated Time: 1-hour class session

Walks students through data management techniques in a research environment, covering best practices for downloading, transforming, merging, and saving data.

Word Document Scraping Workshop
  • Author: Joseph Bodenheimer
  • Tool: Python
  • Audience: Undergraduate (Introductory)
  • Estimated Time: 1-hour class session

Builds a Python workflow to scrape public data from KDHE Consumer Confidence Report .docx files, using python-docx, regular expressions, and pandas to extract fields across many reports.
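
The workflow described above might look roughly like the following. This is a minimal sketch, not the workshop's actual code: the field names, the regular expressions, and the function names (`docx_text`, `extract_fields`, `scrape_folder`) are hypothetical, and the wording of real KDHE reports will differ. It assumes python-docx and pandas are installed.

```python
import re
from pathlib import Path

# Hypothetical patterns for illustration only; the real report wording
# must be inspected before writing the actual expressions.
FIELD_PATTERNS = {
    "system_name": re.compile(r"Water System Name:\s*(.+)"),
    "nitrate_level": re.compile(r"Nitrate[^\d\n]*(\d+(?:\.\d+)?)"),
}


def docx_text(path):
    """Return the paragraph text of a .docx file as one string."""
    from docx import Document  # python-docx, imported only when needed

    # Note: text inside tables is not in .paragraphs; reports that keep
    # their values in tables would also need to walk Document(...).tables.
    return "\n".join(p.text for p in Document(str(path)).paragraphs)


def extract_fields(text):
    """Apply each pattern to the text; fields with no match become None."""
    row = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        row[name] = match.group(1).strip() if match else None
    return row


def scrape_folder(folder):
    """Build one DataFrame row per .docx report found in the folder."""
    import pandas as pd  # imported only when needed

    rows = []
    for path in sorted(Path(folder).glob("*.docx")):
        row = extract_fields(docx_text(path))
        row["source_file"] = path.name  # keep provenance for each row
        rows.append(row)
    return pd.DataFrame(rows)
```

Keeping the regular-expression step in its own function means it can be checked against a pasted text sample before any files are opened, for example `extract_fields("Water System Name: Example City\nNitrate (ppm) 2.4")`.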
