Class Plan
Surface Water Drinking Water
A data management (Sata) and document scraping exercises.
Curriculum Modules
Data Management in Stata
-
Author: Benjamin Jacobs
-
Tool: Stata
-
Audience: Undergraduate (Introductory)
-
Estimated Time: 1-hour class session
Walks students through data management techniques in a research environment, covering best practices for downloading, transforming, merging, and saving data.
Word Document Scraping Workshop
-
Author: Joseph Bodenheimer
-
Tool: Python
-
Audience: Undergraduate (Introductory)
-
Estimated Time: 1-hour class session
Builds a Python workflow to scrape public data from KDHE Consumer Confidence Report .docx files, using python-docx, regular expressions, and pandas to extract fields across many reports.