Data I

Programming Skills

programming skills
model building
University of Kansas
Author

Will Duncan

Course Description

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and sys- tems to derive knowledge and insights from data. This course teaches students the critical concepts of inference and computing, working with real behavioral, economic, geographic, physical, social, and text data. Students obtain basic statistics training from a computational perspective using simulation to answer questions, explore problems, and delve into social issues surrounding data analysis such as privacy and design.

Course Objective

Upon successful completion of this course, students will be able to: - Have learned the basic concepts and skills of data science. – Work with data from multiple sources and identify relevant questions. – Understand and process data and organize information. – Analyze data by identifying and applying statistical methods. – Visualize and communicate data. – Translate results into solutions and communicate findings in a way that positively affects policy or organizational decisions. - Become familiar with the programming language of Python, particularly in the use of data science and statistical modeling.

Prerequisite: NONE

Syllabus

Unable to display PDF file. Download instead.

Lecture

Lecture 1: Introduction Click!

Lecture 2: Causality Click!

Lecture 3: Introduction to Tables and Data Types Click!

Lecture 4:Arrays & Ranges Click!

Lecture 5: Creating Tables Click!

Lecture 6: Data Visualization Click!

Lecture 7: Functions Click!

Lecture 8: Groups, Pivots and Joins Click!

Lecture 9: Iteration, Sampling, and Chance Click!

Lecture 10: Probability & Samples Click!

Lecture 11: Models and Assessing Models Part 1 Click! Part 2 Click!

Lecture 12: Decisions under Uncertainty Click!

Lecture 13: Causality and A/B Testing Click!

Lecture 14: Estimation and Confidence Click!

Lecture 15: Center. Spread, and Normal Distribution Click!

Lecture 16: Sampling Variability Click!

Lecture 17: Correlation Click!

Lecture 18: Regression Click!

Lecture 19: Regression Inference Click!

Lecture 20: Classification Click!

In-Class Activity

Lab 1: Introduction Click!

Lab 2: Table Operations Click!

Lab 3: Data Types and Arrays Click!

Lab 4: Functions and Visualizations Click!

Lab 5: Simulations Click!

Lab 6: Assessing Models Click!

Lab 7: Crime and Penalty Click!

Lab 8: Normal Distribution and Variance of Sample Means Click!

Lab 9: Correlation and Regression Click!

Lab 10: Classification Click!

Assignment

Homework Assignment 1: Causality Click!

Homework Assignment 2: Arrays and Tables Click!

Homework Assignment 3: Table Manipulation and Visualization Click!

Homework Assignment 4: Functions, Histograms, and Groups Click!

Homework Assignment 5: Functions, Iterations Click!

Homework Assignment 6: Probability, Simulation, Estimation, Assessing Models Click!

Homework Assignment 7: Testing Hypotheses Click!

Homework Assignment 8: Confidence Intervals Click

Homework Assignment 9: Bootstrap, Resampling, CLT Click!

Homework Assignment 10: Linear Regression Click!

Homework Assignment 11: Regression Inference Click!

Data

TBD

Resources

Textbook

Textbook: Computational and Inferential Thinking: The Foundations of Data Science

Google Colab

Homework Assignments and Data Lab use Google Colab

Back to top