DS 200 Introduction to Data Science

This hands-on, project-based course introduces data science, blending computer science and statistics to extract insights from diverse datasets. Students master Python’s data science tools, including Jupyter, NumPy, Pandas, and Scikit-learn, and apply advanced visualization techniques. The curriculum emphasizes statistical understanding and analytical models like regression and clustering. A key focus is on data storytelling, enabling students to effectively communicate complex insights, bridging technical analysis with strategic decision-making.

Instructor:

Dr. Peilong Li

Office:

Esbenshade 284B

Appointments:

By email

Number of Credits

4

Pre-requisites

  • CS 121 Computer Science I
  • Any Math course from the following: MA 121, MA 135, MA 251

Textbooks

  • (Required) Peilong Li. DS 200 Introduction to Data Science. 2024, zyBooks.

    • Sign in or create an account at: learn.zybooks.com
    • Enter zyBook code: ETOWNDS200LiSpring2024
    • Subscribe. Subscription fee: $89. Valid through end of May.
  • (Optional) Joel Grus. Data Science from Scratch: First Principles with Python (DSS). 2015, O’Reilly Media, ISBN: 0636920033400. E-copy: link

  • (Optional) William McKinney. Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython (PDA). 2012, O’Reilly Media, ISBN: 0636920023784. E-copy: link