
2012 London Summer Olympics Data
Local Data
Data Source
Problem Source
Description

Movie Data
Local Data
Data Source
Problem Source
Description

Computer Network Traffic Data
Local Data
Data Source
Problem Source
Description

20152016 New York City Manhattan Rolling Sales Data
Local Data
Data Source
Page Source
Description

List of Countries by Continent. (countries)
Local Data
Data Source
Problem Source
Description

GDP Per Capita by Purchasing Power Parities
Local Data
Data Source
Problem Source
Description

The Sean Lahman's Baseball Database
Local Data
Data Source
Problem Source
Description
Notebooks
Here we present Python notebooks with the code used to create most of the figures appearing in "The Data Science Design Manual".
These notebooks were created by Yeseul Lee, and serve as practical examples in analysis and visualization in Python.
We provide links to the GitHub page as well as local copies of each notebook

Chapter 2: Mathematical Preliminaries

Chapter 4: Scores and Rankings

Chapter 5: Statistical Analysis

Chapter 6: Visualizing Data

Chapter 7: Mathematical Models

Chapter 8: Linear Algebra

Chapter 9: Linear and Logistic Regression

Chapter 10: Distance and Network Methods

Chapter 11: Machine Learning