-
2012 London Summer Olympics Data
Local Data
Data Source
Problem Source
Description
-
Movie Data
Local Data
Data Source
Problem Source
Description
-
Computer Network Traffic Data
Local Data
Data Source
Problem Source
Description
-
2015-2016 New York City Manhattan Rolling Sales Data
Local Data
Data Source
Page Source
Description
-
List of Countries by Continent. (countries)
Local Data
Data Source
Problem Source
Description
-
GDP Per Capita by Purchasing Power Parities
Local Data
Data Source
Problem Source
Description
-
The Sean Lahman's Baseball Database
Local Data
Data Source
Problem Source
Description
Notebooks
Here we present Python notebooks with the code used to create most of the figures appearing in "The Data Science Design Manual".
These notebooks were created by Yeseul Lee, and serve as practical examples in analysis and visualization in Python.
We provide links to the GitHub page as well as local copies of each notebook
-
Chapter 2: Mathematical Preliminaries
-
Chapter 4: Scores and Rankings
-
Chapter 5: Statistical Analysis
-
Chapter 6: Visualizing Data
-
Chapter 7: Mathematical Models
-
Chapter 8: Linear Algebra
-
Chapter 9: Linear and Logistic Regression
-
Chapter 10: Distance and Network Methods
-
Chapter 11: Machine Learning