Introduction to Data Science#
Overview#
You will start with an overview of the course and then an introduction to statistics and data science. You will learn some of the tools of the trade in scientific python, particularly numerical python, Jupyter nodebooks, and handling data.
Goals#
Getting overview of the course, activities and policies
Setting up your environment on Google Colab
Gain familiarity with Jupyter Notebooks and Numerical python
Learn about handling and describing data
Learn about the importance of clustering data in physics
Learn how to find structure in data (clustering)
KMeans, Spectral Clustering, DBSCAN
Measure and reduce dimensionality
Adapt linear models to nonlinear problems
Learn about Kernel functions
Lecture Materials#
Homework Assignment#
Supplemental Readings#
A Whirlwind Tour of Python, Jake VanderPlas: free PDF, notebooks online.
Nonlinear Dimensionality Reduction by Locally Linear Embedding