Introduction to Data Science#

Overview#

You will start with an overview of the course and then an introduction to statistics and data science. You will learn some of the tools of the trade in scientific python, particularly numerical python, Jupyter nodebooks, and handling data.

Goals#

  • Getting overview of the course, activities and policies

  • Setting up your environment on Google Colab

  • Gain familiarity with Jupyter Notebooks and Numerical python

  • Learn about handling and describing data

  • Learn about the importance of clustering data in physics

  • Learn how to find structure in data (clustering)

    • KMeans, Spectral Clustering, DBSCAN

  • Measure and reduce dimensionality

  • Adapt linear models to nonlinear problems

  • Learn about Kernel functions

Lecture Materials#

Homework Assignment#

Supplemental Readings#