Understanding, Extracting, Sourcing and Processing Data

Welcome to our second lecture at STRV Data Science Academy. In the first part of the lecture, you will have a chance to play around with three new libraries: numpy, pandas and matplotlib. These libraries are essential for the data wrangling and analysis that you will practice in the second part of the lecture.

The Plan

| Section | Time |
| --- | --- |
| Introduction to NumPy | 20 minutes |
| Introduction to Pandas | 25 minutes |
| Introduction to Matplotlib | 10 minutes |
| Break | 15 minutes |
| Acquiring Data | 10 minutes |
| Doing Exploratory Data Analysis (EDA) | 20 minutes |
| Data Preparation for ML | 20 minutes |

If you missed the class or want to revisit some of the content, download the lecture recordings: part 1 and part 2.