Data Science with Python 4 days (32 hours)
Audience: Previous experience with Python is required.
Description: Python has become the premier language of data science. Not only is the syntax short and readable, but Python now has widely-used libraries to cover any use case.
This course will cover the essentials of the Python data science ecosystem. Students will use Numpy and Pandas to analyze tabular data, create graphs with Matplotlib, and build predictive models with Scikit-Learn.
By the end of the course, students will be able to extract data from various sources, do exploratory analysis and basic machine learning.
This course will alternate between short lectures and exercise sessions, so that students leave with both theoretical and hands-on understanding. All slides, exercises and solutions will be available online afterwards for further practice
For availability and pricing, please schedule a call or send a message.
Syllabus
Review
- Jupyter Notebooks
- Functions and Methods
- Booleans
- Lists
- Dictionaries
- Importing Packages
Numpy
- Numpy Arrays
- Broadcasting
- Element-Wise Operations
- Boolean Indexing
- Boolean Filters
- Numpy Methods
Pandas Dataframes
- Dataframe
- Series & Index
- Filtering a Dataframe
Data Analysis
- Summary Statistics
- Group By
- Pivot Tables
Sources of Data
- Database Tables
- APIs
- Timeseries data
Matplotlib
- PyPlot
- Style Configuration
- Line Plots
- Bar Charts
- Histograms
Scikit-Learn
- Linear Regression
- Logistic Regression