Skip to main content
x

Introduction to Data Science in Python

Language

English

Course format Online
Date 2021-01-13 - 2021-02-13
Duration 4 weeks, 8 hours per week
Cost Free, certification 40$US

This course will introduce the learner to the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv files, and the numpy library. The course will introduce data manipulation and cleaning techniques using the popular python pandas data science library and introduce the abstraction of the Series and DataFrame as the central data structures for data analysis, along with tutorials on how to use functions such as groupby, merge, and pivot tables effectively. By the end of this course, students will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses.

This course should be taken before any of the other Applied Data Science with Python courses: Applied Plotting, Charting & Data Representation in Python, Applied Machine Learning in Python, Applied Text Mining in Python, Applied Social Network Analysis in Python.

Learning outcomes

  • Understand techniques such as lambdas and manipulating csv files

  • Describe common Python functionality and features used for data science

  • Query DataFrame structures for cleaning and processing

  • Explain distributions, sampling, and t-tests

Files/Documents

ISCED Categories

Statistics
Scientific modelling