Data Science Essentials – Course by EdX

Level: Beginner
Duration: 6 weeks
Delivery: Online
Certification: N/A
Cost: 99
Course Provider: Microsoft / EdX


You will learn key concepts in data acquisition, preparation, exploration, and visualization taught alongside practical application oriented examples such as how to build a cloud data science solution using R, and Python on Azure stack.

Along the course, you will explore the data science process, understand probability and statistics in data science, learn data exploration and visualization, data ingestion, cleansing, and transformation, and you will receive an introduction to machine learning. The hands-on elements of this course leverage a combination of R, Python, and Microsoft Azure Machine Learning.

Training Course Content

Explore the data science process – An Introduction
Understand data science thinking

  • Know the data science process
  • Use AML to create and publish a first machine learning experiment
  • Lab: Creating your first model in Azure Machine Learning


Probability and statistics in data science
Understand and apply confidence intervals and hypothesis testing

  • Understand the meaning and application of correlation Know how to apply simulation
  • Lab: Working with probability and statistics
  • Lab: Simulation and hypothesis testing


Working with data – Ingestion and preparation
Know the basics of data ingestion and selection

  • Understand the importance and process for data cleaning, integration and transformation
  • Lab: Data ingestion and selection – new
  • Lab: Data munging with Azure Machine Learning, R, and Python on Azure stack


Data Exploration and Visualization
Know how to create and interpret basic plot types

  • Understand the process of exploring datasets
  • Lab: Exploring data with visualization with Azure Machine Learning, R and Python


Introduction to Supervised Machine Learning
Understand the basic concepts of supervised learning

  • Understand the basic concepts of unsupervised learning
  • Create simple machine learning models in AML
  • Lab: Classification of people by income
  • Lab: Auto price prediction with regression


Lab: K-means clustering with Azure Machine Learning

Who Is It For?



Add a Verified Certificate for $99 USD

About the Provider

Graeme Malcolm, Senior Content Developer, Microsoft Learning Experiences; trainer, consultant, and author, specializing in SQL Server and the Microsoft data platform. He is a Microsoft Certified Solutions Expert for the SQL Server Data Platform and Business Intelligence. After years of working with Microsoft as a partner and vendor, he now works in the Microsoft Learning Experiences team as a senior content developer, where he plans and creates content for developers and data professionals who want to get the best out of Microsoft technologies.

Steve Elston, Managing Director, Quantia Analytics, LLC; big data geek and data scientist, with over two decades of experience using R and S/SPLUS for predictive analytics and machine learning. He holds a PhD degree in Geophysics from Princeton University, and has led multi-national data science teams across various companies.

Cynthia Rudin, Associate Professor, MIT and Duke; leads the Prediction Analysis Lab at MIT, and is associated with the Computer Science and Artificial Intelligence Laboratory and the Sloan School of Management. She holds a PhD in applied and computational mathematics from Princeton University, and was previously, an associate research scientist at the Center for Computational Learning Systems at Columbia U.

Rate this Article

All fields marked with red asterisks are required fields.

User Reviews

· October 31, 2018

If you can dedicate the 6 weeks, this is a very worthwhile course. It's a great introduction into data science essentials that will help greatly in finding a job.

Your compare list