bestcourses is supported by learners. When you buy through links on our website, we may earn an affiliate commission. Learn more

Exploratory Data Analysis in Python

A course about how to approach a dataset for the first time

4.58 / 5.0
4865 students1 hours 50 minutes

Created by Gianluca Malato, offered on Udemy

bestcourses score™

Student feedback

5.4/10

To make sure that we score courses properly, we pay a lot of attention to the reviews students leave on courses and how many students are taking a course in the first place. This course has a total of 4865 students which left 80 reviews at an average rating of 4.58, which is average.

Course length

8.7/10

We analyze course length to see if courses cover all important aspects of a topic, taking into account how long the course is compared to the category average. This course has a length of 1 hours 50 minutes, which is pretty short. This might not be a bad thing, but we've found that longer courses are often more detailed & comprehensive. The average course length for this entire category is 7 hours 54 minutes.

Overall score

5.8/10

This course currently has a bestcourses score of 5.8/10, which makes it an average course. Overall, there are probably better courses available for this topic on our platform.

Description

When we put our hands on a dataset for the first time, we can’t wait to test several models and algorithms. This is wrong because if we don’t know the information before feeding our model, the results will be unreliable and the model itself will surely fail. Moreover, if we don’t select the best features in advance, the training phase becomes slow and the model won’t learn anything useful.

So, the first approach we must have is to take a look at our dataset and visualize the information it contains. In other words, we have to explore it.

That’s the purpose of the Exploratory Data Analysis.

EDA is an important step of data science and machine learning. It helps us explore the information hidden inside a dataset before applying any model or algorithm. It makes heavy use of data visualization, it’s bias-free.

Moreover, it lets us figure out whether our features have predictive power or not, determining if the machine learning project we are working on has chances to be successful. Without EDA, we may give the wrong data to a model without reaching any success.

With this course, the student will learn:

  • How to visualize information that is hidden inside the dataset

  • How to visualize the correlation and the importance of the columns of a dataset

  • Some useful Python libraries

All the lessons are practical and made using Python programming language and Jupyter notebooks. All the notebooks are downloadable.

What you will learn

  • Exploring a dataset for calculating overall statistics
  • Visualize the correlations between the features
  • Visualize the predictive power of the features
  • Create useful insights from a dataset

Requirements

  • Python programming language
Udemy logo
Available on

Udemy

With almost 200,000 courses and close to 50 million students, Udemy is one of the most visited online learning platforms. Popular topics include software development, the digital economy, but also more traditional topics like cooking and music.

Frequently asked questions

  • Price: $19.99
  • Platform: Udemy
  • Language: English
  • 1 hours 50 minutes
Exploratory Data Analysis in Python thumbnail

bestcourses score: 5.8/10

There might be better courses available for this topic.