Back to Catalog

Exploratory Data Analysis with Python

IntermediateGuided Project

Python is a popular programming language for data analysis due to its numerous libraries. In this guided project, you will use Python to perform data analysis and determine which variables are important to feed prediction models.

4.4 (323 Reviews)

Language

  • English

Topic

  • Data Science

Enrollment Count

  • 1.26K

Skills You Will Learn

  • Python, Data Science, Data Analysis

Offered By

  • IBMSkillsNetwork

Estimated Effort

  • 30 minutes

Platform

  • SkillsNetwork

Last Update

  • May 3, 2024
About This Guided Project
Data analysis involves a number of steps. With Python, you can import data, visualize data, and perform descriptive statistical analysis. You can also group data and perform Analysis of Variance (ANOVA) tests.

Python, a popular programming language used in data analysis, provides a wide range of libraries and tools for handling, cleaning, and visualizing data. Python’s simplicity, versatility, and integration with other technologies make it a valuable tool for data analysts and scientists in many industries.

You’ll begin this guided project by importing data from multiple sources using Python. You’ll then analyze individual feature patterns using visualization techniques to get a better understanding of the data. Next, you’ll perform descriptive statistical analysis to summarize the data and gain insights into its distribution and relationships. You’ll learn the basics of grouping data to compare and analyze subsets of data. You will also study the concepts of correlation and causation to understand the relationship between different features of the data. Finally, you will learn about ANOVA, a statistical method to determine the difference between two or more means. 

After completing this project, you’ll be ready to apply your new Python skills for data analysis and prediction modeling.

A Look at the Project Ahead

After completing this project, you'll be able to:
  • Import data using Python
  • Analyze data using visualization techniques
  • Perform descriptive statistical analysis
  • Conduct ANOVA tests

What You'll Need

 For this project, you will need:
  • Familiarity with Python and pandas
  • Familiarity with principles of data science
  • A web browser

You’ll have access to a Cloud-based IDE that includes all of the required software, including a Python environment, pre-installed. All you need is a recent version of a modern web browser to complete this project. 

Instructors

Joseph Santarcangelo

Senior Data Scientist at IBM

Joseph has a Ph.D. in Electrical Engineering, his research focused on using machine learning, signal processing, and computer vision to determine how videos impact human cognition. Joseph has been working for IBM since he completed his PhD.

Read more