Back to Catalog

Discovering roots with the Porter stemming algorithm

BeginnerGuided Project

Explore the Natural Language Processing (NLP) technique, stemming, with the Porter stemming algorithm. Learn about the different types of stemming and how its used in search engines and spell checking by using Buddha's text file for practical application. Stemming is used to reduce words to their root or base form, known as the "stem." This NLP technique involves removing suffixes and prefixes from words to normalize them, allowing different variations of the same word to be treated as equivalent.

Language

  • English

Topic

  • Artificial Intelligence

Enrollment Count

  • 51

Skills You Will Learn

  • Python, NLP, Data Science, Artificial Intelligence, Data Analysis

Offered By

  • IBMSkillsNetwork

Estimated Effort

  • 30 minutes

Platform

  • SkillsNetwork

Last Update

  • March 13, 2025
About this Guided Project
Explore the Natural Language Processing (NLP) concept called stemming and dive into the Porter stemming algorithm. Stemming is all about making words shorter and simpler to understand.  In this guided project, you'll learn how to apply the Porter stemming algorithm to Buddha's words in Python. You'll complete an exercise where you compare Snowball and Porter stemming to see which one you like best.

This hands-on project is based on the Stemming text using the Porter stemming algorithm in Python tutorial created by Jacob Murel.

A Look at the Project Ahead

After completing this guided project, you will be able to:
  • Understand text analysis in NLP.
  • Recognize stemming and its various types.
  • Learn the principles and rules behind the Porter stemming algorithm.
  • Implement the Porter stemming algorithm in Python.

What You'll Need

While prior experience with NLP is not required, having some familiarity with Python concepts, string manipulation, and regular expressions will make the project easier to complete. However, the project is designed to guide you through the process, step by step, so that you can learn and master Porter stemming, even if you have no prior experience in NLP.

Instructors

Jigisha Barbhaya

Data Scientist

I am a Data scientist at IBM and Lead instructor at Skills network. I love to learn and educate. I have completed my MSc(Computer Application) specialisation in Data science from Symbiosis University.

Read more

Contributors

Kang Wang

Data Scientist

I am a Data Scientist in the IBM. I am also a PhD Candidate in the University of Waterloo.

Read more