Exploring Spark's GraphX
IntermediateCourse
Spark provides a graph-parallel computation library in GraphX. Graph-parallel is a paradigm that allows representation of your data as vertices and edges. Spark's GraphX provides a set of fundamental operators in addition to a growing collection of algorithms and builders to simplify graph analytics tasks.

Language
- English
Topic
- Big Data
Offered By
- BDU
Platform
- SkillsNetwork
Last Update
- February 11, 2025
About this Course
ABOUT THIS SPARK GRAPHX COURSE
Spark provides a graph-parallel computation library in GraphX. Graph-parallel is a paradigm that allows representation of your data as vertices and edges. Spark GraphX provides a set of fundamental operators in addition to a growing collection of algorithms and builders to simplify graph analytics tasks.
In this course, you will learn about Spark GraphX components and the background of graph-parallel operations. You will see how Spark implements this with RDDs and how it compares vs Data Parallel operations. You will get to explore how to visualize your data using various graph operators.
COURSE SYLLABUS
Module 1 - Introduction to Graph-Parallel
- Learned about GraphX components, construction, and background
- See how Data Parallel, Graph-Parallel, and RDDs tie in with GraphX
Module 2 - Visualizing Spark GraphX and Exploring Graph Operators
- Learn about how GraphX handles visualizations, create views and look alternative options
- Take a look at few introductory Graph Operators and PageRank
Module 3 - Modifying Spark GraphX
- Understand how GraphX deals with modifications and RDDs
- Take a look at a Property Operators, Structural Operators and how to utilize them
Module 4 - Neighborhood Aggregation and Caching
- Learn about Neighborhood Aggregation - aggregateMessages
- Learn how to cache with GraphX and take a look at other graph
GENERAL INFORMATION
- This course is self-paced.
- It can be taken at any time.
- It can be audited as many times as you wish.
RECOMMENDED SKILLS PRIOR TO TAKING THIS COURSE
- None
REQUIREMENTS
- None
COURSE STAFF

Kevin Wong
Kevin Wong is a Technical Curriculum Developer. He enjoys developing courses that focuses on the education in the Big Data field. Kevin updates courses to be compatible with the newest software releases, recreates courses on the new cloud environment, and develops new courses such as Introduction to Machine Learning. In addition to contributing to the content on Cognitive Class, he has worked with various components that deal with Big Data, including Hadoop, Pig, Hive, Phoenix, HBase, MapReduce & YARN, Sqoop and Oozie. Kevin is working on obtaining a degree in Computer Engineering from the University of Alberta.

Language
- English
Topic
- Big Data
Offered By
- BDU
Platform
- SkillsNetwork
Last Update
- February 11, 2025
Instructors

Nayef Abou Tayoun
Read more