PySpark for Beginners [Video]

PySpark for Beginners [Video]

Tomasz Drabas

Build data-intensive applications locally and deploy at scale using the combined powers of Python and Spark 2.0
Packt Subscription
FREE
$9.99/m after trial
Video
$106.25
RRP $124.99
Save 14%
What do I get with a Packt subscription?
  • Exclusive monthly discount - no contract
  • Unlimited access to entire Packt library of 6500+ eBooks and Videos
  • 120 new titles added every month, on new and emerging tech
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
$0.00
$106.25
$9.99 p/m after trial
RRP $124.99
Subscription
Video
Start a FREE 10-day trial

Frequently bought together


PySpark for Beginners [Video] Book Cover
PySpark for Beginners [Video]
$ 124.99
$ 106.25
Hands-On PySpark for Big Data Analysis [Video] Book Cover
Hands-On PySpark for Big Data Analysis [Video]
$ 124.99
$ 106.25
Buy 2 for $212.50
Save $37.48
Add to Cart

Video Description

Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. This course will show you how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Spark 2.0 architecture and how to set up a Python environment for Spark. You will get familiar with the modules available in PySpark. You will learn how to abstract data with RDDs and DataFrames and understand the streaming capabilities of PySpark. Also, you will get a thorough overview of machine learning capabilities of PySpark using ML and MLlib, graph processing using GraphFrames, and polyglot persistence using Blaze. Finally, you will learn how to deploy your applications to the cloud using the spark-submit command. By the end of this course, you will have established a firm understanding of the Spark Python API and how it can be used to build data-intensive applications.

All the code and supporting files for this course are available on Github at https://github.com/PacktPublishing/PySpark-for-Beginners

Style and Approach

This course takes a very comprehensive, step-by-step approach so you understand how the Spark ecosystem can be used with Python to develop efficient, scalable solutions. Every section is standalone and defined in a very easy-to-understand manner.

Video Preview

What You Will Learn

  • Learn about Apache Spark and the Spark 2.0 architecture
  • Build and interact with Spark DataFrames using Spark SQL
  • Read, transform, and understand data and use it to train machine learning models
  • Build machine learning models with MLlib and ML

Authors

Video Details

ISBN 139781789538762
Course Length1 hour and 34 minutes
Read More

Read More Reviews

Recommended for You

Hands-On PySpark for Big Data Analysis [Video] Book Cover
Hands-On PySpark for Big Data Analysis [Video]
$ 124.99
$ 106.25
Apache Spark with Python - Big Data with PySpark and Spark [Video] Book Cover
Apache Spark with Python - Big Data with PySpark and Spark [Video]
$ 149.99
$ 127.50
Linux Command Line for Beginners [Video] Book Cover
Linux Command Line for Beginners [Video]
$ 184.99
$ 157.25
Introduction to PostgreSQL Databases with PgAdmin For Beginners [Video] Book Cover
Introduction to PostgreSQL Databases with PgAdmin For Beginners [Video]
$ 183.99
$ 156.40
Learning PySpark [Video] Book Cover
Learning PySpark [Video]
$ 124.99
$ 106.25
Hands-On Big Data Analytics with PySpark Book Cover
Hands-On Big Data Analytics with PySpark
$ 19.99
$ 14.00