Building a Big Data Analytics Stack [Video]

Tomasz Lelek

Learn about the Big Data tools needed to create a Big Data stack

Video Details

ISBN 13: 9781787125018
Course Length: 1 hour and 31 minutes

Video Description

Building a Big Data ecosystem is hard. There are a variety of technologies available, and every one of them has its own pros and cons. As software engineers, we need to use lower-level tools and APIs such as HBase and Apache Spark when building a big data pipeline.

In this course, we’ll check out HBase, a database built on top of HDFS. Moving on, we’ll have a bit of fun with Spark MLlib. Finally, you’ll get an understanding of ETL and deploy a Hadoop project to the cloud.

By the end of the course, you’ll also be able to use higher-level tools that have more user-friendly, declarative APIs, such as Pig and Hive.
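
To make the contrast between low-level and declarative APIs concrete, here is a minimal sketch in plain Python. It is not taken from the course and does not use Hadoop, Spark, Pig, or Hive; it is only an analogy. The function names and the sample data are hypothetical. The first function spells out the map, shuffle, and reduce steps by hand, while the second states the desired result and leaves the mechanics to the library.

```python
from collections import Counter, defaultdict

# Hypothetical sample input, standing in for lines of a text file.
LINES = ["big data is hard", "big data tools help"]


def word_count_low_level(lines):
    """Count words by writing out each processing step explicitly."""
    # Map step: emit one (word, 1) pair per word.
    pairs = [(word, 1) for line in lines for word in line.split()]
    # Shuffle step: group the emitted values by key.
    grouped = defaultdict(list)
    for word, one in pairs:
        grouped[word].append(one)
    # Reduce step: sum the values collected for each key.
    return {word: sum(ones) for word, ones in grouped.items()}


def word_count_declarative(lines):
    """Count words by stating the result wanted, not the steps."""
    return dict(Counter(word for line in lines for word in line.split()))


if __name__ == "__main__":
    print(word_count_low_level(LINES))
    print(word_count_declarative(LINES))
```

Both functions return the same counts. The low-level version gives control over every step, while the declarative one is shorter and easier to read, which is roughly the trade-off the description draws between the two groups of tools.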

Style and Approach

This course will give you both a knowledge-based understanding and practical hands-on experience of Hadoop 2.7. It also looks at Spark, Pig, Hive, HBase, and YARN, so you can understand how to implement these components while using Hadoop clusters.

Table of Contents

Pig and Hive
  • The Course Overview
  • Introduction to Pig
  • Introduction to Hive
  • Hive Query Language
Spark Your Engines
  • Writing Spark Jobs
  • Introducing YARN
  • Creating Spark Job
HBase the Hadoop Database
  • HBase and HDFS
  • Using HBase Database from Java Application
Machine Learning Toolkit
  • Composing Spark ML Pipelines
  • Build a Recommendation System Using Collaborative Filtering
AWS EMR
  • ETL
  • Introducing AWS EMR
  • Creating S3 and EMR Cluster
  • Running Jobs in Series Using EMR Java API

What You Will Learn

  • Use Pig and Hive to tap the power of Hadoop without writing Java
  • Explore Spark and use it for stream and batch processing
  • Use the HBase database from a Java application
  • Find out more about the machine learning toolkit and its use with Spark
  • Know how to leverage the strengths of Big Data tools
