Building a Big Data Analytics Stack [Video]

Tomasz Lelek

Learn about the Big Data tools needed to build a Big Data stack

Video Details

ISBN 13: 9781787125018
Course Length: 1 hour and 31 minutes

Video Description

Building a Big Data ecosystem is hard. A variety of technologies are available, and each has its own pros and cons. As software engineers building a big data pipeline, we need to use lower-level tools and APIs such as HBase and Apache Spark.

In this course, we’ll check out HBase, a database built on top of HDFS. Moving on, we’ll have a bit of fun with Spark MLlib. Finally, you’ll get an understanding of ETL and deploy a Hadoop project to the cloud.

By the end of the course, you’ll be able to use more high-level tools that have more user-friendly, declarative APIs such as Pig and Hive.

Style and Approach

This course will give you both a knowledge-based understanding and practical hands-on experience of Hadoop 2.7. It also looks at Spark, Pig, Hive, HBase, and YARN, so you can understand how to implement these components while using Hadoop clusters.

Table of Contents

Pig and Hive
  • The Course Overview
  • Introduction to Pig
  • Introduction to Hive
  • Hive Query Language

Spark Your Engines
  • Writing Spark Jobs
  • Introducing YARN
  • Creating Spark Job

HBase the Hadoop Database
  • HBase and HDFS
  • Using HBase Database from Java Application

Machine Learning Toolkit
  • Composing Spark ML Pipelines
  • Build a Recommendation System Using Collaborative Filtering

AWS EMR
  • ETL
  • Introducing AWS EMR
  • Creating S3 and EMR Cluster
  • Running Jobs in Series Using EMR Java API

What You Will Learn

  • Use Pig and Hive in a non-Java way to understand the power of Hadoop
  • Explore Spark and use it for stream and batch processing
  • Use the HBase database from a Java application
  • Find out more about the machine learning toolkit and its use with Spark
  • Know how to leverage the strengths of Big Data tools

