Apache Flume: Distributed Log Collection for Hadoop

English

Last updated Fri, 02-Feb-2024

Course overview

If you are a Hadoop programmer who wants to learn how to use Flume to move datasets into Hadoop in a timely and replicable manner, this course is ideal for you. No prior knowledge of Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop Distributed File System (HDFS) is assumed.

What will I learn?

Understand the Flume architecture and learn how to download and install open-source Flume from Apache
Follow along with a detailed example of transporting weblogs in near real time (NRT) to Kibana/Elasticsearch and archiving them in HDFS
Learn tips and tricks for transporting logs and data in your production environment
Understand and configure the Hadoop Distributed File System (HDFS) sink
Use a Morphline-backed sink to feed data into Solr
Create redundant data flows using sink groups
Configure and use various sources to ingest data
Inspect data records and move them between multiple destinations based on payload content
Transform data en route to Hadoop and monitor your data flows

Curriculum for this course

1 lesson, 5 hrs 56 mins

Apache Flume: Distributed Log Collection for Hadoop (05:56:00)

Includes:

5 hrs 56 mins of on-demand video
1 lesson
Access on mobile and TV
Full lifetime access
