Data Ingestion With Flume & Kafka
Training Goals
To provide a thorough understanding of Flume configuration & Kafka. Participants will be able to implement practical data flows in their projects.Pre-requisite : Some programming background, preferably Java.
Contents
Introduction to multiplexed data flows, fan-out flows, aggregators.
Implementing Custom De-Serialisers and Interceptors.
Advanced Flume Configuration.
Kafka Architecture – Publish/Subscribe Model
Implementing custom Publishers
Kafka Consumers – HDFS consumer, HBase consumer, Cassandra Consumer and many others.
ETL developers, Java developers, Analytics professionals and Hadoop developers.
Intended Audience
Methodology
The program is designed to provide an overview of Cassandra. Key concepts in each area will be explained and working code provided. Participants will be able to run the examples and expected to understand code on their own with some pointers. Detailed code walk-though is not provided. Code is written in Java.