Flume Pluggables (aka Plugins)

Contact info at my website or @nisaydhm

About Apache Flume

"Apache Flume is a distributed, reliable, and available system for efficiently collecting, aggregating and moving large amounts of log data from many different sources to a centralized data store."

(from Flume User Guide).

Dev Environment Setup

To set up Eclipse so you can modify and compile the code yourself, see my post How to write Custom Flume components.

About this repo

Flume is flexible and allows custom code to be plugged in for many of its components. The most likely candidates for custom code are Flume's Interceptors, Event Handlers (for the HTTP Source), and Serializers (e.g. the Avro Serializer). Less common, but still replaceable with custom implementations, are Channel Selectors, JMS Message Converters, and Sink Processors.

This repo aims to provide a collection of custom Apache Flume Serializers, Handlers, and Interceptors which can be used as a reference point for implementing your own. The code assumes an HTTP Source and an HDFS Sink, and works with both File and Memory Channels. If you are only going to use the Memory Channel, see the NOTE in the README.md file in the eventserializer folder.

Flume Agent Configuration

Agent configuration files are in the ./conf directory.
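
For orientation, here is a minimal sketch of how an agent could wire an HTTP Source, a Memory Channel, and an HDFS Sink to custom components. It is not copied from ./conf: the agent and component names (`agent`, `http-src`, `mem-ch`, `hdfs-sink`), the port, the HDFS path, and the `com.example.flume` package are all placeholders, and the nested `Builder` classes are assumed. Check the actual files in ./conf for the real values.

```properties
# Hypothetical agent layout; every name, port, and path here is a placeholder.
agent.sources = http-src
agent.channels = mem-ch
agent.sinks = hdfs-sink

# HTTP Source with a custom handler and an optional interceptor.
agent.sources.http-src.type = http
agent.sources.http-src.port = 8080
agent.sources.http-src.handler = com.example.flume.MetricsEventHandler
agent.sources.http-src.interceptors = i1
# Custom interceptors are referenced through their Builder class (assumed nested here).
agent.sources.http-src.interceptors.i1.type = com.example.flume.MetricsServerInterceptor$Builder
agent.sources.http-src.channels = mem-ch

# Memory Channel; use "file" instead for the File Channel.
agent.channels.mem-ch.type = memory

# HDFS Sink with a custom Avro serializer (Builder class assumed nested here).
agent.sinks.hdfs-sink.type = hdfs
agent.sinks.hdfs-sink.hdfs.path = /flume/events
agent.sinks.hdfs-sink.hdfs.fileType = DataStream
agent.sinks.hdfs-sink.serializer = com.example.flume.MetricsEventAvroSerializer$Builder
agent.sinks.hdfs-sink.channel = mem-ch
```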

Events

  • MetricsEvent
  • ProbeReportEvent
  • TrackingEvent

Event Handlers

  • MetricsEventHandler
  • ProbeReportEventHandler
  • TrackingEventHandler

Interceptors

  • MetricsServerInterceptor
  • TrackingDeviceTypeInterceptor

Avro Serializers

  • MetricsEventAvroSerializer
  • ProbeReportEventAvroSerializer
  • TrackingEventAvroSerializer

Event Parser

  • MetricsEventParser

Event Model

  • MetricsModel
  • TrackingErrorEventModel

Flume Event Flow



           +-------------+   +-------------+   +---------------+   +-------------+   +-------------+
 Events    | HTTP Source |   |Interceptor  |   |File / Memory  |   |Avro         |   |Hadoop / HDFS|
+--------->+ Handler     +-->|(optional)   +-->|Channel        |-->|Serializer   |-->|  Sink       |
           +-------------+   +-------------+   +---------------+   +--+-----+----+   +-------------+
                                                                       |     ^
                                                                       |     |
                                                                       v     |
                                                                    +--------+----+
                                                                    | Event Parser|
                                                                    | (optional)  |
                                                                    +-------------+

(ASCII diagram created via: ASCIIFlow)
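
To make the order of the stages in the diagram concrete, here is a small plain-Java model of the flow. It is only a sketch: it does not use the Flume API or any class from this repo. `EventFlowSketch`, `SimpleEvent`, the `deviceType` header, and the `key=value` payload format are all hypothetical stand-ins chosen for illustration.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;

// Plain-Java model of the diagram above. None of these types are Flume classes.
final class EventFlowSketch {

    /** Hypothetical stand-in for a Flume event: headers plus a body. */
    static final class SimpleEvent {
        final Map<String, String> headers = new LinkedHashMap<>();
        final String body;
        SimpleEvent(String body) { this.body = body; }
    }

    /** Handler stage: turns a raw HTTP payload into events, one per non-empty line. */
    static List<SimpleEvent> handle(String payload) {
        List<SimpleEvent> events = new ArrayList<>();
        for (String line : payload.split("\n")) {
            String trimmed = line.trim();
            if (!trimmed.isEmpty()) events.add(new SimpleEvent(trimmed));
        }
        return events;
    }

    /** Optional interceptor stage: tags each event with a header. */
    static SimpleEvent intercept(SimpleEvent event) {
        event.headers.put("deviceType", event.body.contains("mobile") ? "mobile" : "other");
        return event;
    }

    /** Optional parser stage: splits a "key=value key=value" body into fields. */
    static Map<String, String> parse(String body) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (String token : body.split(" ")) {
            int idx = token.indexOf('=');
            if (idx > 0) fields.put(token.substring(0, idx), token.substring(idx + 1));
        }
        return fields;
    }

    /** Serializer stage: asks the parser for fields and renders one output record. */
    static String serialize(SimpleEvent event) {
        Map<String, String> fields = parse(event.body);
        return event.headers.get("deviceType") + "," + fields.getOrDefault("page", "");
    }

    /** Runs a payload through handler, interceptor, channel, serializer, and sink. */
    static List<String> run(String payload) {
        Queue<SimpleEvent> channel = new ArrayDeque<>();  // stands in for the File/Memory Channel
        for (SimpleEvent e : handle(payload)) channel.add(intercept(e));
        List<String> sink = new ArrayList<>();            // stands in for the HDFS Sink
        while (!channel.isEmpty()) sink.add(serialize(channel.poll()));
        return sink;
    }

    public static void main(String[] args) {
        run("page=home client=mobile\npage=cart client=web").forEach(System.out::println);
    }
}
```

In a real agent each stage is a separate Flume component selected through the agent configuration; the model collapses them into static methods only so the sequence is easy to follow.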