Avro support #1844

Closed
gianm opened this issue Oct 21, 2015 · 3 comments

@gianm (Contributor) commented Oct 21, 2015

Should probably be an extension.

For realtime we need a ByteBufferInputRowParser (something similar to the ProtoBufInputRowParser, but for Avro).
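
A minimal sketch of what such a parser might look like, using Avro's GenericDatumReader to decode one record from the buffer. The class name and the exact wiring into ByteBufferInputRowParser are hypothetical; a real parser would also map the decoded record onto a Druid InputRow via a parseSpec:

```java
// Hedged sketch (class name hypothetical): an Avro analogue of
// ProtoBufInputRowParser. Assumes the reader schema is known up front;
// schema evolution would need a registry (see discussion below).
import java.io.IOException;
import java.nio.ByteBuffer;

import org.apache.avro.Schema;
import org.apache.avro.generic.GenericDatumReader;
import org.apache.avro.generic.GenericRecord;
import org.apache.avro.io.BinaryDecoder;
import org.apache.avro.io.DecoderFactory;

public class AvroBinaryInputRowParser // would implement ByteBufferInputRowParser
{
  private final GenericDatumReader<GenericRecord> reader;

  public AvroBinaryInputRowParser(Schema schema)
  {
    this.reader = new GenericDatumReader<>(schema);
  }

  public GenericRecord parseToRecord(ByteBuffer input) throws IOException
  {
    // Copy the buffer's remaining bytes and read a single Avro record.
    byte[] bytes = new byte[input.remaining()];
    input.get(bytes);
    BinaryDecoder decoder = DecoderFactory.get().binaryDecoder(bytes, null);
    return reader.read(null, decoder);
    // A real parser would then map the record's fields onto a Druid
    // InputRow (timestamp, dimensions, metrics) via the parseSpec.
  }
}
```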

For batch we need a recommended Avro-aware InputFormat and an InputRowParser that can read whatever type is returned by that InputFormat. I haven't used Avro before, so I'm not sure what the right choice of InputFormat is. AvroKeyInputFormat from https://avro.apache.org/docs/1.7.0/api/java/org/apache/avro/mapreduce/AvroKeyInputFormat.html seems like a possible candidate.
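
For reference, a hedged sketch of wiring AvroKeyInputFormat into a Hadoop job; mappers then receive (AvroKey&lt;GenericRecord&gt;, NullWritable) pairs. AvroJob.setInputKeySchema is the real Avro helper; the surrounding job class is just illustrative:

```java
// Illustrative sketch: configuring a Hadoop job to read Avro files
// with AvroKeyInputFormat.
import org.apache.avro.Schema;
import org.apache.avro.mapreduce.AvroJob;
import org.apache.avro.mapreduce.AvroKeyInputFormat;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class AvroBatchJobSketch
{
  public static Job configure(Configuration conf, Schema readerSchema) throws Exception
  {
    Job job = Job.getInstance(conf, "avro-ingest-sketch");
    job.setInputFormatClass(AvroKeyInputFormat.class);
    // Tell Avro which reader schema to use when deserializing the files.
    AvroJob.setInputKeySchema(job, readerSchema);
    // Mappers then receive (AvroKey<GenericRecord>, NullWritable) pairs,
    // and a Druid InputRowParser would turn each GenericRecord into a row.
    return job;
  }
}
```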

@himanshug (Contributor)
We have had Avro working for a while, but the code is not generic enough and is very specific to our schemas. In fact, it will not be possible to take a general Avro schema and convert it to a Druid row, because Avro supports many complex types, so we will have to compromise anyway. Also, it was written in the pre-druid-0.8.0 era, when it wasn't possible to have InputFormats that returned anything but Text records.
With druid-0.8.2, that limitation regarding Text records is gone. In my org, some people are working on the next generation of Druid's Avro integration.
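
To illustrate the compromise: Druid rows are flat maps of dimensions and metrics, while Avro allows nested records, arrays, maps, and unions, so any general converter has to flatten or stringify. A hypothetical helper (not anyone's actual code) might look like:

```java
// Hypothetical illustration of the compromise: primitives pass through,
// everything structured falls back to toString(), losing nesting.
import java.util.HashMap;
import java.util.Map;

import org.apache.avro.Schema;
import org.apache.avro.generic.GenericRecord;

public class AvroFlattener
{
  public static Map<String, Object> flatten(GenericRecord record)
  {
    Map<String, Object> event = new HashMap<>();
    for (Schema.Field field : record.getSchema().getFields()) {
      Object value = record.get(field.name());
      if (value == null) {
        continue;
      }
      // Records, arrays, maps, unions, fixed, bytes: no natural flat
      // representation, so stringify (this is the compromise).
      event.put(field.name(),
          value instanceof CharSequence || value instanceof Number
              ? value
              : value.toString());
    }
    return event;
  }
}
```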

@zhaown (Contributor) commented Oct 22, 2015

I'm using Avro with Druid in production. For batch indexing it's not complicated, building on @himanshug's #1472, and I'm using my own AvroValueInputFormat, which is the mirror of AvroKeyInputFormat.
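
For anyone curious what the mirror looks like: a hedged sketch, modeled on Avro's real AvroKeyInputFormat/AvroKeyRecordReader but with the key and value sides swapped. The class names mirror the Avro ones, but this is illustrative rather than the exact code:

```java
// Sketch of the "mirror": AvroKeyInputFormat yields (AvroKey<T>, NullWritable);
// this variant yields (NullWritable, AvroValue<T>) instead.
import java.io.IOException;

import org.apache.avro.Schema;
import org.apache.avro.mapred.AvroValue;
import org.apache.avro.mapreduce.AvroJob;
import org.apache.avro.mapreduce.AvroRecordReaderBase;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.mapreduce.InputSplit;
import org.apache.hadoop.mapreduce.RecordReader;
import org.apache.hadoop.mapreduce.TaskAttemptContext;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

public class AvroValueInputFormat<T> extends FileInputFormat<NullWritable, AvroValue<T>>
{
  @Override
  public RecordReader<NullWritable, AvroValue<T>> createRecordReader(
      InputSplit split, TaskAttemptContext context) throws IOException, InterruptedException
  {
    // Reuse the key-schema slot in the job config for the reader schema.
    Schema readerSchema = AvroJob.getInputKeySchema(context.getConfiguration());
    return new AvroValueRecordReader<>(readerSchema);
  }

  private static class AvroValueRecordReader<T>
      extends AvroRecordReaderBase<NullWritable, AvroValue<T>, T>
  {
    private final AvroValue<T> currentValue = new AvroValue<>(null);

    AvroValueRecordReader(Schema readerSchema)
    {
      super(readerSchema);
    }

    @Override
    public boolean nextKeyValue() throws IOException, InterruptedException
    {
      boolean hasNext = super.nextKeyValue();
      currentValue.datum(getCurrentRecord());
      return hasNext;
    }

    @Override
    public NullWritable getCurrentKey()
    {
      return NullWritable.get();
    }

    @Override
    public AvroValue<T> getCurrentValue()
    {
      return currentValue;
    }
  }
}
```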

But for realtime indexing it's a bit more cumbersome, because you need a schema to deserialize an Avro object from the binary stream, and you don't want to send the schema with every serialized record to Kafka. So you need a schema registry; currently we are using schemarepo and the Camus schema registry client, the latter of which is not in Maven Central...
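
The registry-based decode path looks roughly like the sketch below. The SchemaRegistry interface is a hypothetical stand-in (the real schemarepo and Camus client APIs differ in detail), and the 4-byte schema-id wire format is an assumption:

```java
// Hedged sketch: each Kafka message carries a small schema id instead of
// the full schema; the id is resolved against a registry to get the
// writer schema, and Avro resolves writer -> reader during the read.
import java.io.IOException;
import java.nio.ByteBuffer;

import org.apache.avro.Schema;
import org.apache.avro.generic.GenericDatumReader;
import org.apache.avro.generic.GenericRecord;
import org.apache.avro.io.BinaryDecoder;
import org.apache.avro.io.DecoderFactory;

public class RegistryAwareDecoder
{
  public interface SchemaRegistry // hypothetical stand-in for the registry client
  {
    Schema getById(int schemaId);
  }

  private final SchemaRegistry registry;
  private final Schema readerSchema;

  public RegistryAwareDecoder(SchemaRegistry registry, Schema readerSchema)
  {
    this.registry = registry;
    this.readerSchema = readerSchema;
  }

  public GenericRecord decode(ByteBuffer message) throws IOException
  {
    // Assumed wire format: a 4-byte schema id followed by the Avro binary body.
    int schemaId = message.getInt();
    Schema writerSchema = registry.getById(schemaId);

    byte[] body = new byte[message.remaining()];
    message.get(body);
    BinaryDecoder decoder = DecoderFactory.get().binaryDecoder(body, null);
    return new GenericDatumReader<GenericRecord>(writerSchema, readerSchema)
        .read(null, decoder);
  }
}
```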

I'll try to clean up my code and submit a PR for this over the weekend if I get some time.

@zhaown (Contributor) commented Oct 25, 2015

Please check #1858
