Ability to skip Incremental Index during query using query context #1957

nishantmonu51 · 2015-11-12T01:39:41Z

This PR adds the ability to skip the incremental index when querying results from realtime nodes.
The default behaviour is to include the incrementalIndex in queries.

fjy · 2015-11-12T01:43:27Z

can you explain the logic behind this PR?

drcrallen · 2015-11-12T01:46:15Z

I'm guessing it's because this context setting causes the query to only hit persisted segments, which should be faster. If you don't care about absolute up-to-the-second data, you can skip hitting the incremental index, which might be heavily loaded due to ingestion.

@nishantmonu51 can you get some performance metrics around this?

nishantmonu51 · 2015-11-12T02:02:33Z

@fjy, @drcrallen, yeah, that's correct. The idea behind this is to help with query performance when you don't need data up to the latest second and can tolerate a delay of up to the intermediatePersistPeriod.
The general idea of how it will help is:

The already persisted segments are faster to query than the IncrementalIndex.
With caching of results for persisted segments (Enable caching on intermediate realtime persists #1943) and this change, queries can be tuned to always be cached.
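
To make the trade-off above concrete, here is a minimal sketch of the selection logic, not the code from this PR. `SinkPart` and `partsToQuery` are hypothetical names introduced for illustration, and the context key string `"skipIncrementalSegment"` is an assumption, since the diff only shows the constant `SKIP_INCREMENTAL_SEGMENT`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for one queryable piece of a realtime sink.
final class SinkPart {
    final String name;
    final boolean persisted; // false means the live in-memory incremental index

    SinkPart(String name, boolean persisted) {
        this.name = name;
        this.persisted = persisted;
    }
}

final class SkipIncrementalSketch {
    // Assumed key string; only the constant name appears in the diff.
    static final String SKIP_INCREMENTAL_SEGMENT = "skipIncrementalSegment";

    // Returns the names of the parts a query would touch.
    // By default every part is queried; with the flag set to true,
    // the in-memory part is left out and only persisted parts remain.
    static List<String> partsToQuery(List<SinkPart> parts, Map<String, Object> context) {
        boolean skip = Boolean.TRUE.equals(context.get(SKIP_INCREMENTAL_SEGMENT));
        List<String> selected = new ArrayList<>();
        for (SinkPart part : parts) {
            if (skip && !part.persisted) {
                continue; // results may lag by up to one intermediate persist
            }
            selected.add(part.name);
        }
        return selected;
    }
}
```

With the flag set, a query only ever sees persisted data, which is what would let cached results for persisted segments cover the whole query.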

nishantmonu51 · 2015-11-12T02:06:04Z

@drcrallen will look into getting some performance numbers.

xvrl · 2015-11-12T21:31:16Z

@fjy tldr; it's the difference between real-time and real²time :)

drcrallen · 2015-11-16T19:03:54Z

server/src/main/java/io/druid/segment/realtime/plumber/RealtimePlumber.java

@@ -291,6 +294,7 @@ private Sink getSink(long timestamp)

                        // The realtime plumber always uses SingleElementPartitionChunk
                        final Sink theSink = holder.getObject().getChunk(0).getObject();
+                        final boolean skipIncrementalSegment = query.getContextValue(SKIP_INCREMENTAL_SEGMENT, false);


I think this will fail with a ClassCastException if the value is a String instead of a boolean.

himanshug · 2015-11-17

Also, it might be useful to get it once at the top and reuse it instead of getting it for each sink.

nishantmonu51 · 2015-11-18

I think we only support Strings for backwards compatibility. Do we have to support Strings for new properties?
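
The review points above (a String value causing a ClassCastException, and reading the flag once instead of per sink) can be illustrated with a small sketch. It is not the change that was merged; `ContextFlags` and `getBoolean` are hypothetical names, and the context is modelled as a plain `Map<String, Object>` rather than Druid's query object.

```java
import java.util.Map;

// Hypothetical helper: reads a boolean flag from a query context map,
// accepting either a Boolean or a String such as "true".
final class ContextFlags {
    static boolean getBoolean(Map<String, Object> context, String key, boolean defaultValue) {
        Object value = context == null ? null : context.get(key);
        if (value == null) {
            return defaultValue;
        }
        if (value instanceof Boolean) {
            return (Boolean) value;
        }
        if (value instanceof String) {
            // Boolean.parseBoolean is true only for "true" (case-insensitive).
            return Boolean.parseBoolean((String) value);
        }
        throw new IllegalArgumentException(
            "Expected boolean or String for [" + key + "], got " + value.getClass().getName());
    }
}
```

A caller would read the flag once before iterating, for example `boolean skip = ContextFlags.getBoolean(context, "skipIncrementalSegment", false);`, and reuse `skip` for every sink. Whether new properties should accept Strings at all is the open question raised in the comment above; the sketch only shows what lenient parsing would look like.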

himanshug · 2015-11-18T23:06:09Z

can you document the new attribute?

fjy · 2015-11-18T23:07:38Z

@nishantmonu51 @xvrl I don't really understand the purpose of this PR. Is it so you guys can avoid querying the in-memory component of Druid's realtime? Why not look into why the in-memory component is slow?

gianm · 2015-11-20T18:31:00Z

👍

xvrl · 2015-11-24T01:30:37Z

👍 It's helpful to have for investigating performance problems. It probably does not need to be documented, since its behavior depends on real-time ingestion implementation details. If it proves very useful, we can promote it.

nishantmonu51 added the Discuss label Nov 12, 2015

xvrl mentioned this pull request Nov 12, 2015

Backport a couple more changes metamx/druid#11

Merged

drcrallen reviewed Nov 16, 2015

Ability to skip Incremental Index during query using query context

60f649d

nishantmonu51 force-pushed the skip-incremental-segment branch from 0d15dae to 60f649d on November 18, 2015 19:00

xvrl removed the Discuss label Nov 24, 2015

gianm added a commit that referenced this pull request Nov 24, 2015

Merge pull request #1957 from metamx/skip-incremental-segment

13af260

Ability to skip Incremental Index during query using query context

gianm merged commit 13af260 into apache:master Nov 24, 2015

nishantmonu51 mentioned this pull request Dec 1, 2015

0.8.3 backports #2022

Merged

gianm added this to the 0.8.3 milestone Dec 1, 2015

gianm mentioned this pull request Dec 4, 2015

druid-0.8.3 release notes #2044

Closed

seoeun25 pushed a commit to seoeun25/incubator-druid that referenced this pull request Jan 10, 2020

apache#1957 Support registering UDFs in Druid

e3209a9
