-
Notifications
You must be signed in to change notification settings - Fork 3.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Draft] 0.21.0 Release Notes #10752
Comments
Hi @jihoonson , should #10363 be mentioned in the release note since it has changed the user-facing configurations ? |
@FrankChen021 hmm, sure. It seems worth calling it out. I added it under the bug fixes section. |
hey @jihoonson , I am very excited about I will address all of the above including introducing the new build in a new PR, those updates would make this feature a lot more robust and production ready. Thanks to @clintropolis for introducing #10680 which really stresses the overlord/coordinator startup, discovery, leader election machinery to help discover above improvements. that said, I think, 0.21.0 should continue to go on its schedule. For that reason, I think we should remove the mention of this new feature in 0.21.0 release notes and we can document its availability in next release. |
@himanshug thanks for taking a look. I understand it's not matured enough to use in production yet, but would like to keep it because it's one of the most exciting new features that 0.21.0 introduces and worth drawing people's attention. From my experience, most people don't want to test new features out in their production right away when they are first introduced. Rather, they would like to wait for a couple of releases until those features become matured and stable. In this case, as it is stated as experimental, people would be especially careful before trying it out. Later, once it becomes production ready, we can promote it again in the release notes by stating like "this cool feature is now production ready. everyone should try it out". If we didn't mention it in this release, but in some future release when it becomes more stable, I think that people could still underestimate its maturity because it would be a feature that is promoted in the release notes first time. What do you think? |
sure, you are the release manager :) anyways, now that the conversation is here in this issue, power users would likely note the upcoming improvements. however, by that time new PR would be done and merged for them to cherry pick that specific commit. |
Thanks. That sounds good 🙂 |
Hi @jihoonson this link https://druid.apache.org/docs/latest/development/extensions-core/kubernetes.html seems like outdated with 404 error. Is there a new site to refer about k8s extension? |
Hi @Gabriel39, the kubernetes extension is added in 0.21.0 and not available in the latest release (0.20.1) yet. The doc will be updated once 0.21.0 is released. |
Hello, |
Hi @stroeovidiu, the vote for 0.21.0 release is in progress. It will pass in a couple of days unless we find some issue in the release. |
Hello,
Thank you :) |
Hi @jp707049, no worries and sorry for the delay. The 0.21.0 release vote has just passed! We will announce the release tomorrow. When you try the k8s extension, please keep it mind that the extension is experimental and so you might be able to see some unexpected bugs. |
@jihoonson do you plan to update the release tag with the release notes and close this? |
Apologize for the delay. I'm finalizing the release process now. |
I just announced the 0.21.0 release. Thanks everyone for your patience! |
Apache Druid 0.21.0 contains around 120 new features, bug fixes, performance enhancements, documentation improvements, and additional test coverage from 36 contributors. Refer to the complete list of changes and everything tagged to the milestone for further details.
# New features
# Operation
# Service discovery and leader election based on Kubernetes
The new Kubernetes extension supports service discovery and leader election based on Kubernetes. This extension works in conjunction with the HTTP-based server view (
druid.serverview.type=http
) and task management (druid.indexer.runner.type=httpRemote
) to allow you to run a Druid cluster with zero ZooKeeper dependencies. This extension is still experimental. See Kubernetes extension for more details.#10544
#9507
#10537
# New dynamic coordinator configuration to limit the number of segments when finding a candidate segment for segment balancing
You can set the
percentOfSegmentsToConsiderPerMove
to limit the number of segments considered when picking a candidate segment to move. The candidates are searched up tomaxSegmentsToMove * 2
times. This new configuration prevents Druid from iterating through all available segments to speed up the segment balancing process, especially if you have lots of available segments in your cluster. See Coordinator dynamic configuration for more details.#10284
#
status
andselfDiscovered
endpoints for IndexersThe Indexer now supports
status
andselfDiscovered
endpoints. See Processor information APIs for details.#10679
# Querying
# New
grouping
aggregator functionYou can use the new
grouping
aggregator SQL function withGROUPING SETS
orCUBE
to indicate which grouping dimensions are included in the current grouping set. See Aggregation functions for more details.#10518
# Improved missing argument handling in expressions and functions
Expression processing now can be vectorized when inputs are missing. For example a non-existent column. When an argument is missing in an expression, Druid can now infer the proper type of result based on non-null arguments. For instance, for
longColumn + nonExistentColumn
,nonExistentColumn
is treated as(long) 0
instead of(double) 0.0
. Finally, in default null handling mode, math functions can produce output properly by treating missing arguments as zeros.#10499
# Allow zero period for
TIMESTAMPADD
TIMESTAMPADD
function now allows zero period. This functionality is required for some BI tools such as Tableau.#10550
# Ingestion
# Native parallel ingestion no longer requires explicit intervals
Parallel task no longer requires you to set explicit intervals in
granularitySpec
. If intervals are missing, the parallel task executes an extra step for input sampling which collects the intervals to index.#10592
#10647
# Old Kafka version support
Druid now supports Apache Kafka older than 0.11. To read from an old version of Kafka, set the
isolation.level
toread_uncommitted
inconsumerProperties
. Only 0.10.2.1 have been tested up until this release. See Kafka supervisor configurations for details.#10551
Multi-phase segment merge for native batch ingestion
A new tuningConfig,
maxColumnsToMerge
, controls how many segments can be merged at the same time in the task. This configuration can be useful to avoid high memory pressure during the merge. See tuningConfig for native batch ingestion for more details.#10689
# Native re-ingestion is less memory intensive
Parallel tasks now sort segments by ID before assigning them to subtasks. This sorting minimizes the number of time chunks for each subtask to handle. As a result, each subtask is expected to use less memory, especially when a single Parallel task is issued to re-ingest segments covering a long time period.
#10646
# Web console
# Updated and improved web console styles
The new web console styles make better use of the Druid brand colors and standardize paddings and margins throughout. The icon and background colors are now derived from the Druid logo.
#10515
# Partitioning information is available in the web console
The web console now shows datasource partitioning information on the new
Segment granularity
andPartitioning
columns.Segment granularity
column in theDatasources
tabPartitioning
column in theSegments
tab#10533
# The column order in the
Schema
table matches thedimensionsSpec
The
Schema
table now reflects the dimension ordering in thedimensionsSpec
.#10588
# Metrics
# Coordinator duty runtime metrics
The coordinator performs several 'duty' tasks. For example segment balancing, loading new segments, etc. Now there are two new metrics to help you analyze how fast the Coordinator is executing these duties.
coordinator/time
: the time for an individual duty to executecoordinator/global/time
: the time for the whole duties runnable to execute#10603
# Query timeout metric
A new metric provides the number of timed out queries. Previously timed out queries were treated as interrupted and included in the
query/interrupted/count
(see Changed HTTP status codes for query errors for more details).query/timeout/count
: the number of timed out queries during the emission period#10567
# Shuffle metrics for batch ingestion
Two new metrics provide shuffle statistics for MiddleManagers and Indexers. These metrics have the
supervisorTaskId
as their dimension.ingest/shuffle/bytes
: number of bytes shuffled per emission periodingest/shuffle/requests
: number of shuffle requests per emission periodTo enable the shuffle metrics, add
org.apache.druid.indexing.worker.shuffle.ShuffleMonitor
indruid.monitoring.monitors
. See Shuffle metrics for more details.#10359
# New clock-drift safe metrics monitor scheduler
The default metrics monitor scheduler is implemented based on
ScheduledThreadPoolExecutor
which is prone to unbounded clock drift. A new monitor scheduler,ClockDriftSafeMonitorScheduler
, overcomes this limitation. To use the new scheduler, setdruid.monitoring.schedulerClassName
toorg.apache.druid.java.util.metrics.ClockDriftSafeMonitorScheduler
in the runtime.properties file.#10448
#10732
# Others
# New extension for a password provider based on AWS RDS token
A new
PasswordProvider
type allows access to AWS RDS DB instances using temporary AWS tokens. This extension can be useful when an RDS is used as Druid's metadata store. See AWS RDS extension for more details.#9518
# The
sys.servers
table shows leadersA new long-typed column
is_leader
in thesys.servers
table indicates whether or not the server is the leader.#10680
#
druid-influxdb-emitter
extension supports the HTTPS protocolSee Influxdb emitter extension for new configurations.
#9938
# Docker
# Small docker image
The docker image size is reduced by half by eliminating unnecessary duplication.
#10506
# Development
# Extensible Kafka consumer properties via a new
DynamicConfigProvider
A new class
DynamicConfigProvider
enables fetching consumer properties at runtime. For instance, you can useDynamicConfigProvider
fetchbootstrap.servers
from location such as a local environment variable if it is not static. Currently, only a map-based config provider is supported by default. See DynamicConfigProvider for how to implement a custom config provider.#10309
# Bug fixes
Druid 0.21.0 contains 30 bug fixes, you can see the complete list here.
# Post-aggregator computation with subtotals
Before 0.21.0, the query fails with an error when you use post aggregators with sub-totals. Now this bug is fixed and you can use post aggregators with subtotals.
#10653
# Indexers announce themselves as segment servers
In 0.19.0 and 0.20.0, Indexers could not process queries against streaming data as they did not announce themselves as segment servers. They are fixed to announce themselves properly in 0.21.0.
#10631
# Validity check for segment files in historicals
Historicals now perform validity check after they download segment files and re-download automatically if those files are crashed.
#10650
#
StorageLocationSelectorStrategy
injection failure is fixedThe injection failure while reading the configurations of
StorageLocationSelectorStrategy
is fixed.#10363
# Upgrading to 0.21.0
Consider the following changes and updates when upgrading from Druid 0.20.0 to 0.21.0. If you're updating from an earlier version than 0.20.0, see the release notes of the relevant intermediate versions.
# Improved HTTP status codes for query errors
Before this release, Druid returned the "internal error (500)" for most of the query errors. Now Druid returns different error codes based on their cause. The following table lists the errors and their corresponding codes that has changed:
There is also a new query metric for query timeout errors. See New query timeout metric for more details.
#10464
#10746
# Query interrupted metric
query/interrupted/count
no longer counts the queries that timed out. These queries are counted byquery/timeout/count
.#
context
dimension in query metricscontext
is now a default dimension emitted for all query metrics.context
is a JSON-formatted string containing the query context for the query that the emitted metric refers to. The addition of a dimension that was not previously alters some metrics emitted by Druid. You should plan to handle this newcontext
dimension in your metrics pipeline. Since the dimension is a JSON-formatted string, a common solution is to parse the dimension and either flatten it or extract the bits you want and discard the full JSON-formatted string blob.#10578
# Deprecated support for Apache ZooKeeper 3.4
As ZooKeeper 3.4 has been end-of-life for a while, support for ZooKeeper 3.4 is deprecated in 0.21.0 and will be removed in the near future.
#10780
# Consistent serialization format and column naming convention for the
sys.segments
tableAll columns in the
sys.segments
table are now serialized in the JSON format to make them consistent with other system tables. Column names now use the same "snake case" convention.#10481
# Known issues
# Known security vulnerability in the Thrift library
The Thrift extension can be useful for ingesting files of the Thrift format into Druid. However, there is a known security vulnerability in the version of the Thrift library that Druid uses. The vulerability can be exploitable by ingesting maliciously crafted Thrift files when you use Indexers. We recommend granting the
DATASOURCE WRITE
permission to only trusted users.For a full list of open issues, please see Bug .
# Credits
Thanks to everyone who contributed to this release!
@a2l007
@abhishekagarwal87
@asdf2014
@AshishKapoor
@awelsh93
@ayushkul2910
@bananaaggle
@capistrant
@ccaominh
@clintropolis
@cloventt
@FrankChen021
@gianm
@harinirajendran
@himanshug
@jihoonson
@jon-wei
@kroeders
@liran-funaro
@martin-g
@maytasm
@mghosh4
@michaelschiff
@nishantmonu51
@pcarrier
@QingdongZeng3
@sthetland
@suneet-s
@tdt17
@techdocsmith
@valdemar-giosg
@viatcheslavmogilevsky
@viongpanzi
@vogievetsky
@xvrl
@zhangyue19921010
The text was updated successfully, but these errors were encountered: