- [SPM] Due to a breaking change in OpenTelemetry's prometheus exporter (details)
metric names will no longer be normalized by default, meaning that the expected metric names would be
calls
andduration_*
. Backwards compatibility with older OpenTelemetry Collector versions can be achieved through the following flags:prometheus.query.normalize-calls
: If true, normalizes the "calls" metric name. e.g. "calls_total".prometheus.query.normalize-duration
: If true, normalizes the "duration" metric name to include the duration units. e.g. "duration_milliseconds_bucket".
OTLP receiver is now enabled by default (#4494). This change follows the Jaeger's strategic direction of aligning closely with the OpenTelemetry project. This may cause port conflicts if jaeger-collector
is depoyed in host network namespace. The original --collector.otlp.enabled
option is still available and MUST be set to false
if OTLP receiver is not desired.
- Make OTLP receiver enabled by default (@yurishkuro in #4494)
- [SPM] Add support for OpenTelemetry SpanMetrics Connector (@albertteoh in #4452). See Migration README.
- Log processor error in Kafka consumer (@pavolloffay in #4399)
- [bug] Remove TerminateAfter from Elasticsearch/Opensearch query resulting in incomplete span count/list (@Jakob3xD in #4336)
- [agent] Use RawConn.Control to get fd instead of Fd() to prevent deadlock on shutdown (@ChenX1993 in #4449)
- [SPM] Fix docker compose command (@tqi-raurora in #4444)
- [test] Fix flaky test - TestSpanProcessorWithOnDroppedSpanOption (@yurishkuro in #4489)
- [ci] Skip debug builds when not making a release (@psk001 in #4496)
- Fix some function comments (@cuishuang in #4410)
- Increase dependabot open-pull-requests-limit=10 (@yurishkuro in 04548fc)
- Add jkowall as release manger for July (@jkowall in #4446)
- Fix versions in release schedule (@yurishkuro in 8a9d13a)
- UI pinned to version 1.30.0.
- Add HTTP post port mapping to docker command (@albertteoh in #4407)
- Simplify ES config and factory (@yurishkuro in #4396)
- Add otlp-grpc for tracegen's trace-exporter (@boysusu in #4374)
- Allow follows-from reference as a parent span id (@kubarydz in #4382)
- Expose drop span hook as an option in Collector SpanProcessor (@ChenX1993 in #4387)
- UI pinned to version 1.29.1.
- Store span warnings as tags in Cassandra (@utsavoza in #4313)
- Add Keep-Alive flag for Zipkin HTTP server (@topjung3 in #4366)
- Log access to static assets; remove favicon test (@yurishkuro in #4302)
- UI pinned to version 1.29.0.
- Support secure OTLP exporter config for hotrod (@graphaelli in #4231)
- Jaeger YugabyteDB(YCQL) support (@HarshDaryani896 in #4220)
- Replace pkg/multierror with standard errors.Join (@ClementRepo in #4293)
- Remove pkg/multicloser (@yurishkuro in #4291)
- Refactor build linux artifacts only for PR (@Eileen-Yu in #4286)
- Speed-up CI by using published UI artifacts (@shubbham1219 in #4251)
- Update Go version to 1.20 (@SaarthakMaini in #4206)
- Use http.MethodGet instead of "GET" (@my-git9 in #4248)
- Updating all-in-one path (@bigfleet in #4234)
- Migrate the use of fsnotify to fswatcher in cert_watcher.go (@haanhvu in #4232)
- Restore baggage support in HotROD 🚗 (@yurishkuro in #4225)
- UI pinned to version 1.28.1.
- HotROD 🚗 application is switched from Jaeger SDK to OpenTelemetry SDK (@yurishkuro in #4187). Some environment variables previously accepted are no longer recognized. See PR for details.
- Map old env vars from Jaeger SDK to OTel SDK vars (@yurishkuro in #4200)
- Use patched version of github.com/opentracing-contrib/go-grpc in HotROD (@yurishkuro in #4210)
tracegen
utility is switched from Jaeger SDK to OpenTelemetry SDK (@yurishkuro in #4189). Some environment variables previously accepted are no longer recognized. See PR for details.
- Add CLI flags for controlling HTTP server timeouts (@yurishkuro in #4167)
- Watch directories for certificate hot-reload (@tsaarni in #4159)
- Support tenant header propagation in query service and grpc-plugin (@pavolloffay in #4151)
- Replace Thrift-gen with Proto-gen types for sampling strategies (@yurishkuro in #4181)
- Use Protobuf-based JSON output for sampling strategies (@yurishkuro in #4173)
- [tests]: Use
t.Setenv
to set env vars in tests (@Juneezee in #4169) - ci(lint): bump golangci-lint to v1.50.1 (@mmorel-35 in #4195)
- Convert token propagation integration test to plain unit test (@yurishkuro in #4162)
- Refactor scripts/es-integration-test.sh (@yurishkuro in #4161)
- Fix "index out of range" when receiving a dual client/server Zipkin span (@yurishkuro in #4160)
- No changes.
- Remove global platform arg in cassandra schema dockerfile (@jkandasa in #4123)
- Add multi arch support to cassandra-schema container (@jkandasa in #4122)
- No changes.
- Fix cassandra schema scripts to be able to run from a remote node (@yurishkuro in #4087)
- Catch panic from Prometheus client on invalid label strings (@yurishkuro in #4051)
- Span tags of type int64 may lose precision (@shubbham1215 in #4034)
- UI pinned to version 1.27.3.
- Add support for OpenSearch 2.x (@gaurav-05 in #3966)
- Pin SBOM action to commit (@yurishkuro in bb49249)
- Remove auth requirement on monitor demo (@joe-elliott in #4005)
- Increase sleep time when waiting for ES/OS backend (@yurishkuro in b9805f7)
- Fix CVE-2022-32149 for gotlang.org/x/text package (@mehta-ankit in #3992)
- Expose otel configured thrift http port (@albertteoh in #3986)
- Adding anchore for SBOM signing during release (@jkowall in #3987)
- Bump sarama to 1.33.0 (@pavolloffay in #3983)
- Add note on jaeger grpc storage compliance (@arajkumar in #3985)
- Added link to FOSSA and Artifact Hub to README (@jkowall in #3964)
- Add grafana container to monitor docker-compose (@albertteoh in #3955)
- Expose storage integration helpers as go pkg (@arajkumar in #3944)
- UI pinned to version 1.27.2.
- Bump github.com/open-telemetry/opentelemetry-collector-contrib/pkg/translator/jaeger (@dependabot[bot] in #3939)
- Bump github.com/apache/thrift from 0.16.0 to 0.17.0 (@dependabot[bot] in #3936)
- Bump github.com/hashicorp/go-hclog from 1.2.2 to 1.3.1 (@dependabot[bot] in #3934)
- Bump go.opentelemetry.io/otel from 1.9.0 to 1.10.0 (@dependabot[bot] in #3932)
- Bump github.com/hashicorp/go-plugin from 1.4.4 to 1.4.5 (@dependabot[bot] in #3930)
- Bump github.com/spf13/viper from 1.12.0 to 1.13.0 (@dependabot[bot] in #3931)
- Bump OTEL dependencies => v0.60.0 and grpc => v1.49.0 (@yurishkuro in #3928)
- Bump golang.org/x/net to 2022-09-26 (@yurishkuro in #3927)
- Bump codecov/codecov-action from 3.1.0 to 3.1.1 (@dependabot[bot] in #3917)
- UI pinned to version 1.27.1 to bump dependencies.
- fix: jaeger-agent sampling endpoint returns backwards incompatible JSON (@vprithvi in #3897)
- fix: streaming span writer is not working in grpc based remote storage plugin (@arajkumar in #3887)
- Fix race condition when adding collector tags (@yurishkuro in #3886)
- Change build info date to commit timestamp (@TripleDogDare in #3876)
- Add 🚗 (@yurishkuro in 55a8ca9)
- Add AdditionalDialOptions to ConnBuilder (@vprithvi in #3865)
- Add sample docker-compose configuration using Kafka (@yurishkuro in 7006e9f)
- UI pinned to version 1.27.0 - see the changelog.
- Add remote-storage service (@yurishkuro in #3836)
- Fix ingester panic when span.process=nil (@locmai in #3819)
- Added windows zip file generation (@adhithyasrinivasan in #3817)
- Refactor gRPC storage plugin for better composability (@yurishkuro in #3833)
- UI pinned to version 1.26.0 - see the changelog.
- Add flag to enable span size metrics reporting (@ymtdzzz in #3782)
- Span size metrics are enabled via the
--collector.enable-span-size-metrics
flag (even if--collector.queue-size-memory
is disabled).
- Span size metrics are enabled via the
- Add multi-tenancy support (@esnible in #3688)
- Enabled when
--multi_tenancy.enabled=true
is passed to the collector. - The header carrying the tenants can be specified via the
--multi_tenancy.header
flag, which defaults tox-tenant
. - The list of allowed tenants can be set via the
--multi_tenancy.tenants
flag, which defaults to an unrestricted list of tenants.
- Enabled when
- Introduce ParentReference adjuster (@bobrik in #3786)
- Ignore the problem of self-reported spans when multi-tenant enabled (@esnible in #3787)
- Copy expvar metrics implementation from jaeger-lib (@yurishkuro in #3772)
- Copy Prometheus metrics implementation from jaeger-lib (@yurishkuro in #3771)
- Copy metrics API from jaeger-lib (@yurishkuro in #3767)
- Use file move instead of overwriting content (@yurishkuro in #3726)
- Refactor tenancy checking from gRPC to gRPC batch consumer (@esnible in #3718)
- Fix ineffectual
--skip-dependencies
flag in es-rollover (@frzifus in #3724) - Fix custom gogo codec to allow OTLP data (@yurishkuro in #3719)
- UI pinned to version 1.25.0 - see the changelog.
- Fix custom gogo codec to allow OTLP data (@yurishkuro in #3719)
- Introduce OTLP receiver configuration flags (@yurishkuro in #3710)
- Define Health Server for GRPC servers (@mmorel-35 in #3712)
- Add OTLP receiver to collector (@yurishkuro in #3701)
- Add flag to enable/disable dependencies on rollover (@rubenvp8510 in #3705)
- Add TLS configuration for Admin Server (@mmorel-35 in #3679)
- Add TLS configuration for Zipkin (@mmorel-35 in #3676)
- Fix for WithTenant() lost orig context (@esnible in #3685)
- Add entries to env command for new storage types (@yurishkuro in #3678)
- Fix Prometheus factory signature (@yurishkuro in #3681)
- UI pinned to version 1.24.0 - see the changelog.
- Add kubernetes example for hotrod app (@highb in #3645)
- Support writing via gRPC stream in storage plugin (@vuuihc in #3640)
- Instrument MetricsReader with metrics (@albertteoh in #3667)
- Sanitize spans with null process or empty service name (@yurishkuro in #3631)
- Flow tenant through processors as a string (@esnible in #3661)
- Fix es.log-level behaviour (@albertteoh in #3664)
- Mention remote gRPC storage API (@yurishkuro in cb6853d)
- Perform log.fatal if TLS flags are used when tls.enabled=false #2893 (@clock21am in #3030)
- Upgrade to Go 1.18 (@yurishkuro in #3624)
- UI pinned to version 1.23.0. The changelog is available here v1.23.0.
- Add SAMPLING_STORAGE_TYPE environment variable (@joe-elliott in #3573)
- Support min/max TLS version in TLS config (@Ashmita152 in #3567)
- Add support for ciphersuites in tls config (@Ashmita152 in #3564)
- Fix: exit on grpc plugin crash (@johanneswuerbach in #3604)
- Bump go.opentelemetry.io/collector/model from 0.47.0 to 0.48.0 (@dependabot[bot] in #3608)
- Fix format string for go1.18 (@bobrik in #3596)
- Fix favicon returning 500 inside container (@Ashmita152 in #3569)
- Elasticsearch: Do not create index templates if ILM is enabled. (@rbizos in #3610)
- UI pinned to version 1.22.0. The changelog is available here v1.22.0.
- Enable gRPC reflection service on collector/query (@yurishkuro in #3526)
- Fix query for latency metrics (@Ashmita152 in #3559)
- Fix integration tests containing spans in the future (@johanneswuerbach in #3538)
- Add system diagram using mermaid markdown (@yurishkuro in #3529)
- Fix indexDateLayout for elasticsearch dependencies #3523 (@ilyamor in #3524)
- Fix builds due to upstream OTEL proto path change (@albertteoh in #3525)
- UI pinned to version 1.21.0. The changelog is available here v1.21.0.
- Bump Go compiler in CI to 1.17.6 (@yurishkuro in #3516)
- Add support for ES index aliases / rollover to the dependency store (Resolves #2143) (@frittentheke in #2144)
- Use existing functions from xdg-go/scram pkg (@yurishkuro in #3481)
- UI pinned to version 1.20.1. The changelog is available here v1.20.1.
- Add remote gRPC option for storage plugin (@cevian in #3383)
- Build binaries for darwin/arm64 (@jhchabran in #3431)
- Add MaxConnectionAge[Grace] to collector's gRPC server (@jpkrohling in #3422)
- Fix prefixed index rollover (@albertteoh in #3457)
- Log problems communicating with Elasticsearch (@esnible in #3451)
- UI pinned to version 1.20.0. The changelog is available here v1.20.0
- Remove deprecated
--badger.truncate
CLI flag (@yurishkuro in #3410)
- Fix debug image builds by installing
build-base
to enable GCC (@yurishkuro in #3400) - Limit URL size in Elasticsearch index delete request (@jkandasa in #3375)
- UI pinned to version 1.19.0. The changelog is available here v1.19.0
- Add auth token propagation for metrics reader (@albertteoh in #3341)
- Add in-memory storage support for adaptive sampling (@lonewolf3739 in #3335)
- Do not throw error on empty indices in Elasticsach rollover lookback (@jkandasa in #3369)
- Treat input throughput data as immutable (@rbroggi in #3360)
- Remove dependencies on unused tools, install tools explicitly instead of via go.mod (@rbroggi in #3355)
- Update mockery to version 2 and adapt to
install-tools
approach (@rbroggi in #3358) - Control lightweight storage integration tests via build tags (@rbroggi in #3346)
- Remove package
integration
from coverage reports (@rbroggi in #3357) - Remove outdated reference to cover.sh (@rbroggi in #3348)
- Update monitoring mixin (@jpkrohling in #3331)
- Update Jaeger chart link (@isbang in #3328)
- Fix args order in strings.Contains in es-rollover (@pavolloffay in #3324)
- Use
(TB).TempDir
instead of non-portable/mnt/*
in Badger (@pavolloffay in #3325) - Fix
peer.service
retrieval from Zipkin'sMESSAGE_ADDR
annotation (@Git-Jiro in #3312)
- UI pinned to version 1.18.0. The changelog is available here v1.18.0
- Migrate elasticsearch rollover to go (#3242, @rubenvp8510)
- Add 'opensearch' as a supported value for SPAN_STORAGE_TYPE (#3255, @yurishkuro)
- Add support for adaptive sampling with a Cassandra backend. (#2966, @joe-elliott, @Ashmita152)
- Support graceful shutdown in grpc plugin (#3249, @slon)
- Enable gzip compression for collector grpc endpoint. (#3236, @slon)
- Use UTC in es-index-cleaner (#3261, @pavolloffay)
- Upgrade to alpine-3.14 (#3304, @nontw)
- refactor: move from io/ioutil to io and os package (#3294, @Juneezee)
- Changed sampling type env var and updated collector help text (#3302, @joe-elliott)
- Close #3270: Prevent rollover lookback from passing the Unix epoch (#3299, @ctreatma)
- Fixing otel configuration in docker compose (#3286, @Ashmita152)
- Added ability to pass config file to grpc plugin in integration tests (#3253, @EinKrebs)
- UI pinned to version 1.17.0. The changelog is available here v1.17.0
- Add cassandra v4 support (@Ashmita152 in #3225).
- Note: Users running on existing schema version v3 without issues, need not run an upgrade to v4. The new schema simply updates the table metadata, primarily removing
dclocal_read_repair_chance
deprecated table option, which automatically gets ignored/removed when upgrading to Cassandra 4.0.
- Note: Users running on existing schema version v3 without issues, need not run an upgrade to v4. The new schema simply updates the table metadata, primarily removing
- Add es-index-cleaner golang implementation (@pavolloffay in #3204)
- Add CLI Option for gRPC Max Receive Message Size (@js8080 in #3214 and #3192)
- Automatically detect OpenSearch version (@pavolloffay in #3198)
- Add SendGetBodyAs on elasticsearch (@NatMarchand in #3193)
- Set lookback in ES rollover to distant past (@pavolloffay in #3169)
- Check for invalid --agent.tags (@esnible in #3246)
- Replace old linters with golangci-lint (@albertteoh in #3237)
- Fix panic on empty findTraces query (@akuzni2 in #3232)
- Upgrade to Go v1.17 (@Ashmita152 in #3220)
- Add docker buildx make target (@pavolloffay in #3213)
- Fix the name for elasticsearch integration tests (@Ashmita152 in #3208)
- Upgrade ES images in integration tests (@pavolloffay in #3185)
- UI pinned to version 1.16.0 - https://github.com/jaegertracing/jaeger-ui/releases/tag/v1.16.0
- Add query service with OTLP (#3086, @pavolloffay)
- Add ppc64le support on multiarch docker images (#3160, @krishvoor)
- Fix base path in grpc gateway for api_v3 (#3139, @pavolloffay)
- Add /api prefix for /v3 API (#3178, @pavolloffay)
- Define
http.Server.ErrorLog
to forward logs to Zap (#3157, @yurishkuro) - Add ATM dev environment docker-compose and API doc (#3171, @albertteoh)
- Log the source of sampling strategies (#3166, @yurishkuro)
- Pin elasticsearch-py to older version without elastic.co product check (#3180, @pavolloffay)
- Upgrade Badger from v1.6.2 to v3.2103.0 (#3096, @Ashmita152):
- Deprecated
--badger.truncate
flag. - All badger related expvar prefix has changed from
badger
tobadger_v3
.
- Deprecated
- Add docker images for linux/arm64 (#3124, @GaruGaru)
- Add s390x support on multiarch docker images (#2948, @kun-lu20)
- Add TLS support for Prometheus reader (#3096, @albertteoh)
- Add HTTP handler for metrics querying #3095, @albertteoh)
- Add MetricsQueryService grcp handler #3091, @albertteoh)
- Hook up MetricsQueryService to main funcs (#3079, @albertteoh)
- Add metrics query capability to query service (#3061, @albertteoh)
- Add build info metrics to Jaeger components (#3087, @Ashmita152)
- Upgrade gRPC to 1.38.x (#3096, @pavolloffay)
- Remove unused
--es-archive.max-span-age
flag (#2865, @albertteoh):
- Inject trace context to grpc metadata (#2870, @lujiajing1126)
- Passing default sampling strategies file as environment variable (#3027, @Ashmita152)
- [es] Add index rollover mode that can choose day and hour (#2965, @WalkerWang731)
- Add a TIMEOUT environment variable for es rollover (#2938, @ediezh)
- Allow the ILM policy name to be configurable (#2971, @jrRibeiro)
- [es] Add remote read clusters option for cross-cluster querying (#2874, @dgrizzanti)
- Enable logging in ES client (#2862, @albertteoh)
- Fix jaeger-agent reproducible memory leak (#3050, @jpkrohling)
- Changed Range Query to use startTimeMillis date field instead of startTime field (#2980, @Sreevani871)
- Verify FindTraces() received a query (#2979, @esnible)
- Set Content-Type in healthcheck's http response (#2926, @logeable)
- Add jaeger-query HTTP handler diagnostic logging (#2906, @albertteoh)
- Fix es-archive namespace default values (#2865, @albertteoh)
-
Remove deprecated TLS flags (#2790, @albertteoh):
--cassandra.tls
is replaced by--cassandra.tls.enabled
--cassandra-archive.tls
is replaced by--cassandra-archive.tls.enabled
--collector.grpc.tls
is replaced by--collector.grpc.tls.enabled
--collector.grpc.tls.client.ca
is replaced by--collector.grpc.tls.client-ca
--es.tls
is replaced by--es.tls.enabled
--es-archive.tls
is replaced by--es-archive.tls.enabled
--kafka.consumer.tls
is replaced by--kafka.consumer.tls.enabled
--kafka.producer.tls
is replaced by--kafka.producer.tls.enabled
--reporter.grpc.tls
is replaced by--reporter.grpc.tls.enabled
-
Remove deprecated flags of Query Server
--query.port
and--query.host-port
, please use dedicated HTTP--query.http-server.host-port
(defaults to:16686
) and gRPC--query.grpc-server.host-port
(defaults to:16685
) host-ports flags instead (#2772, @rjs211)- By default, if no flags are set, the query server starts on the dedicated ports. To use common port for gRPC and HTTP endpoints, the host-port flags have to be explicitly set
-
Remove deprecated CLI flags (#2751, @LostLaser):
--collector.http-port
is replaced by--collector.http-server.host-port
--collector.grpc-port
is replaced by--collector.grpc-server.host-port
--collector.zipkin.http-port
is replaced by--collector.zipkin.host-port
-
Remove deprecated flags
--health-check-http-port
&--admin-http-port
, please use--admin.http.host-port
(#2752, @pradeepnnv) -
Remove deprecated flag
--es.max-num-spans
, please use--es.max-doc-count
(#2482, @BernardTolosajr) -
Remove deprecated flag
--jaeger.tags
, please use--agent.tags
(#2753, @yurishkuro) -
Remove deprecated Cassandra flags (#2789, @albertteoh):
--cassandra.enable-dependencies-v2
- Jaeger will automatically detect the version of the dependencies table--cassandra.tls.verify-host
- please use--cassandra.tls.skip-host-verify
instead
-
Remove incorrectly scoped downsample flags from the query service (#2782, @joe-elliott)
--downsampling.hashsalt
removed from jaeger-query--downsampling.ratio
removed from jaeger-query
-
Add TLS Support for gRPC and HTTP endpoints of the Query and Collector servers (#2337, #2772, #2798, @rjs211)
- If TLS in enabled on either or both of gRPC or HTTP endpoints, the gRPC host-port and the HTTP host-port have to be different
- If TLS is disabled on both endpoints, common HTTP and gRPC host-port can be explicitly set using the following host-port flags respectively:
- Query:
--query.http-server.host-port
and--query.grpc-server.host-port
- Collector:
--collector.http-server.host-port
and--collector.grpc-server.host-port
- Query:
-
Add support for Kafka SASL/PLAIN authentication via SCRAM-SHA-256 or SCRAM-SHA-512 mechanism (#2724, @WalkerWang731)
-
[agent] Add metrics to show connections status between agent and collectors (#2657, @WalkerWang731)
-
Add plaintext as a supported kafka auth option (#2721, @pdepaepe)
-
Add ability to use JS file for UI configuration (#123 from jaeger-ui) (#2707, @th3M1ke)
-
Support Elasticsearch ILM for managing jaeger indices (#2796, @bhiravabhatla)
-
Push official images to quay.io, in addition to Docker Hub (#2783, @Ashmita152)
-
Add status command (#2684, @sniperking1234)
- Usage:
$ ./cmd/collector/collector-darwin-amd64 status {"status":"Server available","upSince":"2021-02-19T17:57:12.671902+11:00","uptime":"25.241233383s"}
- Usage:
-
Support configurable date separator for Elasticsearch index names (#2637, @sniperking1234)
- Use workaround for windows x509.SystemCertPool() (#2756, @Ashmita152)
- Guard against mal-formed payloads received by the agent, potentially causing high memory utilization (#2780, @jpkrohling)
- Expose cache TTL for ES span writer index+service (#2737, @necrolyte2)
- Copy spans from memory store (#2720, @bobrik)
- [pkg/queue] Add
StartConsumersWithFactory
function (#2714, @mx-psi) - Fix potential cross-site scripting issue (#2697, @yurishkuro)
- Updated gRPC Storage Plugin README with example (#2687, @js8080)
- Deduplicate collector tags (#2658, @Betula-L)
- Add latency metrics on collector HTTP endpoints (#2664, @dimitarvdimitrov)
- Fix collector panic due to sarama sdk (#2654, @Betula-L)
- Handle collector Start error (#2647, @albertteoh)
- [anonymizer] Save trace in UI format (#2629, @yurishkuro)
- UI pinned to version 1.13.0. The changelog is available here v1.13.0
- Add trace anonymizer utility (#2621, #2585, @Ashmita152)
- Add URL option for sampling strategies file (#2519, @goku321)
- Expose tunning options via expvar (#2496, @dstdfx)
- Support more encodings for Kafka in OTel Ingester (#2580, @XSAM)
- Create debug docker images for jaeger backends (#2545, @Ashmita152)
- Display backend & UI versions in Jaeger UI
- Inject version info into index.html (#2547, @yurishkuro)
- Added jaeger ui version to about menu (#606, @alanisaac)
- Update x/text to v0.3.4 (#2625, @objectiser)
- Update CodeQL to latest best practices (#2615, @jhutchings1)
- Bump opentelemetry-collector to v0.14.0 (#2617, @Vemmy124)
- Bump Badger to v1.6.2 (#2613, @Ackar)
- Fix sarama consumer deadlock (#2587, @albertteoh)
- Avoid deadlock if Stop is called before Serve (#2608, @chlunde)
- Return buffers to pool on network errors or queue overflow (#2609, @chlunde)
- Clarify deadlock panic message (#2605, @yurishkuro)
- fix: don't create tags w/ empty name for internal zipkin spans (#2596, @mzahor)
- TBufferedServer: Avoid channel close/send race on Stop (#2583, @chlunde)
- Bumped OpenTelemetry Collector to v0.12.0 (#2562, @jpkrohling)
- Disable Zipkin server if port/address is not configured (#2554, @yurishkuro)
- [hotrod] Add links to traces (#2536, @yurishkuro)
- OTel Cassandra/Elasticsearch Exporter queue defaults (#2533, @joe-elliott)
- [otel] Update jaeger-lib to v2.4.0 (#2538, @dstdfx)
- Remove unnecessary ServiceName index seek if tags query is available (#2535, @burmanm)
- Update static UI assets path in contrib doc (#2548, @albertteoh)
- UI pinned to version 1.12.0. The changelog is available here v1.12.0
-
Configurable ES doc count (#2453, @albertteoh)
The
--es.max-num-spans
flag has been deprecated in favour of--es.max-doc-count
.--es.max-num-spans
is marked for removal in v1.21.0 as indicated in the flag description.If both
--es.max-num-spans
and--es.max-doc-count
are set, the lesser of the two will be used.The use of
--es.max-doc-count
(which defaults to 10,000) will limit the results from all Elasticsearch queries by the configured value, limiting counts for Jaeger UI:- Services
- Operations
- Dependencies (edges in a dependency graph)
- Span fetch size for a trace
-
The default value for the flag
query.max-clock-skew-adjustment
has changed to0s
, meaning that the clock skew adjustment is now disabled by default. See #1459.
- Grpc plugin archive storage support (#2317, @m8rge)
- Separate Ports for GRPC and HTTP requests in Query Server (#2387, @rjs211)
- Configurable ES doc count (#2453, @albertteoh)
- Add storage metrics to OTEL, metrics by span service name (#2431, @pavolloffay)
- Increase coverage on otel/app/defaultconfig and otel/app/defaultcomponents (#2515, @joe-elliott)
- Use OTEL Kafka Exporter/Receiver Instead of Jaeger Core (#2494, @joe-elliott)
- Fix OTEL kafka receiver/ingester panic (#2512, @pavolloffay)
- Disable clock-skew-adjustment by default. (#2513, @jpkrohling)
- Fix ES OTEL status code (#2501, @pavolloffay)
- OTel: Factored out Config Factory (#2495, @joe-elliott)
- Fix failing ServerInUseHostPort test on MacOS (#2477, @albertteoh)
- Fix unmarshalling in OTEL badger (#2488, @pavolloffay)
- Improve UI placeholder message (#2487, @yurishkuro)
- Translate OTEL instrumentation library to ES DB model (#2484, @pavolloffay)
- Add partial retry capability to OTEL ES exporter. (#2456, @pavolloffay)
- Log deprecation warning only when deprecated flags are set (#2479, @pavolloffay)
- Clean-up Badger's trace-not-found check (#2481, @yurishkuro)
- Run the jaeger-agent as a non-root user by default (#2466, @chgl)
- Regenerate certificates to use SANs instead of Common Name (#2461, @albertteoh)
- Support custom port in cassandra schema creation (#2472, @MarianZoll)
- Consolidated OTel ES IndexNameProviders (#2458, @joe-elliott)
- Add positive confirmation that Agent made a connection to Collector (… (#2423, @BernardTolosajr)
- Propagate TraceNotFound error from grpc storage plugins (#2455, @joe-elliott)
- Use new ES reader implementation in OTEL (#2441, @pavolloffay)
- Updated grpc-go to v1.29.1 (#2445, @jpkrohling)
- Remove olivere elastic client from OTEL (#2448, @pavolloffay)
- Use queue retry per exporter (#2444, @pavolloffay)
- Add context.Context to WriteSpan (#2436, @yurishkuro)
- Fix mutex unlock in storage exporters (#2442, @pavolloffay)
- Add Grafana integration example (#2408, @fktkrt)
- Fix TLS flags settings in jaeger OTEL receiver (#2438, @pavolloffay)
- Add context to dependencies endpoint (#2434, @yoave23)
- Fix error equals (#2429, @albertteoh)
- UI pinned to version 1.11.0. The changelog is available here v1.11.0
Upgrade to a working UI version before React refactoring.
Revert UI back to 1.9 due to a bug jaegertracing/jaeger-ui#628
The pull request #2297 aimed to add TLS support for the gRPC Query server but the flag registration is missing, so that this feature can't be used at the moment. A fix is planned for the next Jaeger version (1.20).
- Reload TLS certificates on change (#2389, @pavolloffay)
- Add --kafka.producer.batch-min-messages collector flag (#2371, @prymitive)
- Make UDP socket buffer size configurable (#2336, @kbarukhov)
- Enable batch and queued retry processors by default (#2330, @pavolloffay)
- Add trace anonymizer prototype (#2328, @yurishkuro)
- Add native OTEL ES exporter (#2295, @pavolloffay)
- Define busy error type in processor (#2314, @pavolloffay)
- Use gRPC instead of tchannel in hotrod (#2307, @pavolloffay)
- TLS support for gRPC Query server (#2297, @jan25)
- Check missing server URL in ES OTEL client (#2411, @pavolloffay)
- Fix Elasticsearch version in ES OTEL writer (#2409, @pavolloffay)
- Fix go vet warnings on Go 1.15 (#2401, @prymitive)
- Add new Elasticsearch reader implementation (#2364, @pavolloffay)
- Only add the collector port if it was not explicitly set (#2396, @joe-elliott)
- Fix panic in collector when Zipkin server is shutdown (#2392, @Sreevani871)
- Update validity of TLS certificates to 3650 days (#2390, @rjs211)
- Added span and trace id to hotrod logs (#2384, @joe-elliott)
- Jaeger agent logs "0" whenever sampling strategies are requested (#2382, @jpkrohling)
- Fix shutdown order for collector components (#2381, @Sreevani871)
- Serve jquery-3.1.1.min.js locally (#2378, @chaseSpace)
- Use a single shared set of CA, client & server keys/certs for testing (#2343, @rjs211)
- fix null pointer in jaeger-spark-dependencies (#2327, @moolen)
- Rename ARCH to TARGETARCH for multi platform build by docker buildx (#2320, @morlay)
- Mask passwords when written as json (#2302, @objectiser)
- UI pinned to version 1.10.0. The changelog is available here v1.10.0
- CVE-2020-10750: jaegertracing/jaeger: credentials leaked to container logs (@chlunde)
- Add ppc64le support (#2293, @Siddhesh-Ghadi)
- Expose option to enable TLS when sniffing an Elasticsearch Cluster (#2263, @jennynilsen)
- Enable OTEL receiver by default (#2279, @pavolloffay)
- Add Badger OTEL exporter (#2269, @pavolloffay)
- Add all-in-one OTEL component (#2262, @pavolloffay)
- Add arm64 support for binaries and docker images (#2176, @MrXinWang)
- Add Zipkin OTEL receiver (#2247, @pavolloffay)
- Remove experimental flag from rollover (#2290, @pavolloffay)
- Move ES dependencies index mapping to JSON template file (#2285, @pavolloffay)
- Bump go-plugin to 1.3 (#2261, @yurishkuro)
- Ignore chmod events on UI config watcher. (#2254, @rubenvp8510)
- Normalize CLI flags to use host:port addresses (#2212, @ohdearaugustin)
- Add kafka receiver flags to ingester (#2251, @pavolloffay)
- Configure Jaeger receiver and exporter by flags (#2241, @pavolloffay)
-
Remove Tchannel between agent and collector (#2115, #2112, @pavolloffay)
Remove
Tchannel
port (14267) from collector andTchannel
reporter from agent.These flags were removed from agent:
--collector.host-port --reporter.tchannel.discovery.conn-check-timeout --reporter.tchannel.discovery.min-peers --reporter.tchannel.host-port --reporter.tchannel.report-timeout
These flags were removed from collector:
--collector.port
-
Normalize CLI flags to use host:port addresses (#1827, @annanay25)
Flags previously accepting listen addresses in any other format have been deprecated:
collector.port
is superseded bycollector.tchan-server.host-port
collector.http-port
is superseded bycollector.http-server.host-port
collector.grpc-port
is superseded bycollector.grpc-server.host-port
collector.zipkin.http-port
is superseded bycollector.zipkin.host-port
admin-http-port
is superseded byadmin.http.host-port
- Add grpc storage plugin OTEL exporter (#2229, @pavolloffay)
- Add OTEL ingester component (#2225, @pavolloffay)
- Add Kafka OTEL receiver/ingester (#2221, @pavolloffay)
- Create Jaeger OTEL agent component (#2216, @pavolloffay)
- Merge hardcoded/default configuration with OTEL config file (#2211, @pavolloffay)
- Support periodic refresh of sampling strategies (#2188, @defool)
- Add Elasticsearch OTEL exporter (#2140, @pavolloffay)
- Add Cassandra OTEL exporter (#2139, @pavolloffay)
- Add Kafka OTEL storage exporter (#2135, @pavolloffay)
- Clock skew config (#2119, @joe-elliott)
- Introduce OpenTelemetry collector (#2086, @pavolloffay)
- Support regex tags search for Elasticseach backend (#2049, @annanay25)
- Do not skip service/operation indexing for firehose spans (#2242, @yurishkuro)
- Add build information to OTEL binaries (#2237, @pavolloffay)
- Enable service default sampling param (#2230, @defool)
- Add Jaeger OTEL agent to docker image upload (#2227, @ning2008wisc)
- Support adding process tags in OTEL via env variable (#2220, @pavolloffay)
- Bump OTEL version and update exporters to use new API (#2196, @pavolloffay)
- Support sampling strategies file flag in OTEL collector (#2195, @pavolloffay)
- Add zipkin receiver to OTEL collector (#2181, @pavolloffay)
- Add Dockerfile for OTEL collector and publish latest tag (#2167, @pavolloffay)
- Run OTEL collector without configuration file (#2148, @pavolloffay)
- Update gocql to support AWS MCS (#2133, @johanneswuerbach)
- Return appropriate gRPC errors/codes to indicate request status (#2132, @yurishkuro)
- Remove tchannel port from dockerfile and test (#2118, @pavolloffay)
- Remove tchannel between agent and collector (#2115, @pavolloffay)
- Move all tchannel packages to a single top level package (#2112, @pavolloffay)
- UI pinned to version 1.9.0. The changelog is available here v1.9.0
- Fix enable Kafka TLS when TLS auth is specified #2107, @pavolloffay)
- Migrate project to go modules #2098, @pavolloffay)
- Do not skip service/operation indexing for firehose spans #2090, @yurishkuro)
- Close the span writer on main (#2096, @jpkrohling)
- Improved graceful shutdown - Collector (#2076, @jpkrohling)
- Improved graceful shutdown - Agent (#2031, @jpkrohling)
- UI pinned to version 1.8.0. The changelog is available here v1.8.0
- [tracegen] Add service name as a command line option (#2080, @kevinearls)
- Allow the Configuration of Additional Headers on the Jaeger Query HTTP API (#2056, @joe-elliott)
- Warn about time adjustment in tags (#2052, @bobrik)
- Add CLI flags for Kafka batching params (#2047, @apm-opentt)
- Added support for dynamic queue sizes (#1985, @jpkrohling)
- [agent] Process data loss stats from clients (#2010, @yurishkuro)
- Add /api/sampling endpoint in collector (#1990, @RickyRajinder)
- Add basic authentication to Kafka storage (#1983, @chandresh-pancholi)
- Make operation_strategies part also be part of default_strategy (#1749, @rutgerbrf)
- Upgrade gRPC to ^1.26 (#2077, @yurishkuro)
- Remove pkg/errors from dependencies (#2073, @yurishkuro)
- Update dependencies, pin grpc<1.27 (#2072, @yurishkuro)
- Refactor collector mains (#2060, @jpkrohling)
- Clarify that "kafka" is not a real storage backend (#2066, @yurishkuro)
- Added -trimpath option to go build (#2064, @kadern0)
- Use memory size flag to activate dyn queue size feature (#2059, @jpkrohling)
- Add before you push to the queue to prevent race condition on size (#2044, @joe-elliott)
- Count received batches from conforming clients (#2030, @yurishkuro)
- [agent] Do not increment data loss counters on the first client batch (#2028, @yurishkuro)
- Allow raw port numbers for UDP servers (#2025, @yurishkuro)
- Publish tracegen (#2022, @jpkrohling)
- Build binaries for Linux on IBM Z / s390x architecture (#1982, @prankkelkar)
- Admin/Query: Log the real port instead of the provided one to enable the use of port 0 (#2002, @ChadiEM)
- Split agent's HTTP server and handler (#1996, @yurishkuro)
- Clean-up collector handlers builder (#1991, @yurishkuro)
- Added 'resize' operation to BoundedQueue (#1948, @jpkrohling)
- Add common TLS configuration (#1838, @backjo)
- UI pinned to version 1.7.0. The changelog is available here v1.7.0
- Endpoint changes:
- Both Http & gRPC servers now take new optional parameter
spanKind
in addition toservice
. When spanKind is absent or empty, operations from all kinds of spans will be returned. - Instead of returning a list of string, both Http & gRPC servers return a list of operation struct. Please
update your client code to process the new response. Example response:
curl 'http://localhost:6686/api/operations?service=UserService&spanKind=server' | jq { "data": [{ "name": "UserService::getExtendedUser", "spanKind": "server" }, { "name": "UserService::getUserProfile", "spanKind": "server" }], "total": 2, "limit": 0, "offset": 0, "errors": null }
- The legacy http endpoint stay untouched:
/services/{%s}/operations
- Both Http & gRPC servers now take new optional parameter
- Storage plugin changes:
- Memory updated to support spanKind on write & read, no migration is required.
- Badger & ElasticSearch
to be implemented:
For now
spanKind
will be set as empty string during read & write, onlyname
will be valid operation name. - Cassandra updated to support spanKind on write & read (#1937, @guo0693):
If you don't run the migration script, nothing will break, the system will use the old table
operation_names
and set emptyspanKind
in the response. Steps to get the updated functionality:- You will need to run the command below on the host where you can use
cqlsh
to connect to Cassandra:The script will create new tableKEYSPACE=jaeger_v1 CQL_CMD='cqlsh host 9042 -u test_user -p test_password --request-timeout=3000' bash ./v002tov003.sh
operation_names_v2
and migrate data from the old table.spanKind
column will be empty for those data. At the end, it will ask you whether you want to drop the old table or not. - Restart ingester & query services so that they begin to use the new table
- You will need to run the command below on the host where you can use
Trace and Span IDs are always padded to 32 or 16 hex characters with leading zeros (#1956, @yurishkuro)
Previously, Jaeger backend always rendered trace and span IDs as the shortest possible hex string, e.g. an ID
with numeric value 255 would be rendered as a string ff
. This change makes the IDs to always render as 16 or 32
characters long hex string, e.g. the same id=255 would render as 00000000000000ff
. It mostly affects how UI
displays the IDs, the URLs, and the JSON returned from jaeger-query
service.
Motivation: Among randomly generated and uniformly distributed trace IDs, only 1/16th of them start with 0 followed by a significant digit, 1/256th start with two 0s, and so on in decreasing geometric progression. Therefore, trimming the leading 0s is a very modest optimization on the size of the data being transmitted or stored.
However, trimming 0s leads to ambiguities when the IDs are used as correlations with other monitoring systems, such as logging, that treat the IDs as opaque strings and cannot establish the equivalence between padded and unpadded IDs. It is also incompatible with W3C Trace Context and Zipkin B3 formats, both of which include all leading 0s, so an application instrumented with OpenTelemetry SDKs may be logging different trace ID strings than application instrumented with Jaeger SDKs (related issue #1657).
Overall, the change is backward compatible:
- links with non-padded IDs in the UI will still work
- data stored in Elasticsearch (where IDs are represented as strings) is still readable
However, some custom integration that rely on exact string matches of trace IDs may be broken.
Change default rollover conditions to 2 days (#1963, @pavolloffay)
Change default rollover conditions from 7 days to 2 days.
Given that by default Jaeger uses daily indices and some organizations do not keep data longer than 7 days the default of 7 days seems unreasonable - it might result in a too big index and running curator would immediately remove the old index.
- Support collector tags, similar to agent tags (#1854, @radekg)
- Support insecure TLS and only CA cert for Elasticsearch (#1918, @pavolloffay)
- Allow tracer config via env vars (#1919, @yurishkuro)
- Allow turning off tags/logs indexing in Cassandra (#1915, @joe-elliott)
- Blacklisting/Whitelisting tags for Cassandra indexing (#1904, @joe-elliott)
- Support custom basepath in HotROD (#1894, @jan25)
- Deprecate tchannel reporter flags (#1978, @objectiser)
- Do not truncate tags in Elasticsearch (#1970, @pavolloffay)
- Export SaveSpan to enable multiplexing (#1968, @albertteoh)
- Make rollover init step idempotent (#1964, @pavolloffay)
- Update python urllib3 version required by curator (#1965, @pavolloffay)
- Allow changing max log level for gRPC storage plugins (#1962, @yyyogev)
- Fix the bug that operation_name table can not be init more than once (#1961, @guo0693)
- Improve migration script (#1946, @guo0693)
- Fix order of the returned results from badger backend. (#1939, @burmanm)
- Update python pathlib to pathlib2 (#1930, @objectiser)
- Use proxy env vars if they're configured (#1910, @zoidbergwill)
- UI pinned to version 1.6.0. The changelog is available here v1.6.0
- Build platform binaries as part of CI (#1909, @yurishkuro)
- Upgrade and fix dependencies (#1907, @yurishkuro)
-
The default value for the Ingester's flag
ingester.deadlockInterval
has been changed to0
(#1868, @jpkrohling)With the new default, the ingester won't
panic
if there are no messages for the last minute. To restore the previous behavior, set the flag's value to1m
. -
Mark
--collector.grpc.tls.client.ca
flag as deprecated for jaeger-collector. (#1840, @yurishkuro)The deprecated flag will still work until being removed, it's recommended to use
--collector.grpc.tls.client-ca
instead.
- Support TLS for Kafka (#1414, @MichaHoffmann)
- Add ack and compression parameters for Kafka #1359 (#1712, @chandresh-pancholi)
- Propagate the bearer token to the gRPC plugin server (#1822, @radekg)
- Add Truncate and ReadOnly options for badger (#1842, @burmanm)
- Use correct context on ES search methods (#1850, @rubenvp8510)
- Handling of expected error codes coming from grpc storage plugins #1741 (#1814, @chandresh-pancholi)
- Fix ordering of indexScanKeys after TraceID parsing (#1809, @burmanm)
- Small memory optimizations in badger write-path (#1771, @burmanm)
- Set an empty value when a default env var value is missing (#1777, @jpkrohling)
- Decouple storage dependencies and bump Go to 1.13.x (#1886, @yurishkuro)
- Update gopkg.in/yaml.v2 dependency to v2.2.4 (#1865, @objectiser)
- Upgrade jaeger-client 2.19 and jaeger-lib 2.2 and prom client 1.x (#1810, @yurishkuro)
- Unpin grpc version and use serviceConfig to set the load balancer (#1786, @guanw)
- UI pinned to version 1.5.0. The changelog is available here v1.5.0
-
Create ES index templates instead of indices (#1627, @pavolloffay)
This can break existing Elasticsearch deployments if security policies are applied. For instance Jaeger
X-Pack
configuration now requires permission to create index templates -manage_index_templates
.
-
Add Elasticsearch version configuration to rollover script (#1769, @pavolloffay)
-
Add Elasticsearch version flag (#1753, @pavolloffay)
-
Add Elasticsearch 7 support (#1690, @gregoryfranklin)
The index mappings in Elasticsearch 7 are not backwards compatible with the older versions. Therefore using Elasticsearch 7 with data created with older version would not work. Elasticsearch 6.8 supports 7.x, 6.x, 5.x compatible mappings. The upgrade has to be done first to ES 6.8, then apply data migration or wait until old daily indices are removed (this requires to start Jaeger with
--es.version=7
to force using ES 7.x mappings for newly created indices).Jaeger by default uses Elasticsearch ping endpoint (
/
) to derive the version which is used for index mappings selection. The version can be overridden by flag--es.version
. -
Added support for hot reload of UI config (#1688, @jpkrohling)
-
Added base Grafana dashboard and Alert rules (#1745, @jpkrohling)
-
Add the jaeger-mixin for monitoring (#1668, @gouthamve)
-
Added flags for driving cassandra connection compression through config (#1675, @sagaranand015)
-
Support index cleaner for rollover indices and add integration tests (#1689, @pavolloffay)
-
Collector kafka producer protocol version config (#1658, @marqc)
-
Configurable kafka protocol version for msg consuming by jaeger ingester (#1640, @marqc)
-
Use credentials when describing keyspaces in cassandra schema builder (#1655, @MiLk)
-
Add connect-timeout for Cassandra (#1647, @sagaranand015)
- Fix gRPC over cmux and add unit tests (#1758, @yurishkuro)
- Add CA certificates to agent image (#1764, @yurishkuro)
- Fix badger merge-join algorithm to correctly filter indexes (#1721, @burmanm)
- Change Zipkin CORS origins and headers to comma separated list (#1556, @JonasVerhofste)
- Added null guards to 'Process' when processing an incoming span (#1723, @jpkrohling)
- Export expvar metrics of badger to the metricsFactory (#1704, @burmanm)
- Pass TTL as int, not as float64 (#1710, @yurishkuro)
- Use find by regex for archive index in index cleaner (#1693, @pavolloffay)
- Allow token propagation if token type is not specified (#1685, @rubenvp8510)
- Fix duplicated spans when querying Elasticsearch (#1677, @pavolloffay)
- Fix the threshold precision issue (#1665, @guanw)
- Badger filter duplicate results from a single indexSeek (#1649, @burmanm)
- Badger make default dirs work in Windows (#1653, @burmanm)
- UI pinned to version 1.4.0. The changelog is available here v1.4.0
-
The traces related metrics on collector now have a new tag
sampler_type
(#1576, @guanw)This might break some existing metrics dashboard (if so, users need to update query to aggregate over this new tag).
The list of metrics affected:
traces.received
,traces.rejected
,traces.saved-by-svc
. -
Remove deprecated index prefix separator
:
from Elastic (#1620, @pavolloffay)In Jaeger 1.9.0 release the Elasticsearch index separator was changed from
:
to-
. To keep backwards compatibility the query service kept querying indices with:
separator, however the new indices were created only with-
. This release of Jaeger removes the query capability for indices containing:
, therefore it's recommended to keep using older version until indices containing old separator are not queried anymore.
- Passthrough OAuth bearer token supplied to Query service through to ES storage (#1599, @rubenvp8510)
- Kafka kerberos authentication support for collector/ingester (#1589, @rubenvp8510)
- Allow Cassandra schema builder to use credentials (#1635, @PS-EGHornbostel)
- Add docs generation command (#1572, @pavolloffay)
- Fix data race between
Agent.Run()
andAgent.Stop()
(#1625, @tigrannajaryan) - Use json number when unmarshalling data from ES (#1618, @pavolloffay)
- Define logs as nested data type (#1622, @pavolloffay)
- Fix archive storage not querying old spans older than maxSpanAge (#1617, @pavolloffay)
- Query service: fix logging errors on SIGINT (#1601, @jan25)
- Direct grpc logs to Zap logger (#1606, @yurishkuro)
- Fix sending status to health check channel in Query service (#1598, @jan25)
- Add tmp-volume to all-in-one image to fix badger storage (#1571, @burmanm)
- Do not fail es-cleaner if there are no jaeger indices (#1569, @pavolloffay)
- Automatically set
GOMAXPROCS
(#1560, @rubenvp8510) - Add CA certs to all-in-one image (#1554, @chandresh-pancholi)
- UI pinned to version 1.3.0. The changelog is available here v1.3.0
-
The
kafka
flags were removed in favor ofkafka.producer
andkafka.consumer
flags (#1424, @ledor473)The following flags have been removed in the Collector and the Ingester:
--kafka.brokers --kafka.encoding --kafka.topic --ingester.brokers --ingester.encoding --ingester.topic --ingester.group-id
In the Collector, they are replaced by:
--kafka.producer.brokers --kafka.producer.encoding --kafka.producer.topic
In the Ingester, they are replaced by:
--kafka.consumer.brokers --kafka.consumer.encoding --kafka.consumer.topic --kafka.consumer.group-id
-
Add Admin port and group all ports in one file (#1442, @yurishkuro)
This change fixes issues #1428, #1332 and moves all metrics endpoints from API ports to admin ports. It requires re-configuring Prometheus scraping rules. Each Jaeger binary has its own admin port that can be found under
--admin-http-port
command line flag by running the${binary} help
command.
- Add gRPC resolver using external discovery service (#1498, @guanw)
- gRPC storage plugin framework (#1461, @chvck)
- Supports customized kafka client id (#1507, @newly12)
- Support gRPC for query service (#1307, @annanay25)
- Expose tls.InsecureSkipVerify to es.tls.* CLI flags (#1473, @stefanvassilev)
- Return info msg for
/health
endpoint (#1465, @stefanvassilev) - Add pprof endpoint to admin endpoint (#1375, @konradgaluszka)
- Add inbound transport as label to collector metrics #1446 (guanw)
- Sorted key/value store
badger
backed storage plugin (#760, @burmanm) - Add Admin port and group all ports in one file (#1442, @yurishkuro)
- Adds support for agent level tag (#1396, @annanay25)
- Add a Downsampling writer that drop a percentage of spans (#1353, @guanw)
- Sort traces in memory store to return most recent traces (#1394, @jacobmarble)
- Add span format tag for jaeger-collector (#1493, @guo0693)
- Upgrade gRPC to 1.20.1 (#1492, @guanw)
- Switch from counter to a gauge for partitions held (#1485, @bobrik)
- Add CORS handling for Zipkin collector service (#1463, @JonasVerhofste)
- Check elasticsearch nil response (#1467, @YEXINGZHE54)
- Disable sampling in logger -
zap
(#1460, @psinghal20) - New layout for proto definitions and generated files (#1427, @annanay25)
- Upgrade Go to 1.12.1 (#1437 ,@yurishkuro)
- UI pinned to version 1.2.0. The changelog is available here v1.2.0
-
Introduce
kafka.producer
andkafka.consumer
flags to replacekafka
flags (#1360, @ledor473)The following flags have been deprecated in the Collector and the Ingester:
--kafka.brokers --kafka.encoding --kafka.topic
In the Collector, they are replaced by:
--kafka.producer.brokers --kafka.producer.encoding --kafka.producer.topic
In the Ingester, they are replaced by:
--kafka.consumer.brokers --kafka.consumer.encoding --kafka.consumer.group-id
- Support secure gRPC channel between agent and collector (#1391, @ghouscht, @yurishkuro)
- Allow to use TLS with ES basic auth (#1388, @pavolloffay)
- Make
esRollover.py init
idempotent (#1407 and #1408, @pavolloffay) - Allow thrift reporter if grpc hosts are not provided (#1400, @pavolloffay)
- Deprecate colon in index prefix in ES dependency store (#1386, @pavolloffay)
- Make grpc reporter default and add retry (#1384, @pavolloffay)
- Use
CQLSH_HOST
in final call tocqlsh
(#1372, @funny-falcon)
- UI pinned to version 1.1.0. The changelog is available here v1.1.0
- Discover dependencies table version automatically (#1364, @black-adder)
- Separate query-service functionality from http handler (#1312, @annanay25)
- Remove cassandra SASI indices (#1328, @black-adder)
Migration Path:
- Run
plugin/storage/cassandra/schema/migration/v001tov002part1.sh
which will copy dependencies into a csv, update thedependency UDT
, create a newdependencies_v2
table, and write dependencies from the csv into thedependencies_v2
table. - Run the collector and query services with the cassandra flag
cassandra.enable-dependencies-v2=true
which will instruct jaeger to write and read to and from the newdependencies_v2
table. - Update spark job to write to the new
dependencies_v2
table. The feature will be done in #58. - Run
plugin/storage/cassandra/schema/migration/v001tov002part2.sh
which will DELETE the old dependency table and the SASI index.
Users who wish to continue to use the v1 table don't have to do anything as the cassandra flag cassandra.enable-dependencies-v2
will default to false. Users may migrate on their own timeline however new features will be built solely on the dependencies_v2
table. In the future, we will remove support for v1 completely.
- Support certificates in elasticsearch scripts (#1339, @pavolloffay)
- Add ES Rollover support to main indices (#1309, @pavolloffay)
- Load ES auth token from file (#1319, @pavolloffay)
- Add username/password authentication to ES index cleaner (#1318, @gregoryfranklin)
- Add implementation of FindTraceIDs function for Elasticsearch reader (#1280, @vlamug)
- Support archive traces for ES storage (#1197, @pavolloffay)
- Use Zipkin annotations if the timestamp is zero (#1341, @geobeau)
- Use GRPC round robin balancing even if only one hostname (#1329, @benley)
- Tolerate whitespaces in ES servers and kafka brokers (#1305, @verma-varsha)
- Let cassandra servers contain whitespace in config (#1301, @karlpokus)
- Change Elasticsearch index prefix from
:
to-
(#1284, @pavolloffay)
Changed index prefix separator from :
to -
because Elasticsearch 7 does not allow :
in index name.
Jaeger query still reads from old indices containing -
as a separator, therefore no configuration or migration changes are required.
- Add CLI configurable
es.max-num-spans
while retrieving spans from ES (#1283, @annanay25)
The default value is set to 10000. Before no limit was applied.
- Update to jaeger-lib 2 and latest sha for jaeger-client-go, to pick up refactored metric names (#1282, @objectiser)
Update to latest version of jaeger-lib
, which includes a change to the naming of counters exported to
prometheus, to follow the convention of using a _total
suffix, e.g. jaeger_query_requests
is now
jaeger_query_requests_total
.
Jaeger go client metrics, previously under the namespace jaeger_client_jaeger_
are now under
jaeger_tracer_
.
- Add gRPC metrics to agent (#1180, @pavolloffay)
The following metrics:
jaeger_agent_tchannel_reporter_batch_size{format="jaeger"} 0
jaeger_agent_tchannel_reporter_batch_size{format="zipkin"} 0
jaeger_agent_tchannel_reporter_batches_failures{format="jaeger"} 0
jaeger_agent_tchannel_reporter_batches_failures{format="zipkin"} 0
jaeger_agent_tchannel_reporter_batches_submitted{format="jaeger"} 0
jaeger_agent_tchannel_reporter_batches_submitted{format="zipkin"} 0
jaeger_agent_tchannel_reporter_spans_failures{format="jaeger"} 0
jaeger_agent_tchannel_reporter_spans_failures{format="zipkin"} 0
jaeger_agent_tchannel_reporter_spans_submitted{format="jaeger"} 0
jaeger_agent_tchannel_reporter_spans_submitted{format="zipkin"} 0
jaeger_agent_collector_proxy{endpoint="baggage",result="err"} 0
jaeger_agent_collector_proxy{endpoint="baggage",result="ok"} 0
jaeger_agent_collector_proxy{endpoint="sampling",result="err"} 0
jaeger_agent_collector_proxy{endpoint="sampling",result="ok"} 0
have been renamed to:
jaeger_agent_reporter_batch_size{format="jaeger",protocol="tchannel"} 0
jaeger_agent_reporter_batch_size{format="zipkin",protocol="tchannel"} 0
jaeger_agent_reporter_batches_failures{format="jaeger",protocol="tchannel"} 0
jaeger_agent_reporter_batches_failures{format="zipkin",protocol="tchannel"} 0
jaeger_agent_reporter_batches_submitted{format="jaeger",protocol="tchannel"} 0
jaeger_agent_reporter_batches_submitted{format="zipkin",protocol="tchannel"} 0
jaeger_agent_reporter_spans_failures{format="jaeger",protocol="tchannel"} 0
jaeger_agent_reporter_spans_failures{format="zipkin",protocol="tchannel"} 0
jaeger_agent_reporter_spans_submitted{format="jaeger",protocol="tchannel"} 0
jaeger_agent_reporter_spans_submitted{format="zipkin",protocol="tchannel"} 0
jaeger_agent_collector_proxy{endpoint="baggage",protocol="tchannel",result="err"} 0
jaeger_agent_collector_proxy{endpoint="baggage",protocol="tchannel",result="ok"} 0
jaeger_agent_collector_proxy{endpoint="sampling",protocol="tchannel",result="err"} 0
jaeger_agent_collector_proxy{endpoint="sampling",protocol="tchannel",result="ok"} 0
- Rename tcollector proxy metric in agent (#1182, @pavolloffay)
The following metric:
jaeger_http_server_errors{source="tcollector-proxy",status="5xx"}
has been renamed to:
jaeger_http_server_errors{source="collector-proxy",status="5xx"}
- Add tracegen utility for generating traces (#1245, @yurishkuro)
- Use DCAwareRoundRobinPolicy as fallback for TokenAwarePolicy (#1285, @vprithvi)
- Add Zipkin Thrift as kafka ingestion format (#1256, @geobeau)
- Add
FindTraceID
to the spanstore interface (#1246, @vprithvi) - Migrate from glide to dep (#1240, @isaachier)
- Make tchannel timeout for reporting in agent configurable (#1034, @gouthamve)
- Add archive traces to all-in-one (#1189, @pavolloffay)
- Start moving components of adaptive sampling to OSS (#973, @black-adder)
- Add gRPC communication between agent and collector (#1165, #1187, #1181 and #1180, @pavolloffay)
- Update exposed ports in ingester dockerfile (#1289, @objectiser)
- Upgrade Shopify/Sarama for proper handling newest kafka servers 2.x by ingester (#1248, @vprithvi)
- Fix sampling strategies overwriting service entry when no sampling type is specified (#1244, @objectiser)
- Fix dot replacement for int (#1272, @pavolloffay)
- Add C* query to error logs (#1250, @vprithvi)
- Add locking around partitionIDToState map accesses (#1239, @vprithvi)
- Reorganize config manager packages in agent (#1198, @pavolloffay)
- UI pinned to version 1.0.0. The changelog is available here v1.0.0
- Fix link in scatter plot when embed mode (#283, @aljesusg)
- Fix rendering X axis in TraceResultsScatterPlot - pass milliseconds to moment.js (#274, @istrel)
- Make agent timeout for reporting configurable and fix flags overriding (#1034, @gouthamve)
- Fix metrics handler registration in agent (#1178, @pavolloffay)
- Refactor agent configuration (#1092, @pavolloffay)
The following agent flags has been deprecated in order to support multiple reporters:
--collector.host-port
--discovery.conn-check-timeout
--discovery.min-peers
New flags:
--reporter.tchannel.host-port
--reporter.tchannel.discovery.conn-check-timeout
--reporter.tchannel.discovery.min-peers
- Various changes around metrics produced by jaeger-query: Names scoped to the query component, generated for all span readers (not just ES), consolidate query metrics and include result tag (#1074, #1075 and #1096, @objectiser)
For example, sample of metrics produced for find_traces
operation before:
jaeger_find_traces_attempts 1
jaeger_find_traces_errLatency_bucket{le="0.005"} 0
jaeger_find_traces_errors 0
jaeger_find_traces_okLatency_bucket{le="0.005"} 0
jaeger_find_traces_responses_bucket{le="0.005"} 1
jaeger_find_traces_successes 1
And now:
jaeger_query_latency_bucket{operation="find_traces",result="err",le="0.005"} 0
jaeger_query_latency_bucket{operation="find_traces",result="ok",le="0.005"} 2
jaeger_query_requests{operation="find_traces",result="err"} 0
jaeger_query_requests{operation="find_traces",result="ok"} 2
jaeger_query_responses_bucket{operation="find_traces",le="0.005"} 2
- Configurable deadlock detector interval for ingester (#1134, @marqc)
- Emit spans for elastic storage backend (#1128, @annanay25)
- Allow to use TLS certificates for Elasticsearch authentication (#1139, @clyang82)
- Add ingester metrics, healthcheck and rename Kafka cli flags (#1094, @ledor473)
- Add a metric for number of partitions held (#1154, @vprithvi)
- Log jaeger-collector tchannel port (#1136, @mindaugasrukas)
- Support tracer env based initialization in hotrod (#1115, @eundoosong)
- Publish ingester as binaries and docker image (#1086, @ledor473)
- Use Go 1.11 (#1104, @isaachier)
- Tag images with commit SHA and publish to
-snapshot
repository (#1082, @pavolloffay)
- Fix child span context while tracing cassandra queries (#1131, @annanay25)
- Deadlock detector hack for Kafka driver instability (#1087, @vprithvi)
- Fix processor overriding data in a buffer (#1099, @pavolloffay)
- Span Search - Highlight search results (#238), @davit-y
- Span Search - Improve search logic (#237), @davit-y
- Span Search - Add result count, navigation and clear buttons (#234), @davit-y
- Use correct duration format for scatter plot (#266), @tiffon)
- Fix collapse all issues (#264), @tiffon)
- Use a moderately sized canvas for the span graph (#257), @tiffon)
- Compare two traces (#228, @tiffon)
- Make tags clickable (#223, @divdavem)
- Directed graph as React component (#224, @tiffon)
- Timeline Expand and Collapse Features (#221, @davit-y)
- Integrate Google Analytics into Search Page (#220, @davit-y)
jaeger-standalone
binary has been renamed tojaeger-all-in-one
. This change also includes package rename fromstandalone
toall-in-one
(#1062, @pavolloffay)
- (Experimental) Allow storing tags as object fields in Elasticsearch for better Kibana support((#1018, @pavolloffay)
- Enable tracing of Cassandra queries (#1038, @yurishkuro)
- Make Elasticsearch index configurable (#1009, @pavolloffay)
- Add flags to allow changing ports for HotROD services (#951, @cboornaz17)
- (Experimental) Kafka ingester (#952, #942, #944, #940, @davit-y and @vprithvi))
- Use tags in agent metrics (#950, @eundoosong)
- Add support for Cassandra reconnect interval (#934, @nyanshak)
-
The storage implementations no longer write the parentSpanID field to storage (#856). If you are upgrading to this version, you must upgrade query service first!
-
Update Dockerfiles to reference executable via ENTRYPOINT (#815) by Zachary DiCesare (@zdicesare)
It is no longer necessary to specify the binary name when passing flags to containers. For example, to execute the
help
command of the collector, instead of$ docker run -it --rm jaegertracing/jaeger-collector /go/bin/collector-linux help
run
$ docker run -it --rm jaegertracing/jaeger-collector help
-
Detect HTTP payload format from Content-Type (#916) by Yuri Shkuro (@yurishkuro)
When submitting spans in Thrift format to HTTP endpoint
/api/traces
, theformat
argument is no longer required, but the Content-Type header must be set to "application/vnd.apache.thrift.binary". -
Change metric tag from "service" to "svc" (#883) by Won Jun Jang (@black-adder)
-
Add Kafka as a Storage Plugin (#862) by David Yeghshatyan (@davit-y)
The collectors can be configured to write spans to Kafka for further data mining.
-
Package static assets inside the query-service binary (#918) by Yuri Shkuro (@yurishkuro)
It is no longer necessary (but still possible) to pass the path to UI static assets to jaeger-query and jaeger-standalone binaries.
-
Replace domain model with Protobuf/gogo-generated model (#856) by Yuri Shkuro (@yurishkuro)
First step towards switching to Protobuf and gRPC.
-
Include HotROD binary in the distributions (#917) by Yuri Shkuro (@yurishkuro)
-
Improve HotROD demo (#915) by Yuri Shkuro (@yurishkuro)
-
Add DisableAutoDiscovery param to cassandra config (#912) by Bill Westlin (@whistlinwilly)
-
Add connCheckTimeout flag to agent (#911) by Henrique Rodrigues (@Henrod)
-
Ability to use multiple storage types (#880) by David Yeghshatyan (@davit-y)
- [ES storage] Log number of total and failed requests (#902) by Tomasz Adamski (@tmszdmsk)
- [ES storage] Do not log requests on error (#901) by Tomasz Adamski (@tmszdmsk)
- [ES storage] Do not exceed ES _id length limit (#905) by Łukasz Harasimowicz (@harnash) and Tomasz Adamski (@tmszdmsk)
- Add cassandra index filter (#876) by Won Jun Jang (@black-adder)
- Close span writer in standalone (#863) (4 weeks ago) by Pavol Loffay (@pavolloffay)
- Log configuration options for memory storage (#852) (6 weeks ago) by Juraci Paixão Kröhling (@jpkrohling)
- Update collector metric counters to have a name (#886) by Won Jun Jang (@black-adder)
- Add CONTRIBUTING_GUIDELINES.md (#864) by (@PikBot)
- Add bounds to memory storage (#845) by Juraci Paixão Kröhling (@jpkrohling)
- Add metric for debug traces (#796) by Won Jun Jang (@black-adder)
- Change metrics naming scheme (#776) by Juraci Paixão Kröhling (@jpkrohling)
- Remove ParentSpanID from domain model (#831) by Yuri Shkuro (@yurishkuro)
- Add ability to adjust static sampling probabilities per operation (#827) by Won Jun Jang (@black-adder)
- Support log-level flag on agent (#828) by Won Jun Jang (@black-adder)
- Add healthcheck to standalone (#784) by Eundoo Song (@eundoosong)
- Do not use KeyValue fields directly and use KeyValues as decorator only (#810) by Yuri Shkuro (@yurishkuro)
- Upgrade to go 1.10 (#792) by Prithvi Raj (@vprithvi)
- Do not create Cassandra index if it already exists (#782) by Greg Swift (@gregswift)
- None
- Publish binaries for Linux, Darwin, and Windows (#765) - thanks to @grounded042
- View Trace JSON buttons return formatted JSON (fixes #199)
- Support traces with >10k spans in Elasticsearch (#668) - thanks to @sramakr
- Allow slash '/' in service names (#586)
- Log errors from HotROD services (#769)
- Add sampling handler with file-based configuration for agents to query (#720) (#674)
- Allow overriding base path for UI/API routes and remove --query.prefix (#748)
- Add Dockerfile for hotrod example app (#694) <Guilherme Baufaker Rêgo>
- Publish hotrod image to docker hub (#702)
- Dockerize es-index-cleaner script (#741)
- Add a flag to control Cassandra consistency level (#700)
- Collect metrics from ES bulk service (#688)
- Allow zero replicas for Elasticsearch (#754)
- Apply namespace when creating Prometheus metrics factory (fix for #732) (#733)
- Disable double compression on Prom Handler - fixes #697 (#735) <Juraci Paixão Kröhling>
- Use the default metricsFactory if not provided (#739)
- Avoid duplicate expvar metrics - fixes #716 (#726)
- Make sure different tracers in HotROD process use different random generator seeds (#718)
- Test that processes with identical tags are deduped (#708)
- When converting microseconds to time.Time ensure UTC timezone (#712)
- Add to WaitGroup before the goroutine creation (#711)
- Pin testify version to ^1.2.1 (#710)
- Support running Jaeger behind a reverse proxy (fixes #42)
- Track Javascript errors via Google Analytics (fixes #39)
- Add Google Analytics event tracking for actions in trace view (#191)
- Clearly identify traces without a root span (fixes #190)
- Fix #166 JS error on search page after viewing 404 trace
- Use elasticsearch bulk API (#656)
- Support archive storage in the query-service (#604)
- Introduce storage factory framework and composable CLI (#625)
- Make agent host port configurable in hotrod (#663)
- Add signal handling to standalone (#657)
- Remove the override of GOMAXPROCS (#679)
- Use UTC timezone for ES indices (#646)
- Fix elasticsearch create index race condition error (#641)
- Use Ant Design instead of Semantic UI (jaegertracing/jaeger-ui#169)
- Fix 2 digit lookback (12h, 24h) parsing (jaegertracing/jaeger-ui#167)
- Add support for retrieving unadjusted/raw traces (#615)
- Add CA certificates to collector/query images (#485)
- Parse zipkin v2 high trace id (#596)
- Skip nil and zero length hits in ElasticSearch storage (#601)
- Make Cassandra service_name_index inserts idempotent (#587)
- Align atomic int64 to word boundary to fix SIGSEGV (#592)
- Add adjuster that removes bad span references (#614)
- Set operationNames cache initial capacity to 10000 (#621)
- Change tag search input syntax to logfmt (jaegertracing/jaeger-ui#145)
- Make threshold for enabling DAG view configurable (jaegertracing/jaeger-ui#130)
- Show better error messages for failed API calls (jaegertracing/jaeger-ui#127)
- Add View Option for raw/unadjusted trace (jaegertracing/jaeger-ui#153)
- Add timezone tooltip to custom lookback form-field (jaegertracing/jaeger-ui#161)
- Use consistent icons for logs expanded/collapsed (jaegertracing/jaeger-ui#86)
- Encode service name in API calls to allow '/' (jaegertracing/jaeger-ui#138)
- Fix endless trace HTTP requests (jaegertracing/jaeger-ui#128)
- Fix JSON view when running in dev mode (jaegertracing/jaeger-ui#139)
- Fix trace name resolution (jaegertracing/jaeger-ui#134)
- Only JSON.parse JSON strings in tags/logs values (jaegertracing/jaeger-ui#162)
- Support Prometheus metrics as default for all components (#516)
- Enable TLS client connections to Cassandra (#555)
- Fix issue where Domain to UI model converter double reports references (#579)
- Make dependencies tab configurable (#122)
- Verify stored search settings jaegertracing/jaeger-ui#111
- Fix browser back button not working correctly jaegertracing/jaeger-ui#110
- Handle FOLLOWS_FROM ref type jaegertracing/jaeger-ui#118
- Allow embedding custom UI config in index.html #490
- Add startTimeMillis field to JSON Spans submitted to ElasticSearch #491
- Introduce version command and handler #517
- Fix ElasticSearch aggregation errors when index is empty #535
- Change package from uber to jaegertracing #528
- Introduce logging level configuration #514
- Support Zipkin v2 json #518
- Add HTTP compression handler #545
- Refactor trace detail jaegertracing/jaeger-ui#53
- Virtualized scrolling for trace detail view jaegertracing/jaeger-ui#68
- Mouseover expands truncated text to full length in left column in trace view jaegertracing/jaeger-ui#71
- Make left column adjustable in trace detail view jaegertracing/jaeger-ui#74
- Fix trace mini-map blurriness when < 60 spans jaegertracing/jaeger-ui#77
- Fix Google Analytics tracking jaegertracing/jaeger-ui#81
- Improve search dropdowns jaegertracing/jaeger-ui#84
- Add keyboard shortcuts and minimap UX jaegertracing/jaeger-ui#93
- Add tracing to the query server #454
- Support configuration files #462
- Add cassandra tag filter #442
- Handle ports > 32k in Zipkin JSON #488
- Convert to Apache 2.0 License