Skip to content

Latest commit

 

History

History
279 lines (278 loc) · 31.6 KB

CHANGELOG.md

File metadata and controls

279 lines (278 loc) · 31.6 KB

Changelog

  • [CHANGE] Add new metric slow_request_server_throughput to track the throughput of slow queries. #619
  • [CHANGE] Log middleware updated to honor logRequestHeaders in all logging scenarios. #615
  • [CHANGE] Roll back the gRPC dependency to v1.65.0 to allow downstream projects to avoid a performance regression and maybe a bug in v1.66.0. #581
  • [CHANGE] Update the gRPC dependency to v1.66.0 and deprecate the grpc_server_recv_buffer_pools_enabled option that is no longer supported by it. #580
  • [CHANGE] ring.DoBatchWithOptions (and ring.DoBatch) reports the cancelation cause when the context is canceled instead of context.Canceled.
  • [CHANGE] Multierror: Implement Unwrap() []error. This allows to use multierror.MultiError with both errors.Is() and errors.As(). Previously implemented Is(error) bool has been removed. #522
  • [CHANGE] Add a new grpc_server_recv_buffer_pools_enabled option that enables recv buffer pools in the gRPC server (assuming grpc_server_stats_tracking_enabled is disabled). #510
  • [CHANGE] Add a new grpc_server_stats_tracking_enabled option that allows us to disable stats tracking and potentially improve server memory reuse. #507
  • [CHANGE] Add support for PROXY protocol to Server. #508
  • [CHANGE] Add a new middleware to cancel http requests cleanly based on http.TimeoutHandler. #504
  • [CHANGE] Exemplar label changes from "traceID" to "trace_id". #487
  • [CHANGE] server: import godeltaprof/http/pprof adding /debug/pprof/delta_{heap,mutex,block} endpoints to DefaultServeMux #450
  • [CHANGE] Move httpgrpc request & response utils up from httpgrpc/server to httpgrpc package #434, #435
  • [CHANGE] Updated the minimum required Go version to 1.20 and update Drone config #420
  • [CHANGE] Change WaitRingStability and WaitInstanceState methods signature to rely on ReadRing instead. #251
  • [CHANGE] Added new -consul.cas-retry-delay flag. It has a default value of 1s, while previously there was no delay between retries. #178
  • [CHANGE] Flagext: DayValue now always uses UTC when parsing or displaying dates. #71
  • [CHANGE] Closer: remove the closer package since it's trivial to just copy/paste. #70
  • [CHANGE] Memberlist: allow specifying address and port advertised to the memberlist cluster members by setting the following configuration: #37
    • -memberlist.advertise_addr
    • -memberlist.advertise_port
  • [CHANGE] Removed global metrics for KV package. Making a KV object will now require a prometheus registerer that will be used to register all relevant KV class metrics. #22
  • [CHANGE] Added CHANGELOG.md and Pull Request template to reference the changelog
  • [CHANGE] Remove cortex_ prefix for metrics registered in the ring. #46
  • [CHANGE] Rename kv/kvtls to crypto/tls. #39
  • [CHANGE] spanlogger: Take interface implementation for extracting tenant ID. #59
  • [CHANGE] The status_code label on gRPC client metrics has changed from '200' and '500' to '2xx', '5xx', '4xx', 'cancel' or 'error'. #68
  • [CHANGE] Memberlist: changed probe interval from 1s to 5s and probe timeout from 500ms to 2s. #90
  • [CHANGE] Remove package math. #104
  • [CHANGE] time: Remove time package. #103
  • [CHANGE] grpcutil: Convert Resolver into concrete type. #105
  • [CHANGE] grpcutil.Resolver.Resolve: Take a service parameter. #102
  • [CHANGE] grpcutil.Update: Remove gRPC LB related metadata. #102
  • [CHANGE] concurrency.ForEach: deprecated and reimplemented by new concurrency.ForEachJob. #113
  • [CHANGE] grpcclient: Bump default grpc-max-send-msg-size flag to 100 Mb. #123
  • [CHANGE] ring/client: It's now possible to set different value than consul as default KV store. #120
  • [CHANGE] Lifecycler: Default value of lifecycler's final-sleep is now 0s (i.e. no sleep). #121
  • [CHANGE] Minor cosmetic changes in ring and memberlist HTTP status templates. #149
  • [CHANGE] flagext.Secret: value field is no longer exported. Value can be read using String() method and set using Set method. #154
  • [CHANGE] spanlogger.SpanLogger: Log the user ID from the context with the user label instead of org_id. #156
  • [CHANGE] ring: removed the following metrics from ring client and lifecycler: #161
    • member_ring_tokens_owned
    • member_ring_tokens_to_own
    • ring_tokens_owned
    • ring_member_ownership_percent
  • [CHANGE] Memberlist: -memberlist.abort-if-join-fails option now defaults to false.
  • [CHANGE] Remove middleware package. #182
  • [CHANGE] Memberlist: disabled TCP-based ping fallback, because dskit already uses a custom transport based on TCP. #194
  • [CHANGE] Memberlist: KV store now fast-joins memberlist cluster before serving any KV requests. #195
  • [CHANGE] Ring: remove duplicate state in NewOp func #203
  • [CHANGE] Memberlist: Increase the leave timeout to 10x the connection timeout, so that we can communicate the leave to at least 1 node, if the first 9 we try to contact times out.
  • [CHANGE] Netutil: removed debug log message "found network interfaces with private IP addresses assigned". #216
  • [CHANGE] Runtimeconfig: Listener channels created by Manager no longer receive current value on every reload period, but only if any configuration file has changed. #218
  • [CHANGE] Services: FailureWatcher.WatchService() and FailureWatcher.WatchManager() now panic if FailureWatcher is nil. #219
  • [CHANGE] Memberlist: cluster label verification feature (-memberlist.cluster-label and -memberlist.cluster-label-verification-disabled) is now marked as stable. #222
  • [CHANGE] Cache: Switch Memcached backend to use github.com/grafana/gomemcache repository instead of github.com/bradfitz/gomemcache. #248
  • [CHANGE] Multierror: Implement Is(error) bool. This allows to use multierror.MultiError with errors.Is(). MultiError will in turn call errors.Is() on every error value. #254
  • [CHANGE] Cache: Remove the context.Context argument from the Cache.Store method and rename the method to Cache.StoreAsync. #273
  • [CHANGE] ring: make it harder to leak contexts when using DoUntilQuorum. #319
  • [CHANGE] ring: CountTokens(), used for the calculation of ownership in the ring page has been changed in such a way that when zone-awareness is enabled, it calculates the ownership per-zone. Previously zones were not taken into account. #325
  • [CHANGE] memberlist: MetricsRegisterer field has been removed from memberlist.KVConfig in favor of registrerer passed via into NewKV function. #327
  • [CHANGE] gRPC client: use default connect timeout of 5s, and therefore enable default connect backoff max delay of 5s. #332
  • [CHANGE] Remove github.com/grafana/dskit/errors in favour of Go's errors package. #357
  • [CHANGE] Remove grpcutil.IsGRPCContextCanceled() in favour of grpcutil.IsCanceled(). #357
  • [CHANGE] Remove logrus and log.Interface. #359
  • [CHANGE] Remove jaeger-specific opentracing usage in middleware.HTTPGRPCTracer. #372
  • [CHANGE] Always include source IPs in HTTP and gRPC server spans. #386
  • [CHANGE] ring: Change PoolFactory function type to an interface and create function implementations. #387
  • [CHANGE] server: fix incorrect spelling of "gRPC" in server.Config fields. #422
  • [CHANGE] memberlist: re-resolve JoinMembers during full joins to the cluster (either at startup or periodic). The re-resolution happens on every 100 attempted nodes. This helps speed up joins and respect context cancelation #411
  • [CHANGE] ring: ring.DoBatch() was deprecated in favor of DoBatchWithOptions(). #431
  • [CHANGE] tenant: Remove tenant.WithDefaultResolver() and SingleResolver in favor of global functions tenant.TenantID(), tenant.TenantIDs(), or MultiResolver. #445
  • [CHANGE] Cache: Remove legacy metrics from Memcached client that contained _memcached_ in the name. #461
  • [CHANGE] memberlist: Change default for memberlist.stream-timeout from 10s to 2s. #458
  • [CHANGE] Cache: Remove errors from set methods on RemoteCacheClient interface. #466
  • [CHANGE] Expose BuildHTTPMiddleware to enable dskit Server instrumentation addition on external *mux.Routers. #459
  • [CHANGE] Remove RouteHTTPToGRPC option and related functionality #460
  • [CHANGE] Cache: Remove MemcachedCache and RedisCache structs in favor of RemoteCacheAdapter or using the underlying clients directly. #471
  • [CHANGE] Removed unused time.Duration parameter from ShouldLog() function in middleware.OptionalLogging interface. #513
  • [CHANGE] Changed ShouldLog() function signature in middleware.OptionalLogging interface to ShouldLog(context.Context) (bool, string): the returned string contains an optional reason. When reason is valued, GRPCServerLog adds (<reason>) suffix to the error. #514
  • [CHANGE] Cache: Remove superfluous cache.RemoteCacheClient interface and unify all caches using the cache.Cache interface. #520
  • [CHANGE] Updated the minimum required Go version to 1.21. #540
  • [CHANGE] Backoff: added Backoff.ErrCause() which is like Backoff.Err() but returns the context cause if backoff is terminated because the context has been canceled. #538
  • [CHANGE] memberlist: Metric memberlist_client_messages_in_broadcast_queue is now split into queue="local" and queue="gossip" values. #539
  • [CHANGE] memberlist: Failure to fast-join a cluster via contacting a node is now logged at info instead of debug. #585
  • [CHANGE] Service.AddListener and Manager.AddListener now return function for stopping the listener. #564
  • [CHANGE] ring: Add InstanceRingReader interface to ring package. #597
  • [FEATURE] Cache: Add support for configuring a Redis cache backend. #268 #271 #276
  • [FEATURE] Add support for waiting on the rate limiter using the new WaitN method. #279
  • [FEATURE] Add log.BufferedLogger type. #338
  • [FEATURE] Add flagext.ParseFlagsAndArguments() and flagext.ParseFlagsWithoutArguments() utilities. #341
  • [FEATURE] Add log.RateLimitedLogger for limiting the rate of logging. The logger_rate_limit_discarded_log_lines_total metrics traces the total number of discarded log lines per level. #352
  • [FEATURE] Add middleware.HTTPGRPCTracer for more detailed server-side tracing spans and tags on httpgrpc.HTTP/Handle requests
  • [FEATURE] Server: Add support for GrpcInflightMethodLimiter -- limiting gRPC requests before reading full request into the memory. This can be used to implement global or method-specific inflight limits for gRPC methods. #377 #392
  • [FEATURE] Server: Add -grpc.server.num-workers flag that configures the grpc.NumStreamWorkers() option. This can be used to start a fixed base amount of workers to process gRPC requests and avoid stack allocation for each call. #400
  • [FEATURE] Add PartitionRing. The partitions ring is hash ring to shard data between partitions. #474 #476 #478 #479 #481 #483 #484 #485 #488 #489 #493 #496 #497 #498 #503 #509
  • [FEATURE] Add methods Increment, FlushAll, CompareAndSwap, Touch to cache.MemcachedClient #477
  • [FEATURE] Add concurrency.ForEachJobMergeResults() utility function. #486
  • [FEATURE] Add ring.DoMultiUntilQuorumWithoutSuccessfulContextCancellation(). #495
  • [ENHANCEMENT] Add option to hide token information in ring status page #633
  • [ENHANCEMENT] Display token information in partition ring status page #631
  • [ENHANCEMENT] Add ability to log all source hosts from http header instead of only the first one. #444
  • [ENHANCEMENT] Add configuration to customize backoff for the gRPC clients.
  • [ENHANCEMENT] Use SecretReader interface to fetch secrets when configuring TLS. #274
  • [ENHANCEMENT] Add middleware package. #38
  • [ENHANCEMENT] Add the ring package #45
  • [ENHANCEMENT] Add limiter package. #41
  • [ENHANCEMENT] Add grpcclient, grpcencoding and grpcutil packages. #39
  • [ENHANCEMENT] Replace go-kit/kit/log with go-kit/log. #52
  • [ENHANCEMENT] Add spanlogger package. #42
  • [ENHANCEMENT] Add runutil.CloseWithLogOnErr function. #58
  • [ENHANCEMENT] Add cache, gate and stringsutil packages. #239
  • [ENHANCEMENT] Cache: add Redis support. #245
  • [ENHANCEMENT] Optimise memberlist receive path when used as a backing store for rings with a large number of members. #76 #77 #84 #91 #93
  • [ENHANCEMENT] Memberlist: prepare the data to send on the write before starting counting the elapsed time for -memberlist.packet-write-timeout, in order to reduce chances we hit the timeout when sending a packet to other node. #89
  • [ENHANCEMENT] Memberlist: parallelize processing of messages received by memberlist. #110
  • [ENHANCEMENT] flagext: for cases such as DeprecatedFlag() that need a logger, add RegisterFlagsWithLogger. #80
  • [ENHANCEMENT] Added option to BasicLifecycler to keep instance in the ring when stopping. #97
  • [ENHANCEMENT] Add WaitRingTokensStability function to ring, to be able to wait on ring stability excluding allowed state transitions. #95
  • [ENHANCEMENT] Trigger metrics update on ring changes instead of doing it periodically to speed up tests that wait for certain metrics. #107
  • [ENHANCEMENT] Add an HTTP hedging library. #115
  • [ENHANCEMENT] Ring: Add ring page handler to BasicLifecycler and Lifecycler. #112
  • [ENHANCEMENT] Lifecycler: It's now possible to change default value of lifecycler's final-sleep. #121
  • [ENHANCEMENT] Memberlist: Update to latest fork of memberlist. #160
  • [ENHANCEMENT] Memberlist: extracted HTTP status page handler to memberlist.HTTPStatusHandler which now can be instantiated with a custom template. #163
  • [ENHANCEMENT] Lifecycler: add flag to clear tokens on shutdown. #167
  • [ENHANCEMENT] ring: Added InstanceRegisterDelegate. #177
  • [ENHANCEMENT] ring: optimize shuffle-shard computation when lookback is used, and all instances have registered timestamp within the lookback window. In that case we can immediately return origial ring, because we would select all instances anyway. #181
  • [ENHANCEMENT] Runtimeconfig: Allow providing multiple runtime config yaml files as comma separated list file paths. #183
  • [ENHANCEMENT] Memberlist: Add cluster label support to memberlist client. #187
  • [ENHANCEMENT] Runtimeconfig: Don't unmarshal and merge runtime config yaml files if they haven't changed since last check. #218
  • [ENHANCEMENT] ring: DoBatch now differentiates between 4xx and 5xx GRPC errors and keeps track of them separately. It only returns when there is a quorum of either error class. If your errors do not implement GRPCStatus() *Status from google.golang.org/grpc/status, then this change does not affect you. #201
  • [ENHANCEMENT] Added <prefix>.tls-min-version and <prefix>.tls-cipher-suites flags to client configurations. #217
  • [ENHANCEMENT] Concurrency: Add LimitedConcurrencySingleFlight to run jobs concurrently and with in-flight deduplication. #214
  • [ENHANCEMENT] Add the ability to define custom gRPC health checks. #227
  • [ENHANCEMENT] Import Bytes type, DeleteAll function and DNS package from Thanos. #228
  • [ENHANCEMENT] Execute health checks in ring client pool concurrently. #237
  • [ENHANCEMENT] Cache: Add the ability to use a custom memory allocator for cache results. #249
  • [ENHANCEMENT] Add the metrics package to support per tenant metric aggregation. #258
  • [ENHANCEMENT] Cache: Add Delete method to cache.Cache interface. #255
  • [ENHANCEMENT] Cache: Add backward compatible metrics exposition for cache. #255
    • <prefix>_cache_client_info{backend="[memcached|redis]",...}
    • <prefix>_cache_dns_failures_total{backend="[memcached|redis]",...}
    • <prefix>_cache_dns_lookups_total{backend="[memcached|redis]",...}
    • <prefix>_cache_dns_provider_results{backend="[memcached|redis]",...}
    • <prefix>_cache_getmulti_gate_duration_seconds_bucket{backend="[memcached|redis]",...}
    • <prefix>_cache_getmulti_gate_duration_seconds_count{backend="[memcached|redis]",...}
    • <prefix>_cache_getmulti_gate_duration_seconds_sum{backend="[memcached|redis]",...}
    • <prefix>_cache_getmulti_gate_queries_concurrent_max{backend="[memcached|redis]",...}
    • <prefix>_cache_getmulti_gate_queries_in_flight{backend="[memcached|redis]",...}
    • <prefix>_cache_hits_total{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_data_size_bytes_bucket{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_data_size_bytes_count{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_data_size_bytes_sum{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_duration_seconds_bucket{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_duration_seconds_count{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_duration_seconds_sum{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_failures_total{backend="[memcached|redis]",...}
    • <prefix>_cache_operation_skipped_total{backend="[memcached|redis]",...}
    • <prefix>_cache_operations_total{backend="[memcached|redis]",...}
    • <prefix>_cache_requests_total{backend="[memcached|redis]",...}
  • [ENHANCEMENT] Lifecycler: Added InstancesInZoneCount and InstancesCount functions returning respectively the total number of instances in the ring and the number of instances in the ring that are registered in lifecycler's zone, updated during the last heartbeat period. #270
  • [ENHANCEMENT] Memcached: add MinIdleConnectionsHeadroomPercentage support. It configures the minimum number of idle connections to keep open as a percentage of the number of recently used idle connections. If negative (default), idle connections are kept open indefinitely. #269
  • [ENHANCEMENT] Memcached: Add support for using TLS or mTLS with Memcached based caching. #278
  • [ENHANCEMENT] Ring: improve performance of shuffle sharding computation. #281
  • [ENHANCEMENT] Add option to enable IPv6 address detection in ring and memberlist handling. #185
  • [ENHANCEMENT] Ring: cache results of shuffle sharding with lookback where possible. #283 #295
  • [ENHANCEMENT] BasicLifecycler: functions ShouldKeepInstanceInTheRingOnShutdown and SetKeepInstanceInTheRingOnShutdown for checking and setting whether instances should be kept in the ring or unregistered on shutdown have been added. #290
  • [ENHANCEMENT] Ring: add DoUntilQuorum method. #293
  • [ENHANCEMENT] Ring: add ReplicationSet.ZoneCount() method. #298
  • [ENHANCEMENT] Ring: add request minimization to DoUntilQuorum method. #306
  • [ENHANCEMENT] grpcclient: add <prefix>.initial-stream-window-size and <prefix>.initial-connection-window-size configuration flags to alter HTTP flow control options for a gRPC client. #312
  • [ENHANCEMENT] Added TokenGenerator interface with RandomTokenGenerator (generating random tokens), and SpreadMinimizingTokenGenerator (generating tokens with almost even distribution) implementation. By default RandomTokenGenerator is used. #321
  • [ENHANCEMENT] Lifecycler: Added RingTokenGenerator configuration that specifies the TokenGenerator implementation that is used for token generation. Default value is nil, meaning that RandomTokenGenerator is used. #323
  • [ENHANCEMENT] BasicLifecycler: Added RingTokenGenerator configuration that specifies the TokenGenerator implementation that is used for token generation. Default value is nil, meaning that RandomTokenGenerator is used. #323
  • [ENHANCEMENT] Ring: add support for hedging to DoUntilQuorum when request minimization is enabled. #330
  • [ENHANCEMENT] Lifecycler: allow instances to register in ascending order of ids in case of spread minimizing token generation strategy. #326
  • [ENHANCEMENT] Remove dependency on github.com/weaveworks/common package by migrating code to a corresponding package in github.com/grafana/dskit. #342
  • [ENHANCEMENT] Add ability to pass TLS certificates and keys inline when configuring server-side TLS. #349 #363
  • [ENHANCEMENT] Migrate github.com/weaveworks/common/aws and github.com/weaveworks/common/test packages to corresponding packages in github.com/grafana/dskit. #356
  • [ENHANCEMENT] Ring: optionally emit logs and trace events in DoUntilQuorum. #361
  • [ENHANCEMENT] metrics: Add MetricFamilyMap.MinGauges and MetricFamiliesPerTenant.SendMinOfGauges methods. #366
  • [ENHANCEMENT] metrics: add MatchesLabels, FindMetricsInFamilyMatchingLabels and FindHistogramWithNameAndLabels test helper methods. #365
  • [ENHANCEMENT] SpanLogger: much more efficient debug logging. #367
  • [ENHANCEMENT] server: clarify documentation for -server.grpc-max-concurrent-streams CLI flag. #369
  • [ENHANCEMENT] Ring: Add ID attribute to InstanceDesc for ring members. #387
  • [ENHANCEMENT] httpgrpc, grpcutil: added constants and functions for adding request details into outgoing grpc metadata. #391
  • [ENHANCEMENT] Ring: clarify the message logged by DoUntilQuorum when the first failure in a zone occurs. #402
  • [ENHANCEMENT] Ring: include instance ID in log messages emitted by DoUntilQuorum. #402
  • [ENHANCEMENT] Ring: add support for aborting early if a terminal error is returned by a request initiated by DoUntilQuorum. #404 #413
  • [ENHANCEMENT] Memcached: allow to configure write and read buffer size (in bytes). #414
  • [ENHANCEMENT] Server: Add -server.http-read-header-timeout option to specify timeout for reading HTTP request header. It defaults to 0, in which case reading of headers can take up to -server.http-read-timeout, leaving no time for reading body, if there's any. #423
  • [ENHANCEMENT] Make httpgrpc.Server produce non-loggable errors when a header with key httpgrpc.DoNotLogErrorHeaderKey and any value is present in the HTTP response. #421
  • [ENHANCEMENT] Server: Add -server.report-grpc-codes-in-instrumentation-label-enabled CLI flag to specify whether gRPC status codes should be used in status_code label of request duration metric. It defaults to false, meaning that gRPC status codes are represented with error value. #424 #425
  • [ENHANCEMENT] Added middleware.ReportGRPCStatusOption that can be passed to the following functions to enable reporting of gRPC status codes in "status_code" label (instead of simplified "error", "cancel" or "success" values): #424
  • middleware.UnaryServerInstrumentInterceptor
  • middleware.StreamServerInstrumentInterceptor
  • middleware.UnaryClientInstrumentInterceptor
  • middleware.StreamClientInstrumentInterceptor
  • middleware.Instrument
  • [ENHANCEMENT] Server: Added new -server.http-log-closed-connections-without-response-enabled option to log details about closed connections that received no response. #426
  • [ENHANCEMENT] ring: Added new function DoBatchWithClientError() that extends an already existing DoBatch(). The former differentiates between client and server errors by the filtering function passed as a parameter. This way client and server errors can be tracked separately. The function returns only when there is a quorum of either error class. #427
  • [ENHANCEMENT] ring: Replaced DoBatchWithClientError() with DoBatchWithOptions(), allowing a new option to use a custom Go(func()) implementation that may use pre-allocated workers instead of spawning a new goroutine for each request. #431
  • [ENHANCEMENT] ring: add support for prioritising zones when using DoUntilQuorum with request minimisation and zone awareness. #440
  • [ENHANCEMENT] ballast: add utility function to allocate heap ballast. #446
  • [ENHANCEMENT] ring: use and provide context cancellation causes in DoUntilQuorum. #449
  • [ENHANCEMENT] ring: ring.DoBatch and ring.DoBatchWithOptions now check the context cancellation while calculating the replication instances, failing if the context was canceld. #454
  • [ENHANCEMENT] Cache: Export Memcached and Redis client types instead of returning the interface, RemoteCacheClient, they implement. #453
  • [ENHANCEMENT] SpanProfiler: add spanprofiler package. #448
  • [ENHANCEMENT] Server: Add support for ReportHTTP4XXCodesInInstrumentationLabel, which specifies whether HTTP 4xx status codes should be used in status_code label of request duration metric. It defaults to false, meaning that HTTP 4xx status codes are represented with success value. Moreover, when ReportHTTP4XXCodesInInstrumentationLabel is set to true, responses with HTTP status code 4xx are returned as errors. #457
  • [ENHANCEMENT] Ring: Add HasReplicationSetChangedWithoutStateOrAddr to detect replication state changes ignoring IP address changes due to e.g. member restarts. #464
  • [ENHANCEMENT] SpanLogger: Add SetSpanAndLogTag method. #467
  • [ENHANCEMENT] Add servicediscovery package. #469
  • [ENHANCEMENT] Expose InstancesInZoneCount and ZonesCount in ring.ReadRing interface. #494
  • [ENHANCEMENT] Add optimization to run concurrency.ForEachJob() with no parallelism when there's only 1 job. #486 #495
  • [ENHANCEMENT] Reduced memory allocations by user.ExtractFromGRPCRequest(). #502
  • [ENHANCEMENT] Add native histogram version of some metrics: #501
    • gate_duration_seconds
    • kv_request_duration_seconds
    • operation_duration_seconds
  • [ENHANCEMENT] Add outcome label to gate_duration_seconds metric. Possible values are rejected_canceled, rejected_deadline_exceeded, rejected_other, and permitted. #512
  • [ENHANCEMENT] Expose InstancesWithTokensCount and InstancesWithTokensInZoneCount in ring.ReadRing interface. #516
  • [ENHANCEMENT] Middleware: determine route name in a single place, and add middleware.ExtractRouteName() method to allow consuming applications to retrieve the route name. #527
  • [ENHANCEMENT] SpanProfiler: do less work on unsampled traces. #528
  • [ENHANCEMENT] Log Middleware: if the trace is not sampled, log its ID as trace_id_unsampled instead of trace_id. #529
  • [EHNANCEMENT] httpgrpc: httpgrpc Server can now use error message from special HTTP header when converting HTTP response to an error. This is useful when HTTP response body contains binary data that doesn't form valid utf-8 string, otherwise grpc would fail to marshal returned error. #531
  • [ENHANCEMENT] memberlist: use separate queue for broadcast messages that are result of local updates, and prioritize locally-generated messages when sending broadcasts. On stopping, only wait for queue with locally-generated messages to be empty. #539
  • [ENHANCEMENT] memberlist: Added -<prefix>memberlist.broadcast-timeout-for-local-updates-on-shutdown option to set timeout for sending locally-generated updates on shutdown, instead of previously hardcoded 10s (which is still the default). #539
  • [ENHANCEMENT] tracing: add ExtractTraceSpanID function.
  • [EHNANCEMENT] crypto/tls: Support reloading client certificates #537 #552
  • [ENHANCEMENT] Add read only support for ingesters in the ring and lifecycler. #553 #554 #556 #573 #578 #587
  • [ENHANCEMENT] Added new ring methods to expose number of writable instances with tokens per zone, and overall. #560 #562
  • [ENHANCEMENT] services.FailureWatcher can now be closed, which unregisters all service and manager listeners, and closes channel used to receive errors. #564
  • [ENHANCEMENT] Runtimeconfig: support gzip-compressed files with .gz extension. #571
  • [ENHANCEMENT] grpcclient: Support custom gRPC compressors. #583
  • [ENHANCEMENT] Adapt metrics.SendSumOfGaugesPerTenant to use metrics.MetricOption. #584
  • [ENHANCEMENT] Cache: Add .Add() and .Set() methods to cache clients. #591
  • [ENHANCEMENT] Cache: Add .Advance() methods to mock cache clients for easier testing of TTLs. #601
  • [ENHANCEMENT] Memberlist: Add concurrency to the transport's WriteTo method. #525
  • [ENHANCEMENT] Memberlist: Notifications can now be processed once per interval specified by -memberlist.notify-interval to reduce notify storms in large clusters. #592
  • [ENHANCEMENT] KV: Add MockCountingClient, which wraps the kv.client and can be used in order to count calls at specific functions of the interface. #618
  • [BUGFIX] spanlogger: Support multiple tenant IDs. #59
  • [BUGFIX] Memberlist: fixed corrupted packets when sending compound messages with more than 255 messages or messages bigger than 64KB. #85
  • [BUGFIX] Ring: ring_member_ownership_percent and ring_tokens_owned metrics are not updated on scale down. #109
  • [BUGFIX] Allow in-memory kv-client to support multiple codec #132
  • [BUGFIX] Modules: fix a module waiting for other modules depending on it before stopping. #141
  • [BUGFIX] Multi KV: fix panic when using function to modify primary KV store in runtime. #153
  • [BUGFIX] Lifecycler: if the ring backend storage is reset, the instance adds itself back to the ring with an updated registration timestamp set to current time. #165
  • [BUGFIX] Ring: fix bug where hash ring instances may appear unhealthy in the web UI even though they are not. #172
  • [BUGFIX] Lifecycler: if existing ring entry is reused, ring is updated immediately, and not on next heartbeat. #175
  • [BUGFIX] stringslicecsv: handle unmarshalling empty yaml string #206
  • [BUGFIX] Memberlist: retry joining memberlist cluster on startup when no nodes are resolved. #215
  • [BUGFIX] Ring status page: display 100% ownership as "100%", rather than "1e+02%". #231
  • [BUGFIX] Memberlist: fix crash when methods from memberlist.Delegate interface are called on *memberlist.KV before the service is fully started. #244
  • [BUGFIX] runtimeconfig: Fix runtime_config_last_reload_successful metric after recovery from bad config with exact same config hash as before. #262
  • [BUGFIX] Ring status page: fixed the owned tokens percentage value displayed. #282
  • [BUGFIX] Ring: prevent iterating the whole ring when using ExcludedZones. #285
  • [BUGFIX] grpcclient: fix missing . in flag name for initial connection window size flag. #314
  • [BUGFIX] ring.Lifecycler: Handle when previous ring state is leaving and the number of tokens has changed. #79
  • [BUGFIX] memberlist metrics: fix registration of memberlist_client_kv_store_count metric and disable expiration of metrics exposed by memberlist library. #327
  • [BUGFIX] middleware: fix issue where successful gRPC streaming requests are not always captured in metrics. #344
  • [BUGFIX] Ring: DoUntilQuorum and DoUntilQuorumWithoutSuccessfulContextCancellation will now cancel the context for a zone as soon as first failure for that zone is encountered. #364
  • [BUGFIX] MetricFamiliesPerTenant.SendMinOfGauges will no longer return 0 if some tenant doesn't have the gauge in their registry, and other tenants have positive values only. #368
  • [BUGFIX] MetricFamiliesPerTenant.SendMaxOfGauges will no longer return 0 if some tenant doesn't have the gauge in their registry, and other tenants have negative values only. #368
  • [BUGFIX] MetricFamiliesPerTenant.SendMaxOfGaugesPerTenant no longer includes tenants, which do not have specified gauge. #368
  • [BUGFIX] Correctly format host:port addresses when using IPv6. #388
  • [BUGFIX] Memberlist's TCP transport will now reject bind addresses that are not IP addresses, such as "localhost", rather than silently converting these to 0.0.0.0 and therefore listening on all addresses. #396
  • [BUGFIX] Ring: use zone-aware logging when all zones are required for quorum. #403
  • [BUGFIX] Services: moduleService could drop stop signals to its underlying service. #409 #417
  • [BUGFIX] ring: don't mark trace spans as failed if DoUntilQuorum terminates due to cancellation. #449
  • [BUGFIX] middleware: fix issue where applications that used the httpgrpc tracing middleware would generate duplicate spans for incoming HTTP requests. #451
  • [BUGFIX] httpgrpc: store headers in canonical form when converting from gRPC to HTTP. #518
  • [BUGFIX] Memcached: Don't truncate sub-second TTLs to 0 which results in them being cached forever. #530
  • [BUGFIX] Cache: initialise the operation_failures_total{reason="connect-timeout"} metric to 0 for each cache operation type on startup. #545
  • [BUGFIX] spanlogger: include correct caller information in log messages logged through a SpanLogger. #547
  • [BUGFIX] Ring: shuffle shard without lookback no longer returns entire ring when shard size >= number of instances. Instead proper subring is computed, with correct number of instances in each zone. Returning entire ring was a bug, and such ring can contain instances that were not supposed to be used, if zones are not balanced. #554 #556
  • [BUGFIX] Memberlist: clone value returned by function used in CAS operation before storing the value into KV store. #586