2.15.0-rc0
Pre-release
Major Features and Improvements
- No major features or improvements.
Breaking Changes
- No breaking changes.
Bug Fixes and Other Changes
- Move model server TFRT integration code to OSS (commit: 50ebab4)
- Add an option to override the size of GPU system (commit: 445a87b)
- This cl is causing test failures and we are rolling it back. (commit: a39289b)
- Default signature_method_check to false (commit: 4711a8d)
- Add an option to propagate current Context in periodic functions from AspiredVersionsManager. (commit: e4a8a87)
- Refactor Servable::PredictStreamed so that implementations can support bidirectional streaming if needed (commit: a8c3ea6)
- Create Kotlin proto library for the TensorFlow protos. (commit: cae3164)
- Create and use Kotlin proto targets for model.proto and predict.proto (commit: ea9529e)
- Add release notes for tf-serving 2.13.1 (commit: 45fae91)
- Resubmit to move model server TFRT integration code to OSS (commit: eb5b3a5)
- Enable BF16 Automatic Mixed Precision (commit: 970c630)
- Follow expected format (commit: 60a3d73)
- Remove upper_cost_threshold in TFRT serving (commit: 7f8d9d7)
- Build tensorflow_model_server with -rdynamic (commit: fc89240)
- Add peak memory resource kind. (commit: 96e0661)
- Fix typo (commit: c0b35c7)
- Update warmup documentation (commit: 90148d7)
- Implement Freeze() in pathways/tfrt serving. (commit: 0117fd4)
- This CL is a no-op (commit: b75349d)
- OSS remote_op_config_rewriter.proto (commit: ba47377)
- Add release notes for tf-serving 2.14.0-rc0 (commit: 4d5ecfd)
- Add flags for gpu multi-streaming support. (commit: 77cabde)
- Add release notes for tf-serving 2.14.0-rc1 (commit: a3023de)
- Add 3 new resource kinds constants for GPU. (commit: 6b6dea3)
- Add a flag to turn off automatic TPU system initialization on startup. (commit: f83bc0c)
- Add release notes for tf-serving 2.14.0 (commit: 60976ef)
- Annotate which model is missing inputs. (commit: c99b18b)
- ebpf-transport-monitoring adding dependency on net_http. (commit: 152ef4e)
- Add release notes for tf-serving 2.14.1 (commit: 83d9709)
- OSS saved_model_config library, removes saved_model_config_stub/impl, moves GraphRewriter related API from session_bundle_util to graph_rewriter.h. (commit: 7356bbd)
- No-op. (commit: 9d02d89)
- Upgrading Bazel version from 6.1.0 to 6.4.0 (commit: 34521dc)
- Set xnn_enable_avxvnni=false in .bazelrc (commit: 4aed749)
- Add cuda-nvml-dev-11-8 to Dockerfile.gpu (commit: b2def71)
- Revert problem with incorrect Dart build rules and targets. (commit: b6bccce)
- Add cuda-nvml-dev-11-8 to Dockerfile.devel-gpu (and remove from Dockerfile.gpu) (commit: 028aac5)
- OSS tfrt_http_api_handler*. (commit: 8ded4ce)
- Added FileAcl to tsl::FileSystem. (commit: d6c0917)
- Remove metadata size check in GetModelMetadata method in order to be consistent with other servable impl. (commit: a635552)
- Replace the global registration with a registration class so that when we move server_init_internal to OSS we won't run into undetermined global registration sequence issue. (commit: 21d8f88)
- Move TPU runner init stub to tensorflow serving OSS directory. (commit: 2b9e58c)
- Add util function to verify whether an override resource has a subset of the device kinds of the base resource. This is not used by OSS. (commit: 06ff18d)
- Add streaming options for predict request. (commit: 8ccd8a5)
- Define how tensors will be split for SPLIT streamed requests. (commit: b581572)
- Add a client_id field for custom servables. (commit: eb57852)
- Add option to configure the name of the input layer of remote model. (commit: f1e1341)
- Added gRPC reflection service to the serving binary. (commit: c140e01)
- Add the option to enable gRPC health checking to model_server. This is useful for clients that want to use health checking with load-balancing channels (otherwise we get errors on the client side). The current implementation is trivial: once we open our serving port, we assume we will always be healthy, but users may want to tweak this, especially if they need a mandated version, etc. (commit: a9a8e7b)
- Automated Code Change (commit: f761fc7)
- Update description of model versioning. (commit: d820234)
- Exported FindMetaGraphDef function. (commit: 0df0975)
- Automated Code Change (commit: 27923d3)
- Automated Code Change (commit: 704e250)
- If accepting_requests_ is not set Terminate() returns without doing anything. (commit: c45fe14)
- Automated Code Change (commit: fce1804)
- Modify PredictStreamed to return a response or an error. (commit: 5b5d30f)
- Add support to use a MockServable in MockServerCore. (commit: 5b6e0b6)
- Fix OSS cpu build. (commit: 72acbaf)
- Adds functionality to send TSL metrics over model_service RPC. (commit: 9564ef6)
- Add a method in tensorflow::serving::Servable to indicate whether a servable is critical. (commit: 5c0299e)
- Upgrade to CUDA 12.2 and CuDNN 8.9.4 (commit: f82600a)
- Fixes tensorflow_serving continuous build. (commit: a99fb9c)
- Add headers. (commit: fab7271)
- Remove the criticality field in the BatchingSessionTask. (commit: b8663d0)
- Move GPU Docker build to clang. (commit: 611c5a9)
- Updated Dockerfile.devel-gpu to run setup.sources.sh from repo. (commit: d3102f0)
- Add an interface for all Servables that support paging. (commit: e4716e5)
- Update cuda libraries to match TF (commit: 45446cf)
- Match libraries with Dockerfile.devel-gpu (commit: f6ef270)
- Update version for 2.15.0-rc0 release. (#2209) (commit: 73ba2b9)
- Resolve breakages for 2.15 release. (commit: 3181292)