What's Changed
- feat: info route by @lorenzomassimiani in #368
- Fix CI: async tests with ASGITransport by @aniketmaurya in #376
- Fix: Replace Deprecated `max_tokens` with `max_completion_tokens` in OpenAI Spec by @rittik9 in #375
- feat: Customizable Loops 1/n by @aniketmaurya in #374
- customizable loop - wire up Loops to LitServer 2/n by @aniketmaurya in #378
- Improve CI: retry flaky tests by @aniketmaurya in #379
- [pre-commit.ci] pre-commit suggestions by @pre-commit-ci in #377
- check device format while initialising litserver by @ali-alshaar7 in #380
- Release 0.2.6.dev0 by @aniketmaurya in #383
- Update PR template by @rittik9 in #381
- Include user field and `base64` literal for encoding_format by @aniketmaurya in #388
- Improve error handling and debugging experience by @aniketmaurya in #389
- improved logging with sensible defaults by @aniketmaurya in #391
- add continuous batching loop 1/n by @aniketmaurya in #387
- Add `loop.pre_setup` to allow fine-grained LitAPI validation based on inference loop by @aniketmaurya in #393
- Make `LitAPI.predict` optional and validate API implementation by @aniketmaurya in #394
- Fix OpenAISpec with continuous batching loop by @aniketmaurya in #395
- add tests for continuous batching and Default loops by @aniketmaurya in #396
- Set LitServer.stream using LitSpec.stream by @aniketmaurya in #398
- fix openai usage info for non-streaming response by @aniketmaurya in #399
- [pre-commit.ci] pre-commit suggestions by @pre-commit-ci in #400
- Async continuous batching loop by @aniketmaurya in #401
- add validation for `stream=False` with `yield` usage by @aniketmaurya in #402
- fix callback runner to execute after predict by @aniketmaurya in #406
- integrate zmq by @aniketmaurya in #403
- warn users when predict/unbatch output length is not same as #requests by @aniketmaurya in #408
- move built in loops inside classes by @aniketmaurya in #409
- add justus and thomas as codeowners by @aniketmaurya in #410
- enable multiple workers for ZMQ by @aniketmaurya in #411
- Fix: Add Callback Events and Align Hooks in Streaming Loop by @bhimrazy in #407
- bump: `Lightning-AI/utilities` used `main` by @Borda in #415
- Release v0.2.6 by @aniketmaurya in #413
- fix: don't start zmq when fast_queue=false by @aniketmaurya in #417
- fix release ci by @aniketmaurya in #416
New Contributors
- @rittik9 made their first contribution in #375
- @ali-alshaar7 made their first contribution in #380
Full Changelog: v0.2.5...v0.2.6