
[Bug] LLMParams and LLMParamsDoc Pydantic Model Error #1078

Open
jjmaturino opened this issue Dec 26, 2024 · 2 comments
Assignees
Labels
bug Something isn't working

Comments


jjmaturino commented Dec 26, 2024

Priority

Undecided

OS type

Ubuntu

Hardware type

Xeon-GNR

Installation method

  • Pull docker images from hub.docker.com
  • Build docker images from source

Deploy method

  • Docker compose
  • Docker
  • Kubernetes
  • Helm

Running nodes

Single Node

What's the version?

N/A

Description

TL;DR: The Pydantic models accept a `streaming` parameter instead of the `stream` parameter defined by the TGI API spec.

There is a discrepancy between the TGI interface standard and the Pydantic models defined in this repo: the models name the `stream` parameter `streaming`.

As a result, the JSON that is accepted (marshaled and unmarshaled) expects `streaming` as the JSON key rather than the TGI-standard `stream`.
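The mismatch can be reproduced with a minimal sketch (assuming Pydantic v2, whose default behavior is to silently ignore unknown keys; the model below is a simplified stand-in for the repo's LLMParams, not its actual definition):

```python
from pydantic import BaseModel


class LLMParams(BaseModel):  # simplified sketch, not the repo's full model
    streaming: bool = True


# A TGI-style payload uses the "stream" key. Because the model only knows
# "streaming", the unknown key is dropped and the field silently keeps its
# default instead of the caller's value.
params = LLMParams.model_validate({"stream": False})
print(params.streaming)  # still True: the "stream" key was ignored
```

Under Pydantic v1, or with `extra="forbid"` configured, the same payload would instead fail validation outright rather than being silently ignored.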


Discovered during this PR: while testing, I noticed that when I POSTed to the chat endpoint via curl using the `stream` JSON key, the service would not successfully unmarshal the JSON object.

After talking with @xiguiw and reviewing the TGI documentation, I believe this is an error in the codebase.

This might be a breaking change if fixed.
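One way to fix this without an abrupt break is to rename the field to the TGI-standard `stream` while still accepting the legacy `streaming` key during validation. A minimal sketch, assuming Pydantic v2's `AliasChoices` (this is a suggested migration pattern, not the fix the maintainers actually adopted):

```python
from pydantic import AliasChoices, BaseModel, Field


class LLMParams(BaseModel):  # hypothetical migration sketch
    # Accept the TGI-standard "stream" key, but tolerate the legacy
    # "streaming" key so existing callers keep working.
    stream: bool = Field(
        default=True,
        validation_alias=AliasChoices("stream", "streaming"),
    )


p1 = LLMParams.model_validate({"stream": False})     # TGI-standard key
p2 = LLMParams.model_validate({"streaming": False})  # legacy key
print(p1.stream, p2.stream)  # False False
```

Serialization would then emit `stream` by default, matching the TGI spec, while the alias keeps old payloads valid during a deprecation window.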

Reproduce steps

In both LLMParams and LLMParamsDoc, the field is declared as:

```python
streaming: bool = True
```


TGI API reference: https://huggingface.github.io/text-generation-inference/

Raw log

No response

Attachments

No response

Collaborator

xiguiw commented Jan 2, 2025

@jjmaturino

Thanks for catching this!
@XinyaoWa will help fix it.
Work is in progress.

@joshuayao
Collaborator

Hi @jjmaturino, the bug was fixed. Could you please help verify it with the latest code? Thanks.

Projects
None yet
Development

No branches or pull requests

4 participants