Task freeze #5

Open
damienlaine opened this issue Nov 25, 2024 · 2 comments

Comments

@damienlaine
Member

If for some reason the LLM service fails to respond, jobs might get stuck forever.

See logs:

linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | 25/11/2024 15:46:13 http_server INFO: Task 0f78d8f8-38b9-4623-a227-98d19bc01527 queued
linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | 25/11/2024 15:46:13 http_server INFO: Task 0f78d8f8-38b9-4623-a227-98d19bc01527 processing started
linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | 25/11/2024 15:46:13 backend INFO: Loading prompt for service: summary
linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | 25/11/2024 15:46:13 backend INFO: Prompt fields: 2
linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | 25/11/2024 15:46:13 backend INFO: Setting up backend with params: {'name': 'llama3', 'modelName': 'casperhansen/llama-3-8b-instruct-awq', 'totalContextLength': 8192, 'maxGenerationLength': 2048, 'tokenizerClass': 'LlamaTokenizer', 'createNewTurnAfter': 300, 'summaryTurns': 2, 'maxNewTurns': 10, 'temperature': 0.1, 'top_p': 0.8} for task: 0f78d8f8-38b9-4623-a227-98d19bc01527
linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | You are using the default legacy behaviour of the <class 'transformers.models.llama.tokenization_llama_fast.LlamaTokenizerFast'>. This is expected, and simply means that the `legacy` (previous) behavior will be used so nothing changes for you. If you want to use the new behaviour, set `legacy=False`. This should only be set if you understand what it means, and thoroughly read the reason why this was added as explained in https://github.com/huggingface/transformers/pull/24565 - if you loaded a llama tokenizer from a GGUF file you can ignore this message.
linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | 25/11/2024 15:46:15 backend ERROR: Error publishing: Connection error.
linto_llm-gateway.1.1qeqpejy2j4n@linagora-linto-bm-02    | 25/11/2024 15:46:15 http_server ERROR: An error occurred in processing tasks : cannot unpack non-iterable NoneType object
@damienlaine
Member Author

@htagourti

@htagourti htagourti self-assigned this Nov 27, 2024
@htagourti

htagourti commented Nov 27, 2024

Fixed in the "houssem_llm_backend" branch.
When an exception is raised in the backend, the Celery task now fails instead of blocking the queue.
PR coming soon.
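
For readers following along, here is a minimal sketch of the failure mode the logs suggest and of the kind of fix described above. It is not the code from the branch: every name in it (`BackendError`, `publish_swallowing_errors`, `publish_raising`, `run_task`, `failing_llm_call`) is hypothetical, and plain Python stands in for the Celery task. The assumption is that the backend logged the connection error and returned nothing, so the caller failed while unpacking `None`.

```python
class BackendError(RuntimeError):
    """Hypothetical exception raised when the LLM backend call fails."""


def failing_llm_call():
    # Stand-in for the real LLM request; always fails like in the logs.
    raise ConnectionError("Connection error.")


def publish_swallowing_errors(call_llm):
    # Assumed original behaviour: the error is only logged, so the
    # function implicitly returns None.
    try:
        return call_llm()
    except ConnectionError as exc:
        print(f"Error publishing: {exc}")


def publish_raising(call_llm):
    # Sketch of the fix: propagate the failure to the caller.
    try:
        return call_llm()
    except ConnectionError as exc:
        raise BackendError(f"LLM backend unreachable: {exc}") from exc


def run_task(publish):
    # Stand-in for the task body: a backend failure ends in an explicit
    # FAILURE state rather than leaving the task unresolved.
    try:
        summary, _token_count = publish(failing_llm_call)
    except BackendError as exc:
        return ("FAILURE", str(exc))
    return ("SUCCESS", summary)
```

With `publish_swallowing_errors`, `run_task` raises `TypeError: cannot unpack non-iterable NoneType object`, matching the last log line. With `publish_raising`, it returns a `FAILURE` state that a worker can report, so the queue can move on.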
