Added Semaphore for restricting amount of concurrent LLM calls #1043

KolodziejczykWaldemar · 2024-09-30T20:01:11Z

This PR introduces a semaphore-based approach to limit the number of concurrent async tasks when executing test cases. The main changes include:

Use of asyncio.Semaphore to cap how many test-case tasks run at the same time.
Modification of the main execution loop to use the new semaphore-controlled function.
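
For readers unfamiliar with the pattern, here is a minimal sketch of semaphore-limited execution. It is not the code from this PR's diff: `run_with_limit`, `guarded`, `fake_test_case`, and the `max_concurrent` parameter are hypothetical names chosen for illustration, and the sleep stands in for an awaited LLM call.

```python
import asyncio


async def run_with_limit(coros, max_concurrent=10):
    """Run coroutines concurrently, but at most max_concurrent at a time."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def guarded(coro):
        # Each task waits here until one of the max_concurrent slots is free.
        async with semaphore:
            return await coro

    # asyncio.gather returns results in the same order as its inputs.
    return await asyncio.gather(*(guarded(c) for c in coros))


async def fake_test_case(i):
    # Hypothetical stand-in for a test case that awaits an LLM call.
    await asyncio.sleep(0.01)
    return i * 2


results = asyncio.run(
    run_with_limit([fake_test_case(i) for i in range(25)], max_concurrent=10)
)
```

All 25 tasks are scheduled up front, but only 10 are inside the `async with` block at any moment; the rest wait for a slot to be released.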

Impact on LLM Requests
This change caps the number of simultaneous requests made to the large language model (LLM) API. By limiting concurrent tasks to a configurable number (default: 10), we get:

Better management of API rate limits.
Reduced risk of encountering "too many requests" errors.
More consistent and predictable request patterns.
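
To illustrate the effect on "too many requests" errors, the sketch below runs the same workload with and without a semaphore against a simulated API. Everything here is an assumption made for the demonstration: `FakeRateLimitedAPI` is a hypothetical stand-in (not a real client) that rejects a call whenever more than `max_in_flight` calls are active, and `run_all` is an illustrative helper, not a function from this repository.

```python
import asyncio


class FakeRateLimitedAPI:
    """Hypothetical stand-in for an LLM API that rejects excess concurrent calls."""

    def __init__(self, max_in_flight):
        self.max_in_flight = max_in_flight
        self.in_flight = 0
        self.peak = 0  # highest number of simultaneous calls observed

    async def call(self, prompt):
        self.in_flight += 1
        self.peak = max(self.peak, self.in_flight)
        try:
            if self.in_flight > self.max_in_flight:
                raise RuntimeError("429: too many requests")
            await asyncio.sleep(0.01)  # simulated network latency
            return f"echo: {prompt}"
        finally:
            self.in_flight -= 1


async def run_all(api, prompts, max_concurrent=None):
    # With max_concurrent=None, every call is fired at once (no limiting).
    semaphore = asyncio.Semaphore(max_concurrent) if max_concurrent else None

    async def one(prompt):
        if semaphore is None:
            return await api.call(prompt)
        async with semaphore:
            return await api.call(prompt)

    # return_exceptions=True collects failures instead of aborting the batch.
    return await asyncio.gather(*(one(p) for p in prompts), return_exceptions=True)


prompts = [f"case {i}" for i in range(30)]

unlimited_api = FakeRateLimitedAPI(max_in_flight=10)
unlimited = asyncio.run(run_all(unlimited_api, prompts))
unlimited_errors = sum(isinstance(r, Exception) for r in unlimited)

limited_api = FakeRateLimitedAPI(max_in_flight=10)
limited = asyncio.run(run_all(limited_api, prompts, max_concurrent=10))
limited_errors = sum(isinstance(r, Exception) for r in limited)
```

In the unlimited run, some calls are rejected because all 30 start together. In the limited run, the semaphore keeps the number of active calls at or below 10, so the simulated API never rejects one.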

penguine-ip · 2024-10-03T09:01:11Z

Extremely useful, thanks! @KolodziejczykWaldemar

Commit 1232862: Added Semaphore for restricting amount of concurrent LLM calls

penguine-ip merged commit 0c50b9b into confident-ai:main Oct 3, 2024
3 of 5 checks passed
