
Feature request: Add Asynchronous Message Queue Support with Rate Limiting #149

Open
kaustavbecs opened this issue Dec 11, 2024 · 1 comment

@kaustavbecs

Use case

Common enterprise use cases involve spiky workloads that must be handled without violating LLM providers' rate limits. The Multi Agent Orchestrator should be able to:

  1. Store the incoming messages in a queue
  2. Check the classifier to identify top LLMs for the request
  3. Check the rate limits and current consumption
  4. Request the appropriate LLM endpoint
  5. Receive the response and send it back to the requester via WebSockets

Solution/User Experience

The Multi Agent Orchestrator should implement the queue-based flow above end to end: buffer each incoming message in a queue, classify it to identify the top LLMs, check rate limits and current consumption, call the selected LLM endpoint, and return the response to the requester over WebSockets (a sketch follows below).
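
A minimal sketch of that flow in Python with asyncio, assuming one token-bucket limiter per model; `classify`, `call_llm`, `send_ws`, and the message fields (`text`, `connection_id`) are hypothetical stand-ins for the orchestrator's real classifier, LLM client, and WebSocket layer:

```python
import asyncio
import time

class TokenBucket:
    """Token-bucket limiter: refills `rate` tokens per second, holds at most `capacity`."""

    def __init__(self, rate: float, capacity: int):
        self.rate, self.capacity = rate, capacity
        self.tokens, self.last = float(capacity), time.monotonic()

    async def acquire(self) -> None:
        while True:
            now = time.monotonic()
            self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
            self.last = now
            if self.tokens >= 1:
                self.tokens -= 1
                return
            # Not enough headroom: sleep until roughly one token has refilled.
            await asyncio.sleep((1 - self.tokens) / self.rate)


async def worker(queue: asyncio.Queue, classify, call_llm, send_ws, limits: dict) -> None:
    """Drain the queue: classify, wait for rate-limit headroom, call the LLM, reply via WebSocket."""
    while True:
        msg = await queue.get()                        # 1. message was buffered in the queue
        model = classify(msg["text"])                  # 2. classifier picks the best LLM
        await limits[model].acquire()                  # 3. block until within that model's rate limit
        response = await call_llm(model, msg["text"])  # 4. request the appropriate LLM endpoint
        await send_ws(msg["connection_id"], response)  # 5. send the response back over WebSockets
        queue.task_done()
```

Because the worker blocks on the limiter rather than dropping requests, spiky traffic simply accumulates in the queue and drains at the provider's allowed rate.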

Alternative solutions

Frameworks such as LlamaIndex support an async mode, but it targets non-blocking async HTTP I/O only. For a comprehensive, enterprise-grade Multi Agent Orchestrator, we need a custom solution.
@kaustavbecs kaustavbecs changed the title Feature request: Support for async Multi Agent Orchestrator with support for massive scale Feature request: Add Asynchronous Message Queue Support with Rate Limiting Dec 11, 2024
@cornelcroi cornelcroi self-assigned this Dec 12, 2024
@cornelcroi (Contributor)

Hi @kaustavbecs, thank you for the proposal.

To get async behaviour, I think the best way is to handle this per agent (not at the orchestrator level), because an agent is not necessarily an LLM and each agent could have different limits.
To implement this, you could create a custom agent and handle the asynchronous communication with the LLM inside it.
I could see this being implemented in a LambdaAgent, for example, where the Lambda handles the asynchronous communication with the LLM using queues or another mechanism.
It would be interesting for such an agent to become part of the built-in agents.
Happy to review a solution before you start the implementation, if you want to contribute to this repository.
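
A rough sketch of that per-agent idea; the `process_request` signature here is an assumption modeled on how a custom agent subclass might look, not the framework's exact API:

```python
import asyncio

class RateLimitedLLMAgent:
    """Custom agent that enforces its own limit around its LLM calls.

    Assumed shape: a real implementation would subclass the framework's
    Agent base class, whose method signature may differ from this sketch.
    """

    def __init__(self, name: str, call_llm, max_concurrent: int = 2):
        self.name = name
        self._call_llm = call_llm                        # coroutine: (prompt: str) -> str
        self._limit = asyncio.Semaphore(max_concurrent)  # per-agent limit, since limits differ per agent

    async def process_request(self, input_text: str, user_id: str,
                              session_id: str, chat_history: list) -> str:
        # The limit lives inside the agent, so non-LLM agents are unaffected
        # and each LLM-backed agent can carry its own quota.
        async with self._limit:
            return await self._call_llm(input_text)
```

Keeping the limit inside the agent matches the comment above: the orchestrator stays unchanged, and each agent (LLM-backed or not) decides its own queuing and throttling strategy.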
