Feat: Chunking large inputs #28

rihp · 2023-07-14T10:52:47Z

There are some cases where the result of an invocation is very long and exceeds the token window limit.

In those cases we can implement some type of chunking mechanism that processes the input in batches that the LLM can handle.

For reference, see this issue, and specifically this comment.

The idea would be to apply the following new steps when sending a message to the agent:

Create an env variable to set the chunk size.
When the agent is about to process any input, check the length of the input in tokens with tiktoken.
If the input is larger than the context window, split it into chunks of the length set in the env config.
Process the first chunk and save the output in a variable.
Process the next chunk and append the result to that variable, updating its contents with the new information.
Repeat until all the chunks have been processed.
Return the combination of all the responses to the chunks.
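
As a rough illustration of the steps above, here is a minimal sketch. It is not the code from the PolyGPT repo: every name in it (`CHUNK_SIZE_TOKENS`, `readChunkSize`, `chunkByTokens`, `processInChunks`, `ask`) is hypothetical, and the tokenizer is a dependency-free stand-in that treats each character as one token. A real version would plug a tiktoken-backed encoder into the same `Tokenizer` shape.

```typescript
// Hypothetical sketch; none of these names come from the PolyGPT codebase.

interface Tokenizer {
  encode(text: string): number[];
  decode(tokens: number[]): string;
}

// Stand-in tokenizer: one token per UTF-16 code unit, so the sketch runs
// without dependencies. Assumption: a tiktoken-backed encoder replaces this.
const charTokenizer: Tokenizer = {
  encode: (text) => text.split("").map((ch) => ch.charCodeAt(0)),
  decode: (tokens) => tokens.map((t) => String.fromCharCode(t)).join(""),
};

// Step 1: read the chunk size from an env-style object (e.g. process.env),
// falling back to a default when the variable is missing or invalid.
function readChunkSize(
  env: Record<string, string | undefined>,
  fallback = 2000
): number {
  const parsed = parseInt(env["CHUNK_SIZE_TOKENS"] ?? "", 10);
  return parsed > 0 ? parsed : fallback; // NaN > 0 is false
}

// Steps 2-3: count tokens and split only when the input exceeds the window.
function chunkByTokens(
  input: string,
  contextWindow: number,
  chunkSize: number,
  tokenizer: Tokenizer = charTokenizer
): string[] {
  if (chunkSize <= 0) throw new Error("chunkSize must be positive");
  const tokens = tokenizer.encode(input);
  if (tokens.length <= contextWindow) return [input];
  const chunks: string[] = [];
  for (let i = 0; i < tokens.length; i += chunkSize) {
    chunks.push(tokenizer.decode(tokens.slice(i, i + chunkSize)));
  }
  return chunks;
}

// Steps 4-7: process the chunks in order, appending each response to the
// running result, and return the combined text. `ask` stands in for the
// call that sends one chunk to the LLM.
async function processInChunks(
  input: string,
  contextWindow: number,
  chunkSize: number,
  ask: (chunk: string) => Promise<string>
): Promise<string> {
  let combined = "";
  for (const chunk of chunkByTokens(input, contextWindow, chunkSize)) {
    const response = await ask(chunk);
    combined = combined === "" ? response : combined + "\n" + response;
  }
  return combined;
}
```

The chunk size should stay below the context window, leaving room for the prompt and the response. Splitting on raw token boundaries can also cut a sentence in half, so a real implementation may prefer to break on line or paragraph boundaries.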

rihp · 2023-07-14T17:56:20Z

This one is being implemented and works as a proof of concept in #29.

rihp · 2023-08-01T09:49:57Z

The summarization module was refactored today. I think we should remove the summarization of permanent messages, as it messes with the initialization prompt, the user goal, and the loaded wraps.

Essentially, that means removing this line or implementing it differently:

PolyGPT/src/chat.ts

Line 111 in 4acd195

await this._summarize("persistent");

rihp mentioned this issue Jul 14, 2023

Feat: Chunking big outputs so that they can be processed in the context window limit #29

Merged

rihp added the enhancement label Jul 31, 2023
