Break code into chunks #90

Yiannis128 · 2023-10-13T13:46:20Z

All LLMs have issues with medium-large code samples in fixing and optimizing the code. A strategy that could be done to improve this is to break the input code into chunks and incrementally give it to the LLM, so that it doesn't have access to the entire code. So for example, a 2000 line code file could be broken into chunks segmented by functions.

juancolonna · 2023-10-13T15:59:58Z

Hi Yiannis, thanks for opening this issue. That's right, from the last table you presented today it is clear that with smaller codes the method works better, so if we find a way to reduce larger code into small verifiable snippets, we will have a win-win strategy. My next concern is: can ESBMC check code snippets?

Yiannis128 · 2023-10-14T01:02:27Z

Hi Yiannis, thanks for opening this issue. That's right, from the last table you presented today it is clear that with smaller codes the method works better, so if we find a way to reduce larger code into small verifiable snippets, we will have a win-win strategy.

While it is worth running the experiment, it is worth mentioning that it could in some cases affect the code in a bad way. For example, let's say we provide a single function every time, then if that function calls another function from the same or a different source file, then the LLM may not know what the function is doing. Perhaps we could avoid this by including additional metadata with each function call describing textually what each function does, please see this example:

int x() {
  int res = y();
}

Could be transformed into:

int x() {
  int res = y(); // Call function y which is supposed to do .........
}

My next concern is: can ESBMC check code snippets?

I am not sure about ESBMC processing code snippets, I personally think that as long as you contain in the source file everything such that the code compiles into an object file, then yes, it would be able to. To rephrase it, if you remove all irrelevant functions and variables, then yes, it would work. I don't think that you could go lower than that to breaking down functions, but perhaps @lucasccordeiro can answer that question?

Yiannis128 added enhancement New feature or request research This issue isn't being worked on, it is being considered, and researched. labels Oct 13, 2023

Yiannis128 added this to the Version 0.4.0 milestone Feb 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Break code into chunks #90

Break code into chunks #90

Yiannis128 commented Oct 13, 2023

juancolonna commented Oct 13, 2023

Yiannis128 commented Oct 14, 2023

Break code into chunks #90

Break code into chunks #90

Comments

Yiannis128 commented Oct 13, 2023

juancolonna commented Oct 13, 2023

Yiannis128 commented Oct 14, 2023