Coder Agent

This is an experimental project to create a coder agent which uses LLMs to write Python code.

Procedure

1. Ask the user for a problem specification.
2. Use the "programmer agent" to write one or more Python functions that solve the problem. It is important that the functions don't perform any side effects, such as printing or writing to the file system, because side effects are hard to test. The system prompt for the agent includes instructions to add type annotations and write docstrings for all functions.
3. Type check the code in the following steps:
   a) Run the code through the static type checker Mypy. If it passes type checking, we are done with this step; otherwise go to step b).
   b) Pass the programmer agent a prompt with the original code, the problem specification from the user, and the type errors, and ask it to fix the type errors.
   c) If the maximum number of retries has not been reached, go back to step a); otherwise exit.
4. Ask the "test designer agent" to write tests for the functions:
   a) Generate a so-called stub file with Stubgen. A stub file contains all function definitions with their type annotations and docstrings, but without the function bodies.
   b) Feed the problem specification and the stub file to the test designer agent and ask it to write tests for all functions using unittest.
5. Merge the code and the tests into the same file and type check it in the same way as in step 3.
6. Run the tests. If they succeed, we're done; otherwise go to step 7.
7. Feed the code (but not the tests) and the test errors to the programmer agent and ask it to update the code so that it passes the tests.
8. Go back to step 5 and retry with the updated code. There is a maximum number of retries here as well.
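
Both the type-checking loop and the test loop above follow the same check, fix, retry pattern. The following is a minimal sketch of that pattern, not the project's actual implementation. The names repair_loop, check and fix are hypothetical: check stands in for running Mypy or the tests and returning the error text, and fix stands in for a call to the programmer agent.

```python
from typing import Callable


def repair_loop(
    code: str,
    check: Callable[[str], str],
    fix: Callable[[str, str], str],
    max_retries: int = 3,
) -> tuple[str, bool]:
    """Check `code` and ask `fix` to repair it until it passes or retries run out.

    `check` returns an error report; an empty string means success.
    `fix` takes the current code and the error report and returns new code.
    Both are hypothetical stand-ins for the checker and the programmer agent.
    """
    for _ in range(max_retries):
        errors = check(code)
        if not errors:
            return code, True
        code = fix(code, errors)
    # Out of retries: report whether the last fix happened to succeed.
    return code, not check(code)
```

With this shape, the same loop could serve steps 3 and 5 to 8 by swapping in a different check function, while the retry limit stays a single parameter.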

Running

You need Python 3.10 or later and Poetry installed. You can then run the following commands:

poetry install
poetry run coder_agent

You may specify arguments such as --backend, --temperature, --retries and --no-interactive on the command line. For the full list, run:

poetry run coder_agent --help

Example Queries

Write a function which sums two numbers.
Write a function which computes the factorial of an integer.
Write a function which computes the squared factorial of an integer.
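
To give a feel for the kind of result the procedure aims for, here is a hand-written illustration for the factorial query. It is not actual output from the tool. It only shows the properties described above: a side-effect-free function with type annotations and a docstring, plus unittest tests for it. The error handling for negative input is an assumption about how the specification might be interpreted.

```python
import unittest


def factorial(n: int) -> int:
    """Return n! for a non-negative integer n.

    Raises ValueError if n is negative.
    """
    if n < 0:
        raise ValueError("n must be non-negative")
    result = 1
    for i in range(2, n + 1):
        result *= i
    return result


class TestFactorial(unittest.TestCase):
    """Tests of the kind the test designer agent is asked to write."""

    def test_small_values(self) -> None:
        self.assertEqual(factorial(0), 1)
        self.assertEqual(factorial(5), 120)

    def test_negative_raises(self) -> None:
        with self.assertRaises(ValueError):
            factorial(-1)
```

Because the function neither prints nor touches the file system, the tests can check it purely through its return value and raised exceptions.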

Limitations

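One limitation is that the tests are never revised once they have been written. If a test contains a bug, the programmer agent, which only sees the test errors and not the tests themselves, cannot correct it, so no change to the code will make that test pass.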
