Abstraction that allows us to develop different agents, frontend, backend, and evaluation in parallel #68

xingyaoww · 2024-03-20T16:11:54Z

As discuss here, i suggests that we can:

Come up with an Agent abstraction (with every necessary method for Devin) where everyone agrees on
Then we can implement different Agent under agenthub/ so we can leverage our potential large pool of contributors to experiment and explore different ideas
While frontend and backend folks can stick with the same Agent abstraction, and when MVP implementation is completed, the same front- and back-end system can easily switch between multiple different Agent implementations

I took the first (not perfect) step towards this goal - I draft the first version of Agent abstraction, merge @rbren's agent under research/langchains_agent (pls let me know if you prefer a different name! :P ), subclass the general Agent abstraction, and make the original agents/main.py more general (you can just change a command line argument to switch between completely different agent implementation!)

And i made sure the code works now for @rbren's agent:

research/langchains_agent/build-and-run.sh

produces outputs:

STEP 0
run {'command': 'ls'}
---
output {'output': ''}
==============
STEP 1
write {'path': 'hello_world.sh', 'contents': "#!/bin/bash\necho 'hello world'"}
---
output {'output': ''}
==============
STEP 2
run {'command': 'chmod +x hello_world.sh'}
---
output {'output': ''}
==============
STEP 3
run {'command': './hello_world.sh'}
---
output {'output': 'hello world\n'}
==============
STEP 4
Done!

TODOs after this PR is merged:

Make
@rbren Is research/langchains_agent/regression supposed to test whether an agent can complete the tasks? If so, I'd suggest maybe we can move it into a sub-folder of evaluation and add a TODO that writes actual test cases these to test any Agent implementation against these regression tests - what do you think?
Remove the Dockerfile for langchains_agent once Minimal Docker Sandbox with GPT-3.5 Execution Example #48 is merged (no big difference) and update build-and-run.sh correspondingly
Merge requirements.txt with the whole project requirements.txt (or maybe not since these will be installed inside the docker container?)

huybery · 2024-03-20T16:41:27Z

I think agenthub is more appropriate than research?

rbren · 2024-03-20T18:22:33Z

This is awesome, super helpful for experimentation 🚀

neubig

LGTM too, thanks for this!

…kend, and evaluation in parallel (All-Hands-AI#68) * move agent to langchains_agent * remove old .env * remove the old agent folder * add preliminary version of Agent abstraction * add preliminary version of the main.py * merge controlloop and main into a Agent class * add init * fix json import * fix missing arg * get langchains_agent working after abstraction * rename `research` to `agenthub` * rename: rename research to agenthub --------- Co-authored-by: huybery <huybery@gmail.com>

xingyaoww added 10 commits March 20, 2024 10:22

move agent to langchains_agent

66d9144

remove old .env

1ff38c7

remove the old agent folder

e5d98ac

add preliminary version of Agent abstraction

1d62e54

add preliminary version of the main.py

e8f0896

merge controlloop and main into a Agent class

d97ec1d

add init

2d4d4fe

fix json import

806ff2e

fix missing arg

9d45dc2

get langchains_agent working after abstraction

23308f6

huybery requested a review from neubig March 20, 2024 16:23

huybery self-requested a review March 20, 2024 16:41

rename research to agenthub

e204086

rename: rename research to agenthub

8c2a76c

huybery force-pushed the abstraction branch from 6e08244 to 8c2a76c Compare March 20, 2024 18:25

huybery approved these changes Mar 20, 2024

View reviewed changes

neubig approved these changes Mar 20, 2024

View reviewed changes

neubig merged commit 0380070 into All-Hands-AI:main Mar 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Abstraction that allows us to develop different agents, frontend, backend, and evaluation in parallel #68

Abstraction that allows us to develop different agents, frontend, backend, and evaluation in parallel #68

Abstraction that allows us to develop different agents, frontend, backend, and evaluation in parallel #68

Abstraction that allows us to develop different agents, frontend, backend, and evaluation in parallel #68

Conversation

Choose a reason for hiding this comment