Desgin: The memory system for octogen #165

imotai · 2023-10-19T05:23:32Z

Background and Why

Even GPT-4, a well-performing model, can generate outdated code due to outdated training data. However, LLMs have a strong language understanding ability, which can be used to correct these errors through prompts.

To enable LLM to use command-line tools and libraries that are not covered by the training data. This includes two cases:
- Tools and code repositories don't being included in the training data at all.
- The model is trained on data from outdated tools or libraries, which means that the model cannot use them right
To prevent LLM from repeating the same mistakes. When a LLM uses an incorrect code usage or tool, it will always repeat the incorrect code and then recall the correct result through long-term memory before executing the code. However, this can be improved by using short-term memory to store the correct tool or code usage in the instructions.

Desgin

Other Memory Desgin

https://arxiv.org/pdf/2310.08560.pdf this paper can provide some data proof for the desgin
https://arxiv.org/abs/1909.09436

imotai added octogen-agent-service bash-interpreter python-interpter labels Oct 19, 2023

imotai added this to the v0.5.0 milestone Oct 19, 2023

imotai mentioned this issue Oct 19, 2023

Octogen roadmap for v0.6.0 #110

Open

13 tasks

imotai removed this from the v0.5.0 milestone Oct 19, 2023

imotai mentioned this issue Oct 31, 2023

Add short memory to the agent #181

Closed

imotai self-assigned this Dec 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Desgin: The memory system for octogen #165

Desgin: The memory system for octogen #165

imotai commented Oct 19, 2023 •

edited

Loading

Desgin: The memory system for octogen #165

Desgin: The memory system for octogen #165

Comments

imotai commented Oct 19, 2023 • edited Loading

Background and Why

Desgin

Other Memory Desgin

imotai commented Oct 19, 2023 •

edited

Loading