Design: The memory system for octogen #165

Open
Tracked by #110
imotai opened this issue Oct 19, 2023 · 0 comments


Background and Why

Even a strong model such as GPT-4 can generate outdated code because its training data is outdated. However, LLMs have strong language understanding, so these errors can be corrected through prompts. The memory system has two goals:

  1. To enable the LLM to use command-line tools and libraries that are not covered by its training data. This includes two cases:
    • Tools and code repositories that are not included in the training data at all.
    • Tools or libraries for which the model was trained on outdated data, so the model cannot use them correctly.
  2. To prevent the LLM from repeating the same mistakes. When an LLM gets a tool or code usage wrong, it keeps regenerating the incorrect code and only then recalls the correct result from long-term memory before executing the code. This can be improved by using short-term memory to store the correct tool or code usage in the instructions.
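
The second goal can be illustrated with a minimal sketch. This is not octogen's implementation: `UsageNote`, `ShortTermMemory`, and `build_instructions` are hypothetical names, and the capacity limit and the prompt wording are assumptions made for illustration. The idea is only that a corrected usage is stored once and then injected into the instructions on every later request, so the model does not start from the wrong usage again.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class UsageNote:
    """One corrected usage (hypothetical structure, not from octogen)."""
    tool: str     # tool or library name
    wrong: str    # usage the model produced that failed
    correct: str  # usage that worked


@dataclass
class ShortTermMemory:
    """Keeps the most recent corrections, one per tool."""
    capacity: int = 20  # assumed limit to keep the prompt small
    notes: Dict[str, UsageNote] = field(default_factory=dict)

    def remember(self, tool: str, wrong: str, correct: str) -> None:
        # Re-inserting moves the tool to the most recent position.
        self.notes.pop(tool, None)
        self.notes[tool] = UsageNote(tool, wrong, correct)
        # Evict the oldest entries once the capacity is exceeded.
        while len(self.notes) > self.capacity:
            oldest = next(iter(self.notes))
            del self.notes[oldest]

    def render(self) -> str:
        # Turn the stored corrections into text for the instructions.
        if not self.notes:
            return ""
        lines = ["Known corrections (prefer these usages):"]
        for note in self.notes.values():
            lines.append(
                f"- {note.tool}: use `{note.correct}` instead of `{note.wrong}`"
            )
        return "\n".join(lines)


def build_instructions(base: str, memory: ShortTermMemory) -> str:
    # Append the corrections to the base instructions, if there are any.
    extra = memory.render()
    return base if not extra else f"{base}\n\n{extra}"


memory = ShortTermMemory()
# DataFrame.append was removed in pandas 2.0; pd.concat is the replacement.
memory.remember("pandas", "df.append(row)", "pd.concat([df, row])")
print(build_instructions("You are a coding assistant.", memory))
```

Keeping one note per tool and evicting the oldest entry is just one possible policy; the point is that the correction lives in the prompt rather than being rediscovered on each request.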

Design

[Image: design diagram]

Other Memory Designs

  1. https://arxiv.org/pdf/2310.08560.pdf (this paper can provide some supporting data for the design)
  2. https://arxiv.org/abs/1909.09436