
[General]: Very little RAM is used #154

Closed
hardWorker254 opened this issue Dec 30, 2024 · 1 comment
Labels
question Further information is requested

Comments

@hardWorker254

Why do LLMs like Llama 10B use very little RAM?
Before loading:
(screenshot of AIDA64 memory usage)

After loading:
(screenshot of AIDA64 memory usage)

What about adding the ability to attach files to a prompt? Maybe accept `file=path/to/file` after the prompt, or add a dedicated button? The attached content could be rendered as:
[begin of something.txt file]
Content of something.txt
...
...
[end of something.txt file]

@hardWorker254 hardWorker254 added the question Further information is requested label Dec 30, 2024
@a-ghorbani
Owner

Please ensure that auto offload/load is disabled in the app (on the settings page). Otherwise, the model will be offloaded when the app goes into the background. I am going to close this issue since it is not a direct app issue, but if the model still doesn't work, feel free to reopen it.

As for file attachment, it is not planned yet. RAG is challenging to implement on-device, unless we can fit the entire text in the context.
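The "fit the entire text in the context" condition could be checked with a rough heuristic before attempting to attach a file. This is a sketch under assumed values: the ~4 characters per token ratio is a common English-text rule of thumb, and a real check would use the model's actual tokenizer and context size.

```python
def fits_in_context(text: str, context_tokens: int = 4096,
                    chars_per_token: float = 4.0) -> bool:
    """Estimate the token count of `text` from its character length
    (~4 chars/token is a rough English-text heuristic) and compare
    it against the model's context window. Hypothetical sketch;
    an exact check requires the model's tokenizer."""
    estimated_tokens = len(text) / chars_per_token
    return estimated_tokens <= context_tokens
```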
