
Cache llm chain to avoid loading the model at every utterance [ENG 437] #12728

Closed
wants to merge 2 commits

Conversation

@varunshankar (Contributor) commented on Aug 13, 2023

Proposed changes:

  • Move the llm factory call into the `__init__` method so the model is not loaded at every utterance (see the sketch below)
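
A minimal sketch of the idea, not the actual Rasa internals changed here: the names `llm_factory`, `DummyChain`, and `LLMComponent` below are illustrative stand-ins. The point is structural: the chain is constructed once in `__init__` and reused for every message, instead of being rebuilt (and the model reloaded) on each utterance.

```python
class DummyChain:
    """Stand-in for an LLM chain; a real one would wrap a loaded model."""

    def run(self, text: str) -> str:
        return f"response to: {text}"


def llm_factory(config: dict) -> DummyChain:
    """Hypothetical factory. In practice this is where the expensive model
    loading happens, so it should be called as rarely as possible."""
    return DummyChain()


class LLMComponent:
    def __init__(self, config: dict) -> None:
        # Build the chain once at component construction time.
        self._chain = llm_factory(config)

    def process(self, utterance: str) -> str:
        # Reuse the cached chain; no per-utterance model loading.
        return self._chain.run(utterance)


if __name__ == "__main__":
    component = LLMComponent({"model": "some-model"})
    print(component.process("hello"))
    print(component.process("how are you?"))  # same chain instance, no reload
```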

Status (please check what you already did):

  • added some tests for the functionality
  • updated the documentation
  • updated the changelog (please check changelog for instructions)
  • reformat files using black (please check Readme for instructions)

@varunshankar varunshankar requested a review from a team as a code owner August 13, 2023 18:12
@varunshankar varunshankar self-assigned this Aug 13, 2023
@tmbo tmbo closed this Aug 18, 2023
@varunshankar varunshankar deleted the ENG-437-cache-llm-chain branch October 27, 2023 21:56