You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
some attention stats are computed but never used, only the attention scores from the last transformer block (?) are provided to the sampler. Is it just an oversight or am I missing something?
Also, is there a Discord (or similar) to discuss stuff like this? Seems better suited that GH issues.
Thanks!
The text was updated successfully, but these errors were encountered:
I'm currently studying the code to get a better sense of how the method works.
In
entropix/entropix/main.py
Line 86 in e55e9a3
stats
are computed but never used, only the attentionscores
from the last transformer block (?) are provided to the sampler. Is it just an oversight or am I missing something?Also, is there a Discord (or similar) to discuss stuff like this? Seems better suited that GH issues.
Thanks!
The text was updated successfully, but these errors were encountered: