Skip to content

Refactor turbomind attention by precomputing rotary embed #2395

Refactor turbomind attention by precomputing rotary embed

Refactor turbomind attention by precomputing rotary embed #2395

Triggered via pull request December 2, 2024 13:06
@irexycirexyc
synchronize #2801
irexyc:rope
Status Success
Total duration 32m 29s
Artifacts

pr_ete_test.yml

on: pull_request
pr_functions_test
32m 14s
pr_functions_test
Fit to window
Zoom out
Zoom in