Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pthread] init async gpu -> cpu #49

Merged
merged 8 commits into from
Nov 12, 2024
Merged

[pthread] init async gpu -> cpu #49

merged 8 commits into from
Nov 12, 2024

Conversation

botbw
Copy link
Collaborator

@botbw botbw commented Oct 29, 2024

Add write_tensor, which moves tensors from GPU to CPU when needed:

For aio and uring backend, it does the transfer in blocking manner.

For pthread, it's non blocking and pipelined with file io:

image

@botbw botbw added the enhancement New feature or request label Oct 29, 2024
@botbw botbw requested a review from ver217 October 29, 2024 03:49
@botbw botbw self-assigned this Oct 29, 2024
@botbw botbw force-pushed the gpu2cpu branch 2 times, most recently from 8aa59d9 to 4d19a44 Compare October 29, 2024 08:49
@ver217 ver217 merged commit a1bf816 into main Nov 12, 2024
1 check passed
@ver217 ver217 deleted the gpu2cpu branch November 12, 2024 02:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants