Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trying to run gpu-burn without the nvidia drivers loaded doesn't immediately exit with a non-zero code, and console is flooded with "terminate called after..." error messages #60

Open
bladernr opened this issue Jul 8, 2022 · 1 comment

Comments

@bladernr
Copy link

bladernr commented Jul 8, 2022

$ ./gpu_burn 30
Burning for 30 seconds.
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >terminate called after throwing an instance of
'terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator

'
terminate called after throwing an instance of 'terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator
'
terminate called after throwing an instance of 'terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator
'
std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator
std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
'
std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator >'
terminate called after throwing an instance of 'terminate called after throwing an instance of 'terminate called after throwing an instance of 'std::__cxx11::basic_string<ch
ar, std::char_traits, std::allocator >'
std::__cxx11::basic_string<char, std::char_traits, std::allocator >' terminate called after throwing an instance of 'terminate called after throwing an instance of 'std::__cxx11::basic_string<char, std::char_traits, std::allocator
'

I eventually had to CTRL-C out of this. It's on Ubuntu 22.04 with the latest gpu_burn source and cuda toolkit installed. I'm doing some bug testing of a wrapper I am using, when I hit this.

@jiaolovekt
Copy link

I've encountered same error with ubuntu2204, cuda11.8/12.1.
After debugging, it seems that GPU cannot be initialized and the cuInit(0) returns 999.
And soon I realized that this may be cause by the built in NVIDIA drivers.
I tried apt install -y nvidia-cuda-toolkit nvidia-modprobe and nvidia-modprobe -u. Then update the Makefile's CUDAPATH and NVCC path. And It works.
Hope this will help a little.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants