Add builder for translations GPU images using multiuser generic-worker #142
Do we want L3 only or L1 and L3 versions of this image?
Just L1 for now is fine. (I do expect to be requesting L2 or L3 images in the near future, but I'm not quite ready for them yet, and not sure which we'll end up using.)
Building the following at 71d7783:
Thank you! I'm going to test this out this week!
Hmm, I know we do some EDID tweaks in the ubuntu-jammy-from-community-gui scripts. I wonder if the CUDA install is messing with something. We could try reordering the scripts.
That one is still hitting the same errors :(:
Hm, sounds like the nvidia driver is interacting with Wayland in some way (in a more complex way than I initially thought). We could wait for a g-w multi that doesn't require a GUI (taskcluster/taskcluster#6786, er taskcluster/taskcluster#4595) or we could investigate further. I think pmoore is working on a 2404 config in community (but not sure if that's going to solve the wayland nvidia issues - someone recently made a comment that everyone with a nvidia gpu is still using X11. not sure if that's true).
Do we have any idea on a timeline for the non-GUI multi engine generic worker? (I'd be happy to help guinea pig it in Translations.)
No, no timeline yet. We're going to discuss at our next RelSRE/Taskcluster meeting (I think we're going to try to invite Releng to this next one).
OK! Maybe we can figure out next steps here after that?
Headless is out. We should revisit this.
71d7783 to 9fa8d33
I'm ready to give it another go anytime!
cb325e5 to ebd19b1
I've updated this with the latest Taskcluster version & a fix for the kernel uninstall (the ubuntu version number part changed...).
Getting an error:
The build just seems to hang after that. I'll debug some more. |
0bb51c4 to 3b7e5b4
3b7e5b4 to 31065cf
We're not going to do this; translations will switch to the ubuntu 24.04 images in https://github.com/mozilla-platform-ops/worker-images instead.
Over in mozilla/translations#466 I'm working on adding support for automatic uploads of artifacts with one of our scriptworkers. Using scriptworkers requires enabling chain of trust. While testing this, I discovered that the simple engine doesn't support chain of trust, which means we'll need to move GPU workers to the multiuser engine.
This patch is a first shot at something that might work. I based it on gw_fxci_gcp_l1_gui.yaml with cuda, papertrail, and translations requirements added. I was able to build it with papertrail disabled (I don't have those secrets), but I'm unable to test my own built images properly, so it's difficult to be certain this will work. Feel free to throw this out if there's a different configuration that's preferred.