-
-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kohya_ss flux #5
Comments
@dimonnwc3 I have built a new image for the flux branch. I have not had time to test it, but feel free to give it a try. If we're still having issues with the shared libs I'll update the base image |
thank you I tried to use image with After explicitly defining KOHYA_REF, to make sure I'm on the right branch, I still get same error after starting the training:
|
after some debugging I found out that building custom docker image with arg: also because of the |
Did you ever get this working? Im also trying to use this for flux. If so, would you mind sharing your docker file? |
I ended up building custom image like this |
I was trying to fine-tune
flux
model with thesd3-flux.1
branch, by addingKOHYA_REF=sd3-flux.1
env variable.Although the container starts, training fails immediately with the following error:
Could not load library libnvrtc.so.12. Error: libnvrtc.so.12: cannot open shared object file: No such file or directory
I checked
/usr/lib/x86_64-linux-gnu
directory, wherelibnvrtc.so
andlibnvrtc.so.12
files are missing for some reason.Then I tried to mount volume
x86_64-linux-gnu
from the host, by changing my docker-compose file:Fine tuning starts working, but during startup and training it still shows some errors:
1.
ERROR: ld.so: object 'libtcmalloc.so' from LD_PRELOAD cannot be preloaded (cannot open shared object file): ignored.
This one, happens on startup and multiple times later.
2.
this one happens when fine-tuning starts
Seems my solution by mapping
/usr/lib/x86_64-linux-gnu
is not correct and there has to be another one to fix the original error:Could not load library libnvrtc.so.12. Error: libnvrtc.so.12: cannot open shared object file: No such file or directory
.Any ideas, how to make it work?
The text was updated successfully, but these errors were encountered: