[Discussion] How to train using more than 1 GPU? #275
Comments
I don't think so. TensorFlow's documentation states that it does not place operations on multiple GPUs automatically, and TensorFlow does not easily share graphs or sessions among multiple processes. There are some blog posts discussing this on towardsdatascience.com. I assume you have been able to get training working. What is your set-up?
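For context on what "placing operations on multiple GPUs" has to achieve, here is a minimal, framework-free sketch of synchronous data-parallel training: the batch is split into one shard per device, each replica computes a gradient on its shard, and the gradients are averaged before a single weight update. This is only a conceptual illustration in plain Python with a toy one-parameter model; it is not TensorFlow's or Spleeter's implementation, and all function names are hypothetical.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (x, y) pair for a toy regression problem


def gradient(w: float, batch: List[Sample]) -> float:
    """Gradient of the mean squared error for the toy model y_hat = w * x."""
    return sum(2.0 * (w * x - y) * x for x, y in batch) / len(batch)


def split_batch(batch: List[Sample], num_replicas: int) -> List[List[Sample]]:
    """Split a batch into equal shards, one per simulated device."""
    if num_replicas < 1 or len(batch) % num_replicas != 0:
        raise ValueError("batch size must be a positive multiple of num_replicas")
    size = len(batch) // num_replicas
    return [batch[i * size:(i + 1) * size] for i in range(num_replicas)]


def data_parallel_step(w: float, batch: List[Sample],
                       num_replicas: int, lr: float) -> float:
    """One synchronous step: per-replica gradients, averaged, one update."""
    shards = split_batch(batch, num_replicas)
    # On real hardware each of these would run on a different GPU.
    replica_grads = [gradient(w, shard) for shard in shards]
    avg_grad = sum(replica_grads) / num_replicas
    return w - lr * avg_grad


batch = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
w_single = data_parallel_step(0.0, batch, num_replicas=1, lr=0.01)
w_multi = data_parallel_step(0.0, batch, num_replicas=2, lr=0.01)
```

With equal-sized shards, the averaged gradient matches the full-batch gradient, so `w_single` and `w_multi` agree. The speed-up on real hardware comes from computing the shard gradients concurrently, at the cost of synchronising gradients between devices on every step.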
@stickyninja3 There's something called Distributed training that implies that it is possible.
Also, looking at the Spleeter source code, it implies that multiple machines can be used to train a model. What I wonder now is why multiple machines are supported before taking full advantage of multiple GPUs. Either way, I'd love it if the Spleeter devs would address this, as it would greatly benefit the community. So what would be nice to address is:
Hi @aidv, we have no plans to work on this feature for the moment. We don't have much experience with distributed training strategies and, as @stickyninja3 said, it would probably require quite a lot of tuning to make it efficient. If you feel that it can be achieved with minor changes, feel free to send us a gist of code and we'll look into it.
@mmoussallam thank you for addressing that. So what about the multiple machines? In Spleeter file Reading some of the comments I can see a whole bunch of info regarding distributed training.
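As background on the multi-machine case: TensorFlow's Estimator API reads its cluster layout from a `TF_CONFIG` environment variable, a JSON document with a `cluster` section (the addresses of every machine) and a `task` section (the role of the current process). The sketch below only builds that variable with the standard library. The host names, ports, and the `make_tf_config` helper are hypothetical, and whether Spleeter trains correctly in such a setup is not confirmed anywhere in this thread.

```python
import json
import os
from typing import List


def make_tf_config(chief: str, workers: List[str],
                   task_type: str, task_index: int) -> str:
    """Build a TF_CONFIG JSON string for one process in a cluster.

    chief and workers are "host:port" addresses (hypothetical here).
    task_type is "chief" or "worker"; task_index selects the entry
    within that role's address list.
    """
    cluster = {"chief": [chief], "worker": list(workers)}
    if task_type not in cluster:
        raise ValueError("unknown task type: " + task_type)
    if not 0 <= task_index < len(cluster[task_type]):
        raise ValueError("task index out of range for " + task_type)
    return json.dumps({
        "cluster": cluster,
        "task": {"type": task_type, "index": task_index},
    })


# Example: configuration for the second worker in a three-machine cluster.
tf_config = make_tf_config(
    chief="machine-a.example:2222",
    workers=["machine-b.example:2222", "machine-c.example:2222"],
    task_type="worker",
    task_index=1,
)
# Each machine must set its own value before the training script starts.
os.environ["TF_CONFIG"] = tf_config
```

Every machine uses the same `cluster` section and differs only in `task`, so the value is usually generated per host by a launcher script rather than written by hand.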
Hi all, I think distributed training is easier with version 2 of TensorFlow. The blogs I read all stated that TensorFlow 1.13 and 1.14 don't share models across GPUs. It would be interesting to see what improvements could be made, but I can't even get training working on a single GPU. Nothing I have tried seems to work. It would be interesting to know the exact environments you use. I have been given my dad's old work laptop, which has a GTX 1660. I'm going to reformat it and try Ubuntu 18.04 now.
@stickyninja3 I wonder how hard it would be to convert the Spleeter code to use v2 of TensorFlow 🤔 Are you on Windows or macOS? I'm on Windows and it's actually pretty easy to get it up and running. Give me your email and I'll send you a message.
Hi aidv, [email protected] is my email. I have tried using Windows but couldn't get training working. I have a laptop to use. It has 64 GB of memory, a 9th-gen Core i7, and a GeForce GTX 1660 Ti. It would be great to get this working. Thanks,
Is this possible?
Can I train using multiple GPUs?