Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

trainSVM scaling with multiple CPUs #4

Open
tobigithub opened this issue May 26, 2016 · 2 comments
Open

trainSVM scaling with multiple CPUs #4

tobigithub opened this issue May 26, 2016 · 2 comments

Comments

@tobigithub
Copy link

Hi,
this is for finger1d 1.4 and shen_ISMB2014.py
Scaling for trainSVM(ckm_f, fingerprints, np=....) (np= number of processors)

1 CPU:
real 0m19.516s
user 0m30.252s
sys 0m2.136s

2 CPUs
real 0m12.498s
user 0m30.048s
sys 0m1.856s

4 CPUs
real 0m9.657s
user 0m29.472s
sys 0m1.800s

8 CPUs
real 0m8.742s
user 0m31.916s
sys 0m1.828s

16 CPUs
real 0m7.935s
user 0m34.060s
sys 0m2.280s

So there is basically for this small set fgtree_folder = "test_data/train_trees/" there is no scaling beyond 4 CPUs. There is some multiprocessing at the first 5-10 seconds of the code. The most time consuming part of the code (again only single CPU at 100% use) is then Writing LI kernel to LI_kernel.txt and Writing RLI kernel to RLI_kernel.txt and the general output.

Cheers
Tobias

@huibinshen
Copy link
Owner

I think may be before new job started, the old one has finished so it use the old CPU not the new ones. This training tasks are so small and it runs on the fly.

@huibinshen
Copy link
Owner

huibinshen commented May 26, 2016

Also, in the example, it has 10 fingerprints to predict, so more than 10 processes would not be useful.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants