Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[13pt] Look into OverClocking for some FIM pipeline py scripts that have multi-processing #1370

Open
RobHanna-NOAA opened this issue Dec 10, 2024 · 0 comments
Labels
enhancement New feature or request Low Priority

Comments

@RobHanna-NOAA
Copy link
Contributor

RobHanna-NOAA commented Dec 10, 2024

Note: Now part of 1377 EPIC: FIM Sys Admin Tasks (and a few related FIM tasks)

A good handful of our tools have multi processing in them. Do some tests on over-clocking which is letting the job number exceed the number of CPU's available. Most of our products make very little usage of CPU usage and Memory usage. With such poor usage of almost all CPU's in multi-processing, it will be pretty safe to over-clock and add more jobs than CPUs. Each product that uses multi-processing, will need testing and small code adjustments for this to happen. The amount of jobs for overclocking on each product will need to be figured our or at least a safe range. Keeping an eye on the EC2's CPU and Memory counter tools now included on many of our ECS's can help. Also watching docker stats can help figure this out.

Gain? Potentially very significant.

We have already tested this to some degree on the latest Catfim which saw performance gains of 20% but CatFIM has multi-proc in multi-proc so it was hard to be definitive on it, but overclocking clearly had a major impact.

I added multi-proc and overclocking to test_case_by_hydro_id.py and safely had it process a full batch in 2 hrs and 40 mins. Various testing without mp and without overclocking in different enviros showed durations ranging from 32 hrs, 24, 20 adn 12 hrs all pre-mp and overclocking. Overclocking tests against the tool after adding mp also showed major improvements in the range of 50% improvements.

@RobHanna-NOAA RobHanna-NOAA added enhancement New feature or request Low Priority labels Dec 10, 2024
@RobHanna-NOAA RobHanna-NOAA changed the title 13pt] Look into OverClocking for some FIM pipeline py scripts that have multi-processing [13pt] Look into OverClocking for some FIM pipeline py scripts that have multi-processing Dec 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request Low Priority
Projects
None yet
Development

No branches or pull requests

1 participant