Nvidia support #391

Closed · 5 tasks done
ilya-zlobintsev opened this issue Oct 25, 2024 · 20 comments

@ilya-zlobintsev
Owner

ilya-zlobintsev commented Oct 25, 2024

With #388 being merged, LACT now has basic support for Nvidia GPUs through NVML (the Nvidia Management Library). This issue tracks the feature support for Nvidia; a rough illustrative sketch of what some of these features look like at the NVML level follows the checklist below.

  • Information reporting
    • The UI now only shows fields that have data present, as it doesn't make sense to report vendor-specific info such as CUDA cores or compute units when the value will always be empty on that GPU
  • Real-time stats reporting (clockspeed, power usage, power states, fan speed, throttling)
  • Power limit configuration
  • Custom fan curves
    • Largely uses the same logic as pre-RDNA3 AMD
  • Clockspeed configuration

Not possible to implement currently:

  • Voltage configuration - doesn't appear to be supported in NVML
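
To put the checklist in concrete terms: LACT itself is written in Rust and uses NVML bindings, but the stats-reporting and fan-curve items boil down to plain NVML calls, roughly as in the sketch below. The curve points, device index 0 and fan index 0 are made-up values for illustration, and the manual fan-speed call typically requires root and a reasonably recent driver; this is not LACT's actual code.

```c
// Minimal sketch of NVML-based stats reporting plus one fan-curve step.
// Link with -lnvidia-ml. Error handling is reduced to a single check.
#include <stdio.h>
#include <nvml.h>

// Hypothetical fan curve: temperature (C) -> fan speed (%).
static const unsigned int curve_temp[]  = {40, 60, 80};
static const unsigned int curve_speed[] = {30, 55, 100};

static unsigned int curve_lookup(unsigned int temp) {
    // Piecewise-linear interpolation between the points above.
    if (temp <= curve_temp[0]) return curve_speed[0];
    for (int i = 1; i < 3; i++) {
        if (temp <= curve_temp[i]) {
            unsigned int t0 = curve_temp[i - 1], t1 = curve_temp[i];
            unsigned int s0 = curve_speed[i - 1], s1 = curve_speed[i];
            return s0 + (temp - t0) * (s1 - s0) / (t1 - t0);
        }
    }
    return curve_speed[2];
}

int main(void) {
    if (nvmlInit_v2() != NVML_SUCCESS) return 1;

    nvmlDevice_t dev;
    nvmlDeviceGetHandleByIndex_v2(0, &dev);

    unsigned int gpu_clock, mem_clock, power_mw, temp, fan;
    nvmlDeviceGetClockInfo(dev, NVML_CLOCK_GRAPHICS, &gpu_clock);
    nvmlDeviceGetClockInfo(dev, NVML_CLOCK_MEM, &mem_clock);
    nvmlDeviceGetPowerUsage(dev, &power_mw);                    // milliwatts
    nvmlDeviceGetTemperature(dev, NVML_TEMPERATURE_GPU, &temp); // degrees C
    nvmlDeviceGetFanSpeed_v2(dev, 0, &fan);                     // percent

    printf("gpu %u MHz, vram %u MHz, %u mW, %u C, fan %u%%\n",
           gpu_clock, mem_clock, power_mw, temp, fan);

    // One fan-curve iteration: map the temperature to a target speed and apply it.
    nvmlDeviceSetFanSpeed_v2(dev, 0, curve_lookup(temp));

    nvmlShutdown();
    return 0;
}
```
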
@ilya-zlobintsev
Owner Author

All the main functionality has been implemented; now it just needs a bit more testing across multiple GPU generations.

@Dekamir

Dekamir commented Nov 15, 2024

Will we ever be able to toggle individual PowerMizer power/performance levels (or edit them) on NVIDIA?

@AbdulrahmanObaido

Can we get support for setting the fan speed on Intel Arc on kernel 6.12?

@ilya-zlobintsev
Owner Author

@AbdulrahmanObaido 6.12 only added support for reading the fan speed on Arc; it's not possible to set it (or do any other configuration on Intel GPUs).
See #401 for more info

@Jimmytalksalot

What is the config YAML for fan control?

@stanislav-kozyrev

@ilya-zlobintsev Thanks for Nvidia GPU support. By the way, the memory clock offset value in the UI and config file is multiplied by 2 compared to GreenWithEnvy and MSI Afterburner. The default memory clock is determined correctly (e.g., 11501 MHz for 23 Gbps). To get 26 Gbps on an RTX 4080 Super, the offset in those programs is 1500 (Afterburner), whereas LACT needs 3000. Please check the attached debug snapshot.
info.json
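
For context on the factor of two (a back-of-the-envelope reading of the numbers above, not a statement about how either tool defines its offset): the clock NVML reports is half the marketed transfer rate, so an offset expressed against that clock is also half of the "Gbps difference".

```c
/* Rough arithmetic, using only the figures from the comment above. */
unsigned int base_clock_mhz   = 11501;                    /* NVML-reported clock at 23 Gbps */
unsigned int target_gbps      = 26;
unsigned int target_clock_mhz = target_gbps * 1000 / 2;   /* 13000 MHz                      */
int offset_mhz = (int)(target_clock_mhz - base_clock_mhz);/* ~= +1500, the Afterburner value */
```
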

@stanislav-kozyrev

@ilya-zlobintsev Also forgot to mention that sometimes Nvidia drivers don't create libnvidia-ml.so (especially beta drivers or ones from the CUDA repo), but there is still libnvidia-ml.so.1. I had to create the missing symlink to make lactd detect Nvidia. Maybe it makes sense to fall back to libnvidia-ml.so.1 if libnvidia-ml.so is not found?
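
A minimal sketch of the fallback being suggested, assuming the library is loaded at runtime with dlopen (LACT's real loading path may differ):

```c
// Try the unversioned NVML name first, then fall back to the versioned one,
// which driver packages ship even when the plain .so symlink is missing.
#include <dlfcn.h>
#include <stddef.h>

void *load_nvml(void) {
    void *handle = dlopen("libnvidia-ml.so", RTLD_NOW);
    if (handle == NULL)
        handle = dlopen("libnvidia-ml.so.1", RTLD_NOW);
    return handle; // NULL if neither name could be resolved
}
```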

@ilya-zlobintsev
Owner Author

@stanislav-kozyrev It was changed to libnvidia-ml.so.1 since the last stable release; see #414 for more info.

@stanislav-kozyrev

@ilya-zlobintsev Thanks for the update. Tried out release 0.7.0 and both issues (library name and memory offset) are resolved.

@HorstBaerbel

Thanks for the Nvidia support! RTX 3060 user here. The GPU and VRAM clocks do not seem to adjust correctly: LACT seems to do something to the clocks, but the actual values are a bit off. Also, I'd expect the power states to update to reflect the new values:

Image

LACT-v0.7.0-snapshot-20250116-163055.tar.gz

@stanislav-kozyrev

@HorstBaerbel Could you please try to reproduce the issue with either GreenWithEnvy (available as a Flatpak) or the official NVIDIA X Server Settings app? Both apps allow adjusting core and memory offsets, but don't forget to apply defaults with LACT first. The power limit can be changed with nvidia-smi. Also, try to get 100% load with a GPU-heavy benchmark like Unigine Superposition.

The core clock on modern (since the last decade or so) GPUs is more like a cap, i.e. an upper limit that the vendor's boost technology aims to achieve under ideal conditions, e.g. a 15 C hot spot temperature, unlimited power and/or voltage, etc. For instance, out of the box my RTX 4080S core clock is set to 3105 MHz, but in reality it stays around 2655-2750 MHz depending on the load. That's why undervolting modern chips is more beneficial than overclocking -- basically it's all about removing obstacles to let the boost algorithm stretch its legs.

@HorstBaerbel

HorstBaerbel commented Jan 16, 2025

@stanislav-kozyrev I'm on Wayland here, so LACT is basically my only hope. GreenWithEnvy and the official NVIDIA X Server Settings app don't support Wayland.
About the clocks: I understand that it is an upper limit, but when I increase the limits the clocks do increase, yet the relation seems totally random. This is underclocking with the power limit set to max, running FurMark:

Image

Not even the reduced-from-default clocks are reached, while the VRAM clock is too high. I'm not saying this is LACT's fault; it may very well be a driver thing.

@BlueGoliath

> Thanks for the Nvidia support! RTX 3060 user here. The GPU and VRAM clocks do not seem to adjust correctly: LACT seems to do something to the clocks, but the actual values are a bit off. Also, I'd expect the power states to update to reflect the new values.

NVML bug.

@BlueGoliath

> > The GPU and VRAM clocks do not seem to adjust correctly: LACT seems to do something to the clocks, but the actual values are a bit off.
>
> NVML bug.

FYI, this has been fixed in 570. It looks like LACT doesn't update those values, as an app restart is required for the new values to be shown. Everything overclocking-related works fine now.

@Aspect250

Why not implement core clock offset and memory clock offset like in these projects:
https://github.com/WickedLukas/nvidia-tuner
https://github.com/Dreaming-Codes/nvidia_oc
They mention that they use NVML. This would provide the ability to overclock and undervolt on Nvidia cards.

@ilya-zlobintsev
Owner Author

> Why not implement core clock offset and memory clock offset like in these projects https://github.com/WickedLukas/nvidia-tuner https://github.com/Dreaming-Codes/nvidia_oc as they mention that they use NVML. This would provide the ability to overclock and undervolt on Nvidia cards.

The current min/max clock functionality does use offsets under the hood. I plan on showing them as offsets in the UI too, but they already work using the same NVML options.

@Aspect250

Aspect250 commented Feb 2, 2025

> The current min/max clock functionality does use offsets under the hood. I plan on showing them as offsets in the UI too, but they already work using the same NVML options.

I'm using this CachyOS guide as my basis right now. So, according to the guide above, you're using nvmlDeviceSetGpcClkVfOffset for the current max GPU clock slider, and not nvmlDeviceSetGpuLockedClocks or the equivalent of it. Have I got that right?

@ilya-zlobintsev
Owner Author

> I'm using this CachyOS guide as my basis right now. So, according to the guide above, you're using nvmlDeviceSetGpcClkVfOffset for the current max GPU clock slider, and not nvmlDeviceSetGpuLockedClocks or the equivalent of it. Have I got that right?

Yes, that's correct - both the max GPU and VRAM clock settings currently use nvmlDeviceSetGpcClkVfOffset/nvmlDeviceSetMemClkVfOffset, and the max clock values shown in the GUI are calculated as the clock speed of the max P-state plus the offset. It was done this way because it allowed reusing the existing GUI and config options, which were originally made for AMD. nvmlDeviceSetGpuLockedClocks is not used anywhere.

Here is the relevant code: https://github.com/ilya-zlobintsev/LACT/blob/master/lact-daemon/src/server/gpu_controller/nvidia.rs#L567
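
To make the offset math above concrete, here is a rough standalone sketch of that approach. It is not the code linked above, and the "default max clock" query is only an approximation of the max P-state clock described in the reply:

```c
// Apply a GPU clock cap expressed the way the GUI shows it:
// target max clock = default max clock + offset, so offset = target - default.
// Link with -lnvidia-ml; error handling is mostly omitted for brevity.
#include <nvml.h>

nvmlReturn_t set_max_gpu_clock(nvmlDevice_t dev, unsigned int target_mhz) {
    unsigned int default_max_mhz;
    nvmlReturn_t ret = nvmlDeviceGetMaxClockInfo(dev, NVML_CLOCK_GRAPHICS, &default_max_mhz);
    if (ret != NVML_SUCCESS)
        return ret;

    int offset_mhz = (int)target_mhz - (int)default_max_mhz; // may be negative (underclock)
    // Deprecated in newer NVML docs in favour of nvmlDeviceSetClockOffsets,
    // but this is the call discussed in this thread.
    return nvmlDeviceSetGpcClkVfOffset(dev, offset_mhz);
}
```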

However, I do plan on replacing the nvmlDeviceSet*ClkVfOffset functions (which are marked as deprecated in the NVML docs) with nvmlDeviceSetClockOffsets, which allows configuring clock offsets for each P-state separately. It was a bit buggy when I tested it originally, but it appears to be improved with the 570 beta driver.

@Aspect250

Thanks for clarifying. So much easier to undervolt/overclock right now.

Cheers for adding NVIDIA support.

@HorstBaerbel

Only here to report that this works much better now with 570.86.16. Thanks!
