Take into account multiple AMD GPUs on the same device #15

SamlRx · 2025-05-11T19:50:04Z

This PR addresses the issue described in #170.

Previously, the code iterated over /sys/class/hwmon and selected the first AMD GPU it encountered, which could mistakenly identify a discrete GPU (dGPU) instead of the integrated GPU (iGPU).

This patch improves detection by resolving the full path of each hwmon device and extracting its associated PCI address. We then use lspci to check whether the device matches known AMD dGPU patterns. Only if the device is not a known dGPU do we consider it a valid iGPU candidate.

This should ensure more reliable and deterministic selection of the iGPU on systems with both AMD integrated and discrete GPUs.

I have also moved the find_intel_igpu function to utils.py for consistency.

… are found on the same device.

SamlRx · 2025-05-11T21:17:34Z

Should I bump version as part of this PR?

antheas · 2025-05-11T21:19:18Z

No. Can you simplify the PR? The moves make it hard to see what changed

SamlRx · 2025-05-11T21:21:25Z

Yeah sure

antheas · 2025-05-11T21:24:25Z

Checking for local CPUs was meant to act as a dGPU check, why didn't it work?

SamlRx · 2025-05-11T21:30:08Z

Checking for local CPUs was meant to act as a dGPU check, why didn't it work?

I can double check in details tomorrow. But as far as I remember I had the file local_cpulist in both directories.

I fixed the PR it should be more readable.

antheas · 2025-05-11T21:47:38Z

Since you have a dGPU, you can see the differences between them. Is there something simpler you can check for that does not require quirking per generation?

SamlRx · 2025-05-12T19:18:31Z

I can double check in details tomorrow. But as far as I remember I had the file local_cpulist in both directories.

Yeah I confirm that I have a local_cpulist in both directories,.

Is there something simpler you can check for that does not require quirking per generation?

As far as I can see nvtop is able to detect if a GPU is integrated or not. Are you aware of a lib similar to libdrm or dlsym for Python?

SamlRx · 2025-05-12T20:11:29Z

I have been digging around and I think I can use vulkaninfo to do so, that would mean the script for AMD GPUs will admit vulkaninfo is installed:

========
GPU0:
	apiVersion         = 1.4.305
	driverVersion      = 25.0.5
	vendorID           = 0x1002
	deviceID           = 0x731f
	deviceType         = PHYSICAL_DEVICE_TYPE_DISCRETE_GPU
	deviceName         = AMD Radeon RX 5700 XT (RADV NAVI10)
	driverID           = DRIVER_ID_MESA_RADV
	driverName         = radv
	driverInfo         = Mesa 25.0.5
	conformanceVersion = 1.4.0.0
	deviceUUID         = 00000000-0800-0000-0000-000000000000
	driverUUID         = 414d442d-4d45-5341-2d44-525600000000
GPU1:
	apiVersion         = 1.4.305
	driverVersion      = 25.0.5
	vendorID           = 0x1002
	deviceID           = 0x15bf
	deviceType         = PHYSICAL_DEVICE_TYPE_INTEGRATED_GPU
	deviceName         = AMD Radeon Graphics (RADV PHOENIX)
	driverID           = DRIVER_ID_MESA_RADV
	driverName         = radv
	driverInfo         = Mesa 25.0.5
	conformanceVersion = 1.4.0.0
	deviceUUID         = 00000000-6400-0000-0000-000000000000
	driverUUID         = 414d442d-4d45-5341-2d44-525600000000

SamlRx · 2025-05-12T20:34:16Z

I have modified the PR with the usage of vulkaninfo

SamlRx · 2025-05-13T20:02:04Z

If you are not confortable merging this, I can still work on it. Could you please share if something is bothering you? I can also add unit tests if this is something that could help.

antheas · 2025-05-13T20:07:45Z

I have been busy with work.

The current problem I have with this as 8 see it is that the functions where you do the string manipulation with ":" and the fact you hardcoded CPU generations seem fragile

If you can instead switch to a static check similar to local CPUs, I will merge this. You can look in the source code of e.g. nvtop for this

…ing an iGPU or a dGPU

SamlRx · 2025-05-13T21:40:49Z

Unfortunately, translating the nvtop part using Cython was out my reach. However, I believe it's possible to achieve something similar using this library: https://github.com/mark9064/pyamdgpuinfo, by extending it to include the AMDGPU_VRAM_TYPE_INTEGRATED information.

As a result, I ended up reverting most of my earlier changes and discovered that mem_info_vram_vendor only appears on discrete GPUs. Which, on reflection, makes sense. This approach preserves the core logic while offering a more reliable way to distinguish between dGPUs and iGPUs in the long run.

Change find_igpu function to take into account when multiple AMD GPUs…

ba2cac0

… are found on the same device.

SamlRx mentioned this pull request May 11, 2025

HHD daemon fail to launch with error: "ValueError: invalid literal for int() with base 10: '800MHz'" hhd-dev/hhd#170

Closed

remove specific test string for AMD gpus

be0f6b4

RxSml added 3 commits May 11, 2025 23:25

Simplify PR

2d7bcf5

Fix issue with last PR

1d4710b

Fix formatting

b876baf

RxSml added 2 commits May 11, 2025 23:36

Fix stupid mistake and add reformating

2e1db00

Remove legacy change.

758c177

RxSml added 2 commits May 12, 2025 22:32

Change logic to use Vulkaninfo for a more reliable result

1be04f8

add typing

9049053

Revert code and use mem_info_vram_vendor to determine if we are check…

02152db

…ing an iGPU or a dGPU

antheas mentioned this pull request Jun 30, 2025

FileNotFoundError when using eGPU #16

Open

antheas mentioned this pull request Aug 26, 2025

[Bug] Dual AMD GPU functionality breaks hhd's power‑profile‑daemon and CPU scheduling hhd-dev/hhd#215

Open

Take into account multiple AMD GPUs on the same device #15

Are you sure you want to change the base?

Take into account multiple AMD GPUs on the same device #15

Uh oh!

Conversation

SamlRx commented May 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SamlRx commented May 11, 2025

Uh oh!

antheas commented May 11, 2025

Uh oh!

SamlRx commented May 11, 2025

Uh oh!

antheas commented May 11, 2025

Uh oh!

SamlRx commented May 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

antheas commented May 11, 2025

Uh oh!

SamlRx commented May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SamlRx commented May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SamlRx commented May 12, 2025

Uh oh!

SamlRx commented May 13, 2025

Uh oh!

antheas commented May 13, 2025

Uh oh!

SamlRx commented May 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SamlRx commented May 11, 2025 •

edited

Loading

SamlRx commented May 11, 2025 •

edited

Loading

SamlRx commented May 12, 2025 •

edited

Loading

SamlRx commented May 12, 2025 •

edited

Loading