Qual tool cluster info - calculating the number of exec nodes on yarn should look at all executors over time when dynamic allocation on #1121

tgravescs · 2024-06-14T12:53:42Z

Describe the bug
Currently, the logic for calculating the number of nodes used on a YARN cluster looks only at the active executors. This is a reasonable approximation, but with dynamic allocation the number of executors can change a lot over the lifetime of an application.

Our logic:

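    // Collect (host, cores) for executors that are currently active only.
    val activeExecInfo = executorIdToInfo.values.collect {
      case execInfo if execInfo.isActive => (execInfo.host, execInfo.totalCores)
    }

    // activeHosts is presumably the host component of activeExecInfo;
    // its derivation is not shown in this excerpt.
    activeHosts.toSet.size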

We should instead look at the maximum number of executors running at any point in time, and then calculate the number of nodes needed from that.
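
A rough sketch of that idea, not the tool's actual implementation: treat each executor add and remove as a +1/-1 event, sweep the events in time order to find the peak number of concurrent executors, then divide by the executors per node. `ExecLifetime`, `PeakExecutors`, `maxConcurrentExecutors`, `nodesNeeded`, and the `executorsPerNode` parameter are all hypothetical names introduced here for illustration. The sketch also assumes that add/remove timestamps are available for every executor and that every node hosts the same number of executors.

```scala
// Hypothetical, simplified executor record: when it was added and,
// if it was removed before the application ended, when it was removed.
final case class ExecLifetime(host: String, addTime: Long, removeTime: Option[Long])

object PeakExecutors {
  /** Maximum number of executors alive at the same instant. */
  def maxConcurrentExecutors(execs: Seq[ExecLifetime], appEndTime: Long): Int = {
    // +1 when an executor is added, -1 when it is removed
    // (or at application end if it was never removed).
    val events = execs.flatMap { e =>
      Seq((e.addTime, 1), (e.removeTime.getOrElse(appEndTime), -1))
    }
    // Sort by time; at equal timestamps removals (-1) sort before additions (+1),
    // so an executor that replaces another is not counted twice.
    val sorted = events.sortBy { case (time, delta) => (time, delta) }
    // Running count after each event; the peak is the maximum of that series.
    sorted.scanLeft(0) { case (running, (_, delta)) => running + delta }.max
  }

  /** Nodes needed for the peak, assuming a fixed number of executors per node. */
  def nodesNeeded(maxExecutors: Int, executorsPerNode: Int): Int = {
    require(executorsPerNode > 0, "executorsPerNode must be positive")
    (maxExecutors + executorsPerNode - 1) / executorsPerNode // ceiling division
  }
}
```

For example, with executors alive over [0, 10), [5, 15), [10, end) and [12, 14), the peak is 3 concurrent executors, which at 2 executors per node would need 2 nodes, even if only one executor is still active at the end of the application.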


tgravescs added the bug (Something isn't working) and ? - Needs Triage labels Jun 14, 2024

tgravescs mentioned this issue Jun 14, 2024

Include number of executors per node in cluster information #1119

Merged

amahussein added the tools label Jun 14, 2024

amahussein removed the ? - Needs Triage label Jun 27, 2024

amahussein assigned tgravescs Jul 1, 2024

tgravescs removed their assignment Jul 2, 2024
