#12 - Enable access to all @analyze.kernel profiling results via global cache #14

ConvolutedDog · 2025-12-14T09:32:58Z

Intro

This commit introduces a global profiling cache system to the nsight-python. It fundamentally improves how profiling results are collected and accessed, especially when using multiple @nsight.analyze.kernel-decorated functions within a single script.

Previously, profiling was performed in separate script executions for each decorated function, which meant only the currently-profiled function returned results while others returned None. This made it impossible to aggregate results from multiple kernels or access all profiling results within the same script run.

With this update, the profiling system saves results from every profiling execution to disk using a singleton-based cache manager. On subsequent calls, each decorated function transparently loads its result from cache if profiling has already been executed, enabling access to all profiling results in a single script run.

Refer to #12 for more details.

Key Changes

Introduced GlobalNCUProfileCache singleton.

Files Updated

Added: nsight/cache.py (new cache system)
Modified: nsight/collection/ncu.py (uses cache system)
Modified: examples/06_plot_customization.py to validate new cache behavior

…python. It fundamentally improves how profiling results are collected and accessed, especially when using multiple `@nsight.analyze.kernel`-decorated functions within a single script. Previously, profiling was performed in separate script executions for each decorated function, which meant only the currently-profiled function returned results while others returned `None`. This made it impossible to aggregate results from multiple kernels or access all profiling results within the same script run. With this update, the profiling system saves results from every profiling execution to disk using a singleton-based cache manager. On subsequent calls, each decorated function transparently loads its result from cache if profiling has already been executed, enabling access to all profiling results in a single script run. Refer to NVIDIA#12 for more details. - Introduced `GlobalNCUProfileCache` singleton. - Added: nsight/cache.py (new cache system) - Modified: nsight/collection/ncu.py (uses cache system) - Modified: examples/06_plot_customization.py to validate new cache behavior Signed-off-by: ConvolutedDog <[email protected]>

Signed-off-by: ConvolutedDog <[email protected]>

copy-pr-bot · 2025-12-14T09:33:01Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: ConvolutedDog <[email protected]>

acollins3 · 2025-12-19T12:52:26Z

/ok to test 1c36385

Signed-off-by: ConvolutedDog <[email protected]>

ConvolutedDog added 2 commits December 14, 2025 17:29

fix lint

47584ac

Signed-off-by: ConvolutedDog <[email protected]>

add type hints for singleton class and fix mypy errors

0348cc3

Signed-off-by: ConvolutedDog <[email protected]>

ConvolutedDog marked this pull request as draft December 15, 2025 03:06

[Fix] Implement process isolation and improve cache management

0aed45f

Signed-off-by: ConvolutedDog <[email protected]>

ConvolutedDog marked this pull request as ready for review December 15, 2025 05:12

Merge branch 'main' into multiple-func-single-script

1c36385

Signed-off-by: ConvolutedDog <[email protected]>

Merge branch 'main' into multiple-func-single-script

3ab76f8

Signed-off-by: ConvolutedDog <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

#12 - Enable access to all @analyze.kernel profiling results via global cache #14

#12 - Enable access to all @analyze.kernel profiling results via global cache #14

ConvolutedDog commented Dec 14, 2025 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Dec 14, 2025

Uh oh!

acollins3 commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

#12 - Enable access to all @analyze.kernel profiling results via global cache #14

Are you sure you want to change the base?

#12 - Enable access to all @analyze.kernel profiling results via global cache #14

Conversation

ConvolutedDog commented Dec 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Intro

Key Changes

Files Updated

Uh oh!

copy-pr-bot bot commented Dec 14, 2025

Uh oh!

acollins3 commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ConvolutedDog commented Dec 14, 2025 •

edited

Loading