-
Notifications
You must be signed in to change notification settings - Fork 2
Troubleshooting
FindHao edited this page Aug 31, 2025
·
5 revisions
- Ensure
CUDA_INJECTION64_PATHpoints tolib/cutracer.so - Set
KERNEL_FILTERSto match the actual kernel name (mangled or unmangled) - Verify working directory has write permission
- Ensure the kernel contains clock reads (e.g., Triton
pl.scopeadds them) - Check that
CUTRACER_ANALYSIS=proton_instr_histogramis set
- Use
opcode_onlyonly; avoidreg_trace/mem_traceunless required - Narrow
KERNEL_FILTERSand useINSTR_BEGIN/INSTR_END
- Tool handles these paths; if missing outputs, validate stream capture status and synchronization
- For captured graphs, data is flushed at
cuGraphLaunchexit; ensure proper stream sync.
-
nvcc/ptxasminimums are enforced byMakefile; check errors and adjustARCH
- Warp ID mismatch: ensure both runs target the same kernel launch; avoid filtering one side only.
- Missing
ipcvalues: cycles or instruction counts missing/zero; re-check both inputs. - Kernel hash ambiguity: pass
--kernel-hashexplicitly to the parser.