feat(vm_reexecute): add pprof profiling support #4790

Elvis339 · 2025-12-25T10:02:22Z

Why this should be merged

Adds --pprof-dir <path> flag to C-chain re-execution benchmark for CPU and memory profiling.

Part of cross-stack profiling effort: ava-labs/firewood#1582

pprof shows time spent in CGO calls as black boxes we see that ffi.(*Revision).Get takes X% of CPU but not what happens inside Firewood it's still valuable because:

Shows Go-side hotspots (GC, allocations, serialization, AvalancheGo)
Identifies which Go code paths trigger FFI calls
Measures total time per FFI function

Combined with Rust-side profiling, we get full visibility across the boundary.

How this works

When --pprof-dir <path> is passed:

Creates directory at <path> directory
Starts CPU profiling
Writes cpu.profile, mem.profile and lock.profile on exit

How this was tested

nix develop (make sure you have AWS credentials set, c-chain-reexecute requires S3 access)
Edit line 97 of vm_reexecution and set output dir then run ./scripts/run_task.sh c-chain-reexecution-firewood-101-250k
go tool pprof /tmp/pprof/cpu.profile
- top 20
- list Revision.*Get

NOTE: --pprof-dir is intentionally not exposed in benchmark_cchain_range.sh yet the profiling API may evolve as other profiling needs arise ava-labs/firewood#1582. The flag is still available when running the test directly via go run.

ROUTINE ======================== github.com/ava-labs/firewood-go-ethhash/ffi.(*Revision).Get in /Users/elvis.sabanovic/go/pkg/mod/github.com/ava-labs/firewood-go-ethhash/[email protected]/revision.go
      20ms      6.27s (flat, cum)  1.06% of Total
         .          .     61:func (r *Revision) Get(key []byte) ([]byte, error) {
         .          .     62:   if r.handle == nil {
         .          .     63:           return nil, ErrDroppedRevision
         .          .     64:   }
         .          .     65:
         .       10ms     66:   var pinner runtime.Pinner
      10ms       10ms     67:   defer pinner.Unpin()
         .          .     68:
      10ms      110ms     69:   return getValueFromValueResult(C.fwd_get_from_revision(
         .          .     70:           r.handle,
         .          .     71:           newBorrowedBytes(key, &pinner),
         .      6.14s     72:   ))
         .          .     73:}
         .          .     74:
         .          .     75:// Iter creates an iterator starting from the provided key on revision.
         .          .     76:// pass empty slice to start from beginning
         .          .     77:// It returns ErrDroppedRevision if Drop has already been called.
ROUTINE ======================== github.com/ava-labs/firewood-go-ethhash/ffi.(*Revision).Get.func1 in /Users/elvis.sabanovic/go/pkg/mod/github.com/ava-labs/firewood-go-ethhash/[email protected]/revision.go
         0      6.14s (flat, cum)  1.03% of Total
         .          .     69:   return getValueFromValueResult(C.fwd_get_from_revision(
         .          .     70:           r.handle,
         .       30ms     71:           newBorrowedBytes(key, &pinner),
         .      6.11s     72:   ))
         .          .     73:}
         .          .     74:
         .          .     75:// Iter creates an iterator starting from the provided key on revision.
         .          .     76:// pass empty slice to start from beginning
         .          .     77:// It returns ErrDroppedRevision if Drop has already been called.

Need to be documented in RELEASES.md?

No

…rofile generation

Copilot

Pull request overview

This PR adds pprof profiling support to the C-chain re-execution benchmark to enable CPU and memory profiling during benchmark runs. This supports cross-stack profiling efforts by providing Go-side performance visibility that complements Rust-side profiling.

Key changes:

Added --pprof flag to enable profiling during benchmark execution
Implemented CPU and memory profile file generation in ./pprof/ directory
Added logging of pprof enablement status to benchmark output

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/reexecute/c/vm_reexecute.go

…ld be replaced with PROFILE flag which will capture the whole profiling stack including pprof

scripts/benchmark_cchain_range.sh

tests/reexecute/c/vm_reexecute.go

…lity

…lexible profiling configuration

RodrigoVillar

Currently, the changes this PR introduces are only invokable via go run, which is not the recommended way to running the reexecution test (with preference going to ./scripts/benchmark_cchain_range.sh or using task). Can this PR also include a way to invoke profiling both the preferred options above?

This may require coordination with #4761

Elvis339 · 2026-01-05T15:46:07Z

Currently, the changes this PR introduces are only invokable via go run, which is not the recommended way to running the reexecution test (with preference going to ./scripts/benchmark_cchain_range.sh or using task). Can this PR also include a way to invoke profiling both the preferred options above?

This may require coordination with #4761

See the note in the PR description the --pprof-dir is intentionally not exposed in the scripts yet.
This PR is a prerequisite for cross-stack profiling (a larger change that will consolidate how profiling is exposed across the tooling). Breaking it down for simpler, incremental review.
Currently I'm the only one experimenting with these benchmarks with debug flags and I'm compiling vm_reexecutable myself anyway so having it available via cmd arg is fine for now. Once the full profiling story lands, we can expose it properly in the scripts.

RodrigoVillar · 2026-01-05T15:50:53Z

Currently, the changes this PR introduces are only invokable via go run, which is not the recommended way to running the reexecution test (with preference going to ./scripts/benchmark_cchain_range.sh or using task). Can this PR also include a way to invoke profiling both the preferred options above?
This may require coordination with #4761

See the note in the PR description the --pprof-dir is intentionally not exposed in the scripts yet. This PR is a prerequisite for cross-stack profiling (a larger change that will consolidate how profiling is exposed across the tooling). Breaking it down for simpler, incremental review. Currently I'm the only one experimenting with these benchmarks with debug flags and I'm compiling vm_reexecutable myself anyway so having it available via cmd arg is fine for now. Once the full profiling story lands, we can expose it properly in the scripts.

Ah I now see #4791

feat(vm_reexecute): add pprof profiling support with CPU and memory p…

4786b42

…rofile generation

Elvis339 requested a review from a team as a code owner December 25, 2025 10:02

Copilot AI review requested due to automatic review settings December 25, 2025 10:02

github-project-automation bot added this to avalanchego Dec 25, 2025

Copilot AI reviewed Dec 25, 2025

View reviewed changes

tests/reexecute/c/vm_reexecute.go Outdated Show resolved Hide resolved

tests/reexecute/c/vm_reexecute.go Outdated Show resolved Hide resolved

Elvis339 mentioned this pull request Dec 25, 2025

Cross-stack profiling for Go <-> CGO <- Rust ava-labs/firewood#1582

Open

Elvis339 self-assigned this Dec 25, 2025

Elvis339 added 3 commits December 25, 2025 14:07

feat(benchmark_cchain)

3745c1f

lint

53b601a

chore(benchmark_cchain): pprof flag, in the subsequent PRs PPROF shou…

bd6bf20

…ld be replaced with PROFILE flag which will capture the whole profiling stack including pprof

Elvis339 mentioned this pull request Dec 25, 2025

feat(reexecution): add optional profiling support to benchmarking script #4791

Draft

RodrigoVillar requested changes Dec 29, 2025

View reviewed changes

github-project-automation bot moved this to In Progress 🏗️ in avalanchego Dec 29, 2025

Elvis339 added 2 commits January 5, 2026 19:07

chore(vm_reexecute): refactor pprof setup to use unified profiler uti…

7f0d933

…lity

chore(vm_reexecute): replace pprofEnabled flag with pprofDirArg for f…

1b5ea63

…lexible profiling configuration

Elvis339 requested a review from RodrigoVillar January 5, 2026 15:18

chore: revert doc changes to benchmark_cchain_range.sh

823252f

RodrigoVillar reviewed Jan 5, 2026

View reviewed changes

RodrigoVillar approved these changes Jan 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(vm_reexecute): add pprof profiling support #4790

feat(vm_reexecute): add pprof profiling support #4790

Uh oh!

Elvis339 commented Dec 25, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

RodrigoVillar left a comment

Uh oh!

Elvis339 commented Jan 5, 2026 •

edited

Loading

Uh oh!

RodrigoVillar commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(vm_reexecute): add pprof profiling support #4790

Are you sure you want to change the base?

feat(vm_reexecute): add pprof profiling support #4790

Uh oh!

Conversation

Elvis339 commented Dec 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why this should be merged

How this works

How this was tested

Need to be documented in RELEASES.md?

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

RodrigoVillar left a comment

Choose a reason for hiding this comment

Uh oh!

Elvis339 commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RodrigoVillar commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Elvis339 commented Dec 25, 2025 •

edited

Loading

Elvis339 commented Jan 5, 2026 •

edited

Loading