Conversation

@ahoppen (Member) commented Apr 10, 2025

This adds the infrastructure to take performance measurements for PRs: the project is measured before the PR's changes and again with the changes applied, and the two results are compared.

This will become really useful once we can switch this to a macOS runner and are then able to measure instructions executed by an executable (like swift-format supports via `swift-format --measure-instructions`). Instruction counting isn't available on Linux, as far as I could find out.

But even now, we can use this to track other metrics like binary size.

Example run here (ignore the actual measurement; it's bogus): ahoppen/swift-format#8 (comment)
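For illustration, the before/after idea for a metric like binary size boils down to building and measuring the product at both revisions in one job. A minimal sketch, assuming a Linux runner; `BASE_SHA` and `HEAD_SHA` are placeholders, not the actual workflow code from this PR:

```bash
# Hypothetical sketch of the before/after comparison, not the PR's actual workflow.
# BASE_SHA and HEAD_SHA stand in for the merge base and the PR head revision.
git checkout "$BASE_SHA"
swift build -c release
base_size=$(stat -c %s .build/release/swift-format)

git checkout "$HEAD_SHA"
swift build -c release
head_size=$(stat -c %s .build/release/swift-format)

echo "binary size: $base_size -> $head_size bytes"
```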

@ahoppen requested a review from a team as a code owner on April 10, 2025.
@FranzBusch (Member) left a comment

I'm curious to understand the direction here more. In the server ecosystem we are using the package-benchmark package to write performance tests for our packages. Packages like Foundation have also started adopting it. Could we use that instead of the custom scripts here?

@ahoppen (Member, Author) commented Apr 11, 2025

Do you have an example of a package using the package-benchmark package for performance measurements?

@FranzBusch (Member):
Yes, you can see it here: https://github.com/apple/swift-certificates/tree/main/Benchmarks.

This package only uses allocation metrics, but another good metric is CPU instructions, since those are stable on CI. I would really encourage us to adopt this approach instead of hand-rolling performance scripts.
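For reference, a benchmark package laid out like the swift-certificates one is driven through package-benchmark's SwiftPM plugin; a rough sketch of the commands (as documented by the package, so details may differ by version):

```bash
cd Benchmarks                 # the separate benchmark package, as in swift-certificates
swift package benchmark list  # list the available benchmark targets
swift package benchmark       # run all benchmarks and print the configured metrics
```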

@ahoppen (Member, Author) commented Apr 25, 2025

Great suggestion to use the benchmarks package. I have updated the workflow to be based on it. An example run can be found here: ahoppen/swift-format#10 (comment)

@FranzBusch (Member) left a comment

Great stuff. I would like @rnro to take a look at this as well. We had some plans to upstream the existing benchmark workflow from NIO (https://github.com/apple/swift-nio/blob/main/.github/workflows/benchmarks.yml) here. That workflow runs benchmarks against multiple Swift versions, which we found very helpful since it allowed us to catch compiler regressions and improvements quickly.

@ktoso (Contributor) commented Apr 25, 2025

Very nice! Great to spread more use of the benchmark package <3

FYI @ser-0xff & @hassila :-)

@ahoppen (Member, Author) commented Apr 25, 2025

One thing worth mentioning here: this workflow does not use a fixed baseline, which makes it very easy to operate. We don't need to ensure that performance testing always runs on exactly the same kind of machine; we don't even need to guarantee that the Swift version stays constant.

The flip side is that we can't use it to test for compiler regressions, because runs with different compilers would necessarily happen in different containers or on different nodes. That sort of testing has different trade-offs, so if we want it, I think we should add a separate kind of workflow for it.
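A sketch of what the no-fixed-baseline, same-job comparison described above can look like with package-benchmark's baseline subcommands (the baseline names and SHA variables are placeholders; the actual workflow steps may differ):

```bash
# Measure the base revision and record it as a named baseline.
git checkout "$BASE_SHA"
swift package --allow-writing-to-package-directory benchmark baseline update base

# Measure the PR revision in the same job, on the same machine and toolchain.
git checkout "$PR_SHA"
swift package --allow-writing-to-package-directory benchmark baseline update pr

# Compare the two baselines, which were produced under identical conditions.
swift package benchmark baseline compare base pr
```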

@hassila commented Apr 25, 2025

@ahoppen, for the record, CPU instruction counting is supported on Linux, but you must have permission to make the perf_event_open syscall. If you modify your local package-benchmark copy (or make a fork) and uncomment this line in the package:

//            fprintf(stderr, "Can't enable performance counters for instructions metric, error in perf_event_open syscall, failed with [%d], error: %s\n", errorCode, strerror(errorCode));

you will probably get a good hint. Most likely it's missing permissions.
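If it is a permissions problem, these general Linux checks (not specific to package-benchmark) usually confirm it quickly:

```bash
cat /proc/sys/kernel/perf_event_paranoid      # values >= 2 block unprivileged counter access
sudo sysctl -w kernel.perf_event_paranoid=1   # relax it, where the environment allows
```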

@hassila commented Apr 25, 2025

Or try running `perf` directly and see if that works, for example:

perf stat -e instructions /bin/ls

@ahoppen (Member, Author) commented Apr 29, 2025

Thanks for chiming in, @hassila. I think the problem is that we will be running the performance tests inside a Docker container in GitHub Actions, and my understanding is that you don't have access to perf_event_open in these setups. If you know better, I would love to hear about it.

@hassila commented Apr 29, 2025

> Thanks for chiming in, @hassila. I think the problem is that we will be running the performance tests inside a Docker container in GitHub Actions, and my understanding is that you don't have access to perf_event_open in these setups. If you know better, I would love to hear about it.

Oh, I misread the description as saying that instruction counts were not available from the benchmark package, but you probably meant in Docker. A quick Google search turned up this workaround:
https://medium.com/@geekidea_81313/running-perf-in-docker-kubernetes-7eb878afcd42

Also ping @freef4ll - maybe you know for sure.
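For what it's worth, workarounds like the one in the linked article generally come down to granting the container extra privileges (standard Docker flags; `my-image` is a placeholder):

```bash
# Bluntest option: run the container fully privileged.
docker run --privileged my-image perf stat -e instructions /bin/ls

# Narrower: add CAP_SYS_ADMIN and disable the default seccomp profile,
# which otherwise blocks the perf_event_open syscall.
docker run --cap-add SYS_ADMIN --security-opt seccomp=unconfined \
    my-image perf stat -e instructions /bin/ls
```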

@ahoppen (Member, Author) commented Apr 29, 2025

I tried something like this before without success; I think GitHub Actions doesn't allow these extended privileges. But maybe I'll give it another try when I find the time.

@freef4ll commented Apr 30, 2025

Unfortunately it does not look like GitHub public runners expose instruction PMU counters (I got a shell into a running Actions job via https://github.com/mxschmitt/action-tmate):

root@fv-az1283-31:~# perf stat sleep 1

 Performance counter stats for 'sleep 1':

              0.70 msec task-clock                       #    0.001 CPUs utilized
                 1      context-switches                 #    1.435 K/sec
                 0      cpu-migrations                   #    0.000 /sec
                79      page-faults                      #  113.371 K/sec
   <not supported>      cycles
   <not supported>      instructions
   <not supported>      branches
   <not supported>      branch-misses

       1.001178359 seconds time elapsed

       0.001080000 seconds user
       0.000000000 seconds sys

The VMs run in Azure under Hyper-V, and while it appears that PMU events can be supported there, they are not enabled.

@ahoppen (Member, Author) commented May 5, 2025

It's unfortunate that GitHub Actions runners don't support instruction counters by default, but thanks a lot for verifying my analysis, @freef4ll!

@rnro (Contributor) commented May 7, 2025

This looks great! I think it serves a different purpose to the use case we have in the swift-server repos, where we want to compare against a fixed baseline so that we can check not only PRs for regressions but also scheduled runs. I think having both is a good thing.

@FranzBusch (Member) left a comment

LGTM. Just two minor comments.

          else
            echo "has_significant_changes=false" >> "$GITHUB_OUTPUT"
          fi
      - name: Install gh
Review comment (Member):

Wondering if we would be better off just moving this into a separate job, since gh is installed on the runners by default.

Reply (Member Author):

I tried this before, but if you run it as a separate job, it shows up as a separate item in the pull request status, which is confusing.

          apt update
          apt install gh -y
      - name: Post comment
        if: ${{ steps.compare_performance.outputs.has_significant_changes == 'true' }}
Review comment (Member):

Should we unconditionally post this? Otherwise it might seem like nothing happened.

Reply (Member Author):

I don't think so. Most PRs don't change a package's performance, and since we run this unconditionally, a comment saying that performance stayed constant would just create noise; users would get used to such comments and might not notice when one highlights a real performance change. If you want to double-check that the workflow ran, you can open the GitHub Actions log.
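For reference, posting such a comment from a workflow step can be done with the gh CLI's built-in command; a sketch with placeholder names (the PR's actual step may differ):

```bash
# Assumes GH_TOKEN is set in the environment, as is typical in GitHub Actions.
gh pr comment "$PR_NUMBER" --body-file benchmark-results.md
```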

@ahoppen merged commit b258470 into swiftlang:main on May 12, 2025; 25 of 26 checks passed.
@ahoppen deleted the performance-test branch on May 12, 2025.