Conversation

@LiamSarsfield
Contributor

@LiamSarsfield LiamSarsfield commented Dec 12, 2025

Addresses HOG-438: Create Jetpack Performance Tooling for LCP

Proposed changes:

  • Add performance testing infrastructure under tools/performance/ to measure wp-admin dashboard LCP (Largest
    Contentful Paint) with Jetpack connected
  • Uses Docker for an isolated WordPress environment with a simulated WordPress.com connection (fake tokens + mocked API with 200ms latency)
  • Includes CPU throttling calibration for consistent results across different machines
  • Posts metrics to CodeVitals for tracking over time

Other information:

  • Have you written new tests for your changes, if applicable?
  • Have you checked the E2E test CI results, and verified that your changes do not break them?
  • Have you tested your changes on WordPress.com, if applicable (if so, you'll see a generated comment below with
    a script to run)?

Jetpack product discussion

pc9hqz-3Rb-p2

Does this pull request change what data or activity we track or use?

No

Testing instructions:

Prerequisites: Docker running, Node 18+

cd tools/performance
pnpm install
pnpm exec playwright install chromium
pnpm calibrate
pnpm test -- --skip-codevitals

The test suite will automatically clone the pre-built plugin from jetpack-production on first run.

Expected output: LCP measurement for wp-admin dashboard with Jetpack connected (simulated)

Introduces automated LCP (Largest Contentful Paint) measurement for the
wp-admin dashboard with simulated Jetpack WordPress.com connection.

Key components:
- Docker environment with WordPress + simulated Jetpack connection
- CPU throttling calibration for consistent results across CI agents
- Playwright-based LCP measurement (see the sketch below)
- CodeVitals integration for metric tracking

Metric posted: wp-admin-dashboard-connection-sim-largestContentfulPaint
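
For reference, the Playwright-based measurement boils down to something like this minimal sketch (the function and variable names here are illustrative, not the PR's actual code):

import { chromium } from 'playwright';

// Measure LCP by observing largest-contentful-paint entries in the page.
async function measureLcp( url ) {
	const browser = await chromium.launch( { headless: true } );
	const page = await browser.newPage();

	// Register the observer before any document scripts run so early LCP
	// candidates are not missed; `buffered: true` replays entries already seen.
	await page.addInitScript( () => {
		window.__lcp = 0;
		new PerformanceObserver( list => {
			for ( const entry of list.getEntries() ) {
				// The last entry reported is the final LCP candidate.
				window.__lcp = entry.startTime;
			}
		} ).observe( { type: 'largest-contentful-paint', buffered: true } );
	} );

	await page.goto( url, { waitUntil: 'networkidle' } );
	const lcp = await page.evaluate( () => window.__lcp );
	await browser.close();
	return lcp;
}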
@github-actions
Contributor

github-actions bot commented Dec 12, 2025

Thank you for your PR!

When contributing to Jetpack, we have a few suggestions that can help us test and review your patch:

  • ✅ Include a description of your PR changes.
  • ✅ Add a "[Status]" label (In Progress, Needs Review, ...).
  • ✅ Add a "[Type]" label (Bug, Enhancement, Janitorial, Task).
  • ✅ Add testing instructions.
  • ✅ Specify whether this PR includes any changes to data or privacy.
  • ✅ Add changelog entries to affected projects

This comment will be updated as you work on your PR and make changes. If you think that some of those checks are not needed for your PR, please explain why you think so. Thanks for cooperation 🤖


Follow this PR Review Process:

  1. Ensure all required checks appearing at the bottom of this PR are passing.
  2. Make sure to test your changes on all platforms that it applies to. You're responsible for the quality of the code you ship.
  3. You can use GitHub's Reviewers functionality to request a review.
  4. When it's reviewed and merged, you will be pinged in Slack to deploy the changes to WordPress.com simple once the build is done.

If you have questions about anything, reach out in #jetpack-developers for guidance!

@github-actions github-actions bot added the [Status] Needs Author Reply label Dec 12, 2025
- Remove outdated 4-scenario comment (only 1 scenario exists)
- Remove dead baseline comparison code that never executed
- Trim JSDoc to single-line descriptions across all scripts
- Update eslint config to allow minimal JSDoc
@LiamSarsfield LiamSarsfield changed the title Add Jetpack performance testing CI infrastructure [Do not merge] Add Jetpack performance testing CI infrastructure Dec 12, 2025
@LiamSarsfield LiamSarsfield added the DO NOT MERGE label Dec 12, 2025
Changed the Docker startup sequence to prevent race conditions where
WordPress containers interfere with WP-CLI's database operations:

1. Start only the db container
2. Wait for MySQL to be ready
3. Run WP-CLI setup (WordPress container NOT running)
4. Start WordPress containers

This ensures WP-CLI has exclusive database access during setup,
eliminating "table doesn't exist" errors caused by concurrent access.

Changes:
- run-performance-tests.js: Sequential container startup (sketched below)
- setup-wordpress.sh: Simplified (removed HTTP wait logic)
- docker-compose.yml: Removed wpcli depends_on wordpress
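
In code, that sequencing looks roughly like the following (a sketch assuming execa for shelling out; the setup script path inside the wpcli container and the waitForMysql helper are assumptions):

import { execa } from 'execa';

// Hypothetical helper: poll mysqladmin inside the db container until it answers.
async function waitForMysql() {
	for ( let attempt = 0; attempt < 30; attempt++ ) {
		try {
			await execa( 'docker', [ 'compose', 'exec', '-T', 'db', 'mysqladmin', 'ping', '--silent' ] );
			return;
		} catch {
			await new Promise( resolve => setTimeout( resolve, 2000 ) );
		}
	}
	throw new Error( 'MySQL did not become ready in time' );
}

await execa( 'docker', [ 'compose', 'up', '-d', 'db' ] ); // 1. db only
await waitForMysql(); // 2. wait for MySQL
await execa( 'docker', [ 'compose', 'run', '--rm', 'wpcli', '/setup-wordpress.sh' ] ); // 3. setup with WordPress down
await execa( 'docker', [ 'compose', 'up', '-d' ] ); // 4. start the remaining containers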
The import plugin is not available in this context, and the base config
handles import resolution. Also ensure jsdoc rules are disabled for
these utility scripts.
Dependencies (playwright, dotenv) are in tools/performance/package.json,
not the monorepo root, so the import resolver can't find them.
- Log calibration file path and existence
- Show throttle rate, target score, calibration time, and samples
- Confirm throttling is applied via CDP on first iteration
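
For context, applying the calibrated throttle through the Chrome DevTools Protocol in Playwright looks roughly like this (a sketch; the calibration file name and shape are assumptions):

import { readFile } from 'node:fs/promises';
import { chromium } from 'playwright';

// Load the throttle rate produced by `pnpm calibrate` (file shape assumed).
const { throttleRate } = JSON.parse( await readFile( './calibration.json', 'utf8' ) );

const browser = await chromium.launch( { headless: true } );
const page = await browser.newPage();

// CPU throttling is a CDP command, so it needs a raw CDP session (Chromium only).
const session = await page.context().newCDPSession( page );
await session.send( 'Emulation.setCPUThrottlingRate', { rate: throttleRate } );
console.log( `CPU throttling applied via CDP: ${ throttleRate }x` );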
@github-actions
Contributor

github-actions bot commented Dec 14, 2025

Are you an Automattician? Please test your changes on all WordPress.com environments to help mitigate accidental explosions.

  • To test on WoA, go to the Plugins menu on a WoA dev site. Click on the "Upload" button and follow the upgrade flow to be able to upload, install, and activate the Jetpack Beta plugin. Once the plugin is active, go to Jetpack > Jetpack Beta, select your plugin (Jetpack or WordPress.com Site Helper), and enable the add/perf-testing-ci-mvp branch.
  • To test on Simple, run the following command on your sandbox:
bin/jetpack-downloader test jetpack add/perf-testing-ci-mvp
bin/jetpack-downloader test jetpack-mu-wpcom-plugin add/perf-testing-ci-mvp

Interested in more tips and information?

  • In your local development environment, use the jetpack rsync command to sync your changes to a WoA dev blog.
  • Read more about our development workflow here: PCYsg-eg0-p2
  • Figure out when your changes will be shipped to customers here: PCYsg-eg5-p2

@jp-launch-control

jp-launch-control bot commented Dec 14, 2025

Code Coverage Summary

This PR did not change code coverage!

That could be good or bad, depending on the situation. Everything covered before, and still is? Great! Nothing was covered before? Not so great. 🤷

Full summary · PHP report · JS report

…wordpress-jetpack-connected service, ensuring it starts only after the service is initiated, which helps prevent race conditions during container startup.
- Updated README.md to include new environment variables: CODEVITALS_URL, GIT_BRANCH, WP_ADMIN_USER, and WP_ADMIN_PASS.
- Modified post-to-codevitals.js to streamline metric extraction by removing unused baseMetrics.
- Improved run-performance-tests.js to prioritize the GIT_COMMIT environment variable for git hash retrieval, ensuring accurate tracking during CI runs (sketched below).
- Introduced an empty baseMetrics object in the payload to clarify that baseline normalization is not utilized in the performance metrics submission.
- Simplified browser launch to always use headless mode for consistency in performance calibration.
- Removed conditional logic for headful mode, ensuring a streamlined execution in both local and CI environments.
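
Put together, the hash resolution and payload described above amount to something like this (a sketch; the payload field names are assumptions, see post-to-codevitals.js for the real shape):

import { execa } from 'execa';

// Prefer the CI-provided commit SHA; fall back to the local checkout.
async function getGitHash() {
	if ( process.env.GIT_COMMIT ) {
		return process.env.GIT_COMMIT;
	}
	const { stdout } = await execa( 'git', [ 'rev-parse', 'HEAD' ] );
	return stdout.trim();
}

const payload = {
	branch: process.env.GIT_BRANCH ?? 'trunk',
	hash: await getGitHash(),
	metrics: { largestContentfulPaint: lcp }, // `lcp` comes from the measurement step
	baseMetrics: {}, // intentionally empty: baseline normalization is not used
};
await fetch( process.env.CODEVITALS_URL, {
	method: 'POST',
	headers: { 'Content-Type': 'application/json' },
	body: JSON.stringify( payload ),
} );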
@LiamSarsfield LiamSarsfield added [Type] Infrastructure and removed [Status] Needs Author Reply labels Dec 15, 2025
@LiamSarsfield
Contributor Author

Hey @anomiex 👋 would you mind taking a look at this when you get a chance?

This is the performance testing infrastructure I built during Hack Week (more details here: pc9hqz-3Rb-p2). It measures wp-admin dashboard LCP for Jetpack and posts the results to CodeVitals.
It's a big PR, sorry about that. Most of it breaks down into:

  • scripts/ - Node.js orchestration and Playwright measurement code
  • docker/ - Docker Compose setup for WordPress + Jetpack environment
  • docker/mu-plugins/simulate-wpcom-connection.php - mu-plugin that fakes Jetpack connection and mocks WP.com API
    responses

The mu-plugin is probably the most relevant bit to review from a Jetpack perspective, as it intercepts pre_http_request to return mock responses for various WP.com endpoints.

Happy to walk through any of it if that's easier.

@LiamSarsfield LiamSarsfield marked this pull request as ready for review December 15, 2025 13:46
@LiamSarsfield LiamSarsfield removed the DO NOT MERGE label Dec 15, 2025
@LiamSarsfield LiamSarsfield changed the title [Do not merge] Add Jetpack performance testing CI infrastructure Add Jetpack performance testing CI infrastructure Dec 15, 2025
@LiamSarsfield LiamSarsfield added [Status] Needs Review This PR is ready for review. and removed [Status] In Progress labels Dec 15, 2025
Contributor

@anomiex anomiex left a comment


Seems ok from a monorepo perspective. I didn't look too closely at the code.

If you're wanting a review of the faked-connection stuff in the mu-plugin, @Automattic/jetpack-vulcan would be the team to ask.


## CI Usage

The test suite is designed to run in TeamCity. See build configuration for setup.
Contributor


I wonder if it'd make more sense to run it in Actions rather than TeamCity, on each commit to trunk instead of backfilling weekly.

If you're running it in TeamCity, are you looking at the monorepo, or at https://github.com/Automattic/jetpack-production which has the already-built plugin?

Contributor


One of the exploration intents is to see about reusing work. With our GHE instance, TeamCity is the runner we have to use, so we're seeing the pros/cons of TC here.

Contributor Author

@LiamSarsfield LiamSarsfield Dec 18, 2025


I wonder if it'd make more sense to run it in Actions rather than TeamCity, on each commit to trunk instead of backfilling weekly.

GitHub Actions runners can have unpredictable performance variability due to shared infrastructure, and that noise can drown out the small regressions (5-10%) we want to detect. The TeamCity build runs on a dedicated machine with no other agents interfering.

If you're running it in TeamCity, are you looking at the monorepo, or at Automattic/jetpack-production which has the already-built plugin?

The monorepo is used instead of the already-built plugin mainly so we can leverage commit-level tracking, giving us the ability to bisect when issues appear (where needed). It may not be needed for now, but in CodeVitals you can click a point on the graph and see the commit details via the commit SHA.

[Screenshot: CodeVitals commit details, 2025-12-18]

Contributor


Note that every commit that changes anything in Jetpack gets mirrored to Automattic/jetpack-production, absent rare cases where something goes wrong with the mirroring. Each commit there also includes a footer like Upstream-Ref: Automattic/jetpack@d5b54134bb471f3a54d04b12e128e6e0e2d77bde to make it easy to find the corresponding monorepo commit.

It's up to you, but it may save having to build each monorepo commit before you can test it.

It'll also either save you testing commits that don't affect Jetpack-the-plugin at all or save you having to maintain code to try to identify which ones do. I see your P2 post mentions

that touched PHP files in Jetpack or packages

That would unnecessarily test changes to packages like packages/jetpack-mu-wpcom, unless you have a list of packages to ignore. It could also miss JS-only changes that might still affect the LCP timing.

Contributor Author


Note that every commit that changes anything in Jetpack gets mirrored to Automattic/jetpack-production, absent rare cases where something goes wrong with the mirroring. Each commit there also includes a footer like Upstream-Ref: d5b5413 to make it easy to find the corresponding monorepo commit.

Actually yes! I should have checked this beforehand; I didn't realise Automattic/jetpack-production has bisectable commits like this. It also addresses the issue you mentioned about testing commits that don't affect Jetpack-the-plugin. Thanks, I'll update the TeamCity build to use that instead.

Contributor Author


Updated to use jetpack-production, the implementation now:

  1. Clones jetpack-production instead of building from the monorepo
  2. Parses Upstream-Ref from each mirror commit to track the original monorepo SHA in CodeVitals (see the sketch below)
  3. Removes file filtering from the TC scheduler

The monorepo VCS root is still used for the tools/performance/ scripts, but the plugin itself comes directly from the pre-built mirror.
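
The footer parsing in step 2 can be as small as this (a sketch; the plugin/ checkout path comes from the commit notes above):

import { execa } from 'execa';

// Read the full commit message of the checked-out mirror commit.
const { stdout: message } = await execa( 'git', [ 'log', '-1', '--format=%B' ], {
	cwd: './plugin', // the cloned jetpack-production checkout
} );

// Mirror commits carry a footer like:
//   Upstream-Ref: Automattic/jetpack@d5b54134bb471f3a54d04b12e128e6e0e2d77bde
const match = message.match( /^Upstream-Ref: Automattic\/jetpack@([0-9a-f]{40})$/m );
const monorepoSha = match ? match[ 1 ] : null;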

Comment on lines 20 to 22
// Dependencies are in tools/performance/package.json, not monorepo root
// so the import resolver can't find them. Disable this rule.
'import/no-unresolved': 'off',
Contributor


That doesn't matter; eslint will look in tools/performance/node_modules for files under here.

Also, when I remove this and run eslint, it doesn't seem to complain about anything.

Contributor Author


Nice catch! Removed the import/no-unresolved rule and its comment. Also simplified the config to match tools/cli/eslint.config.mjs.

kraftbj
kraftbj previously approved these changes Dec 16, 2025
Contributor

@kraftbj kraftbj left a comment


I'm not sure where it came from; I was given a notice that macOS's rsync doesn't handle symlinks well, stating I should brew install rsync. It let me proceed, but it failed.

Installing rsync via brew install rsync and trying again worked. I don't think it's a blocker but wanted to note it.

@anomiex
Contributor

anomiex commented Dec 16, 2025

I'm not sure where it came from; I was given a notice that macOS's rsync doesn't handle symlinks well, stating I should brew install rsync. It let me proceed, but it failed.

Probably from

// Apple ships a special fork of openrsync, which has various quirks.
// In particular, it doesn't handle symlink recursion well, which breaks
// in macOS 15.4 and copies child node_modules dirs in 15.5.
//
// See also:
// * p1742486518169009-slack-CDLH4C1UZ
// * https://github.com/apple-oss-distributions/rsync/tree/main/openrsync
if ( os.platform() === 'darwin' ) {
	const { stdout: rsyncVersion } = await execa( 'rsync', [ '--version' ] );
	isOpenrsync = rsyncVersion.indexOf( 'openrsync' ) >= 0;
	if ( isOpenrsync ) {
		const { stdout: macOS_version } = await execa( 'sw_vers', [ '--productVersion' ] );
		if ( macOS_version === '15.4' ) {
			console.error(
				chalk.red(
					'The implementation of rsync in macOS 15.4 is unable to properly sync symlinks.'
				)
			);
			console.error( chalk.red( 'Please install standard rsync (e.g. `brew install rsync`).' ) );
			process.exit( 1 );
		} else {
			console.error(
				chalk.yellow(
					'The implementation of rsync in macOS is unable to properly sync symlinks.'
				)
			);
			console.error(
				chalk.yellow( 'Installing standard rsync (e.g. `brew install rsync`) is recommended.' )
			);
			if ( argv.nonInteractive ) {
				process.exit( 1 );
			}
			console.error();
			await enquirer
				.prompt( {
					type: 'confirm',
					name: 'proceedWithOpenrsync',
					message:
						'Continuing will not break anything, but will copy many unneeded files.\nProceed to sync files?',
					initial: false,
				} )
				.then( answer => {
					if ( ! answer.proceedWithOpenrsync ) {
						process.exit( 0 );
					}
				} );
		}
	}
}

@kraftbj
Contributor

kraftbj commented Dec 16, 2025

Probably from

Yup! Thanks! I scanned the PR but didn't check the existing commands. I wouldn't consider it a blocker for the PR, but it is required to use the fully-leaded version of rsync, not the Mac variant.

- Consolidated rules into a single object for improved readability and maintainability.
- Removed unnecessary comments and streamlined the configuration structure.
@LiamSarsfield
Contributor Author

LiamSarsfield commented Dec 18, 2025

Yup! Thanks! I scanned the PR but didn't check the existing commands. I wouldn't consider it a blocker for the PR, but it is required to use the fully-leaded version of rsync, not the Mac variant.

Ah, nice catch. I must have already had rsync installed, hence why I missed it. I've updated the testing instructions accordingly.

Actually, this is no longer relevant: I've switched to using jetpack-production (the pre-built mirror) instead of building from the monorepo, so rsync is no longer needed at all. Updated the testing instructions to reflect the simpler prerequisites (just Docker and Node 18+).

- Added 'plugin/' directory to .gitignore to exclude cloned plugin files.
- Updated README.md with detailed setup instructions and clarified usage of the pre-built Jetpack plugin.
- Modified docker-compose.yml to mount the plugin from the new directory structure.
- Refactored run-performance-tests.js to clone the Jetpack plugin from the production mirror instead of using rsync, ensuring a more straightforward setup process.
@LiamSarsfield
Contributor Author

Hey @Automattic/jetpack-vulcan 👋

I've been working on performance testing infrastructure for Jetpack (measuring wp-admin LCP with Jetpack connected).
Part of this involves an mu-plugin that simulates a WordPress.com connection without actually connecting to WP.com.
Could someone from the team review the mock implementation at tools/performance/docker/mu-plugins/simulate-wpcom-connection.php?

It currently:

  • Sets up fake blog/user tokens via Jetpack_Options
  • Intercepts HTTP requests to *.wordpress.com and *.wp.com
  • Returns mock responses for common endpoints (token health, site info, stats, sync, etc.)
  • Adds configurable latency (default 200ms) to simulate real-world conditions

Specifically looking for feedback on:

  1. Are the fake connection tokens set up correctly?
  2. Are we missing any critical API endpoints that get called on wp-admin load?
  3. Any concerns with this approach for performance testing?

@fgiannar
Contributor

Hi @LiamSarsfield ,

Thanks for the ping and working on this!

Are the fake connection tokens set up correctly?

Yes 👍

Are we missing any critical API endpoints that get called on wp-admin load?

This would depend on the Jetpack plugin module configuration. I can confirm the endpoints that are called by Connection and Sync packages, but we can't know how each consumer of the Jetpack Connection behaves.
I noticed that you only enable modules that don't require a JP Connection, but I wonder if we should enable all of them to get a realistic worst-case scenario.
One way to check the remote calls to WPCOM on every page load would be to sandbox your environment and monitor debug.log, as we log all sandboxed requests to WPCOM there.

That said, it might make sense to add some logging that would answer this question within the performance testing logic itself? This could be helpful in case more endpoints are added in the future that we don't handle within the testing infrastructure.

Any concerns with this approach for performance testing?

This is not a blocker, but you could consider refactoring get_mock_response to avoid the if/else logic and use e.g. a factory for setting up the fake endpoints.
One additional idea I had is around the latency. At the moment, my understanding is that we assume a generic latency for every endpoint. If we extracted each fake endpoint definition into its own class, we could potentially set the latency per endpoint using actual real-world data we have on WPCOM.
As an example, the jetpack-sync-actions endpoint has a ~470ms median response time and ~3s for the 95th percentile.
We could take it even one step further and define a median and p95 latency per endpoint and repeat our tests for both cases to simulate a normal scenario and a site under stress.

- Introduced a mechanism to log unhandled endpoints, aiding in the identification of missing mock responses.
- Added a flag to track if a specific endpoint handler was matched, improving response handling clarity.
- Updated comments for legacy endpoints to indicate early returns without logging.
@LiamSarsfield
Contributor Author

@fgiannar Thanks for the thorough review!

I noticed that you only enable modules that don't require a JP Connection, but I wonder if we should enable all of them to get a realistic worst-case scenario.

Good point. I initially took the conservative approach to avoid errors from modules expecting real WP.com responses, but enabling all modules would give us a more realistic measurement. I'll look into expanding the module list and adding mock responses for any additional endpoints they require.

It might make sense to add some logging that would answer this question within the performance testing logic itself?

Great idea! I've added logging for any intercepted requests that hit the fallback response. That way it'll catch unhandled endpoints as they appear instead of us discovering them later.

You could consider refactoring get_mock_response to avoid the if/else logic and use e.g. a factory for setting up the fake endpoints.

Agreed that the current implementation is a bit unwieldy. I'll refactor to a registry/factory pattern, which will also make it easier to add per-endpoint configuration.

We could take it even one step further and define a median and p95 latency per endpoint

Love this idea. Using actual latency data would make the measurements much more representative. I created a follow-up issue to:

  1. Extract endpoint definitions to a registry with configurable latency
  2. Add real-world latency values from WP.com metrics
  3. Consider test modes for median vs p95 scenarios

For now I'll focus on the logging improvement as a quick win, thanks again for the detailed feedback! 🙏
