bgpd: fix attr intern overhead #19547

krishna-samy · 2025-09-10T12:38:59Z

Issue:

BGPd incurs CPU overhead at high scale in bgp_attr_intern() -> attrhash_key_make() -> jhash_* when processing UPDATEs that carry a large number of prefixes sharing identical attributes within a single NLRI section. Interning was repeated per prefix, redoing the hash for the same attributes received in the UPDATE message.

Root-cause:

Within a single UPDATE's NLRI section, the same parsed attributes were redundantly interned for each prefix, so attribute hashing was invoked repeatedly across the NLRI loop.

Fix:

Added a per-UPDATE reuse context, struct bgp_attr_reuse_ctx, that is passed into bgp_update():
- If the attributes (after policy) haven't changed from what was parsed and we're still in the same NLRI section, reuse the already-interned attributes (just bump refs) instead of re-hashing and re-interning.
- If not, intern once and remember the result in the per-UPDATE context so that following prefixes with the same unchanged attributes can reuse it.
- bgp_nlri_parse_ip() creates a stack-local ctx and passes &ctx into bgp_update() for the UPDATE's NLRI section; all other callers pass NULL.
- bgp_update() calls attr_unintern_safe_ctx(ctx, &attr) to invalidate the cache before uninterning.
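
For readers who want to see the shape of the idea, here is a minimal, self-contained sketch of a per-UPDATE reuse cache. It is not the PR's code: every name prefixed with toy_ is hypothetical, the attribute struct is reduced to two fields, "unchanged after policy" is modelled as simple pointer equality with the parsed attributes, and the cache is invalidated whenever the cached pointer is uninterned. The hash counter only exists to make the saved work visible.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define TABLE_SIZE 8
#define MAX_PREFIXES 1024

/* Toy stand-in for an interned attribute set (assumption: two fields only). */
struct toy_attr {
	uint32_t med;
	uint32_t local_pref;
	unsigned long refcnt;
};

/* Hypothetical per-UPDATE reuse context, modelled on the description above. */
struct toy_reuse_ctx {
	const struct toy_attr *parsed; /* attributes as parsed from the UPDATE */
	struct toy_attr *interned;     /* cached interned copy, or NULL */
};

static struct toy_attr table[TABLE_SIZE];
static bool used[TABLE_SIZE];
static unsigned long hash_calls; /* how often the hashing cost was paid */

static uint32_t toy_hash(const struct toy_attr *a)
{
	hash_calls++;
	return (a->med * 2654435761u) ^ a->local_pref;
}

/* Intern: hash, scan every slot starting at the hashed one, insert if absent. */
static struct toy_attr *toy_intern(const struct toy_attr *a)
{
	size_t start = toy_hash(a) % TABLE_SIZE;
	struct toy_attr *free_slot = NULL;

	for (size_t n = 0; n < TABLE_SIZE; n++) {
		size_t i = (start + n) % TABLE_SIZE;

		if (used[i] && table[i].med == a->med &&
		    table[i].local_pref == a->local_pref) {
			table[i].refcnt++;
			return &table[i];
		}
		if (!used[i] && !free_slot)
			free_slot = &table[i];
	}
	assert(free_slot != NULL); /* toy table must not fill up */
	*free_slot = *a;
	free_slot->refcnt = 1;
	used[free_slot - table] = true;
	return free_slot;
}

static void toy_unintern(struct toy_attr **pa)
{
	struct toy_attr *a = *pa;

	assert(a != NULL && a->refcnt > 0);
	if (--a->refcnt == 0)
		used[a - table] = false; /* slot may now be reused */
	*pa = NULL;
}

/* Reuse path: bump the refcount of the cached pointer instead of re-hashing. */
static struct toy_attr *toy_intern_reuse(struct toy_reuse_ctx *ctx,
					 const struct toy_attr *a)
{
	bool unchanged = ctx && a == ctx->parsed;
	struct toy_attr *in;

	if (unchanged && ctx->interned) {
		ctx->interned->refcnt++;
		return ctx->interned;
	}
	in = toy_intern(a);
	if (unchanged)
		ctx->interned = in; /* remember for the following prefixes */
	return in;
}

/* Drop the cached pointer before uninterning it so the cache never dangles. */
static void toy_unintern_safe_ctx(struct toy_reuse_ctx *ctx, struct toy_attr **pa)
{
	if (ctx && ctx->interned == *pa)
		ctx->interned = NULL;
	toy_unintern(pa);
}

/* Intern the same attributes for n prefixes; return the number of hash calls. */
static unsigned long toy_process_update(size_t nprefixes, bool use_ctx)
{
	struct toy_attr parsed = { .med = 10, .local_pref = 100, .refcnt = 0 };
	struct toy_reuse_ctx ctx = { .parsed = &parsed, .interned = NULL };
	struct toy_attr *held[MAX_PREFIXES];
	unsigned long before = hash_calls, cost;

	assert(nprefixes <= MAX_PREFIXES);
	for (size_t i = 0; i < nprefixes; i++)
		held[i] = toy_intern_reuse(use_ctx ? &ctx : NULL, &parsed);
	cost = hash_calls - before;
	for (size_t i = 0; i < nprefixes; i++)
		toy_unintern_safe_ctx(use_ctx ? &ctx : NULL, &held[i]);
	return cost;
}
```

Calling toy_process_update(n, false) pays for one hash per prefix, while toy_process_update(n, true) pays for a single hash and n - 1 refcount bumps. Passing NULL for the context keeps the original behaviour, which mirrors how the description says all callers other than bgp_nlri_parse_ip() behave.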

Overall CPU time reduction is ~10% at high scale.

mjstapp · 2025-09-10T12:46:45Z

bgpd/bgp_attr.c

+ * Increment the refcnt for intern'd attr and its intern'd subs
+ * If any intern'd sub-object in 'struct attr' is added/deleted, update this function
+ */
+void bgp_attr_ref(struct attr *attr)


hmm, have some reservations about this. this is a ton of exposure of internal details that will have to be maintained forever - just to be used in exactly one place in bgp_route.c (right?)
could we find a way to do this through the attr_intern() code path - like maybe an adaptation or variant of that to take a reuse context? that might allow all the maintenance of the attr struct internals to remain inside attr_intern and _unintern?

ton31337 · 2025-09-10T12:52:44Z

I'm somewhat afraid that this solution will give us another kind of issue (timing-related) or a use-after-free when these interned pointers go stale or dangle.

Signed-off-by: Krishnasamy <[email protected]>

frrbot bot added bgp bugfix labels Sep 10, 2025

github-actions bot added master size/L labels Sep 10, 2025

mjstapp reviewed Sep 10, 2025


krishna-samy force-pushed the krishna/attr-hashing branch from da9288a to fa8bc02 on September 10, 2025 12:57
