
Conversation

@thisisryanswift

feat: display Google Gemini cached token stats

Closes #6851 ([FEATURE]: Google/VertexAI Context Caching)

What

One-line fix to read cached token counts from Google's metadata so they show up in session stats.

Why

Google returns cached token counts in a different spot than Anthropic:

  • Anthropic: usage.cachedInputTokens
  • Google: providerMetadata.google.usageMetadata.cachedContentTokenCount

Implicit caching was already working server-side (and saving money); we just weren't displaying it.

The fix

- const cachedInputTokens = input.usage.cachedInputTokens ?? 0
+ const cachedInputTokens = input.usage.cachedInputTokens ?? 
+   (input.metadata?.["google"] as any)?.usageMetadata?.cachedContentTokenCount ?? 0
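
For context, the same fallback written out as a standalone helper looks roughly like the sketch below. The UsageLike and GoogleUsageMetadata shapes are simplified assumptions for illustration, not OpenCode's actual types:

  // Sketch only: the `usage` and `metadata` shapes are assumed from the AI SDK's
  // output, not copied from OpenCode's real type definitions.
  interface UsageLike {
    cachedInputTokens?: number
  }

  interface GoogleUsageMetadata {
    cachedContentTokenCount?: number
  }

  function resolveCachedInputTokens(
    usage: UsageLike,
    metadata?: Record<string, unknown>,
  ): number {
    // Anthropic (and the AI SDK's normalized usage object) report cached tokens here.
    if (usage.cachedInputTokens != null) return usage.cachedInputTokens
    // Google reports them under its provider-specific usage metadata instead.
    const google = metadata?.["google"] as
      | { usageMetadata?: GoogleUsageMetadata }
      | undefined
    return google?.usageMetadata?.cachedContentTokenCount ?? 0
  }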

Tested

Verified locally with a couple of Gemini conversations.


Future: Explicit Caching

Google has two caching modes:

  • Implicit (what we're using): Automatic, server-side, probabilistic
  • Explicit: Guaranteed cache hits, requires managing cache objects

For explicit caching, we'd need:

  1. Add @google/generative-ai dependency
  2. Use GoogleAICacheManager to create/update/delete caches with TTL
  3. Pass cache name via providerOptions.google.cachedContent

This is a bigger lift but would give guaranteed savings. See: https://ai.google.dev/gemini-api/docs/caching
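
Not part of this PR, but for illustration, a rough sketch of that explicit-caching flow is below. It assumes the GoogleAICacheManager API from @google/generative-ai/server and the cachedContent option from @ai-sdk/google; the model id, TTL, and prompt are placeholders, so treat the details as assumptions rather than a working implementation:

  // Rough sketch of explicit caching, not part of this change.
  import { GoogleAICacheManager } from "@google/generative-ai/server"
  import { createGoogleGenerativeAI } from "@ai-sdk/google"
  import { generateText } from "ai"

  const apiKey = process.env.GOOGLE_GENERATIVE_AI_API_KEY!
  const cacheManager = new GoogleAICacheManager(apiKey)

  // 1. Create a cache object with a TTL. `bigStableContext` stands in for the
  //    large, repeated context (system prompt, docs, etc.) worth caching.
  const bigStableContext = "..."
  const cache = await cacheManager.create({
    model: "models/gemini-2.5-flash", // placeholder; check which models support explicit caching
    contents: [{ role: "user", parts: [{ text: bigStableContext }] }],
    ttlSeconds: 300,
  })

  // 2. Pass the cache name via provider options so requests reuse it.
  const google = createGoogleGenerativeAI({ apiKey })
  const result = await generateText({
    model: google("gemini-2.5-flash"),
    prompt: "Answer using the cached context.",
    providerOptions: {
      google: { cachedContent: cache.name },
    },
  })

  console.log(result.text)

Updating or deleting the cache as the session's context changes would go through the same cache manager, which is the "managing cache objects" part of the lift.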

Extract cached token counts from Google's response metadata so they're
visible in OpenCode's usage display.

Gemini 2.5+ models use implicit caching (automatic, server-side). However,
OpenCode wasn't reading the cached token counts, because Google reports them
at its own metadata location (usageMetadata.cachedContentTokenCount) rather
than the standard location.

This enables users to see their Gemini cache hits in the session context
usage display, and cost calculations will correctly account for cached
tokens.

Verified working: tested with gemini-3-flash-preview, observed cache.read
values of 16K, 49K, and 107K tokens in a multi-turn conversation.

Future opportunity: For guaranteed cache hits, explicit caching could be
implemented using GoogleAICacheManager + providerOptions.google.cachedContent.
See: https://ai.google.dev/gemini-api/docs/caching
@rekram1-node
Collaborator

So the Google AI SDK provider doesn't track it?

@thisisryanswift
Author

Doesn't seem like it locally. The before/after of this change is that I can now go back to sessions and actually see a cached token amount.

Could be something upstream in Vercel's AI SDK?

Transparently, I mostly vibe coded here, though I did try to be intentional. I ultimately rolled back my initial issue: I thought it wasn't caching at all, but really it's just not reporting/recording the caching.

@rekram1-node
Collaborator

What provider are you using? Google directly? Any plugins?

@thisisryanswift
Author

Google via API key. The only plugin is a notifier I vibe coded. None of the Antigravity or Gemini CLI auth provider plugins.

@rekram1-node
Collaborator

Hmm, I tested this several times and couldn't get any difference in token counting, but it could just be happenstance.

@thisisryanswift
Author

So I don't think it will change your total token count in the TUI. But this change will correctly identify cached tokens, so it affects the price displayed in the TUI, and the cached token counts would be there if someone went back and analyzed their sessions. Previously, Gemini was showing zero cached tokens (at least it was for me locally) when you looked back at old sessions; it was just never recording the implicit/automatic caching.

@danchurko

Do you think this change will apply the fix to the Vertex AI connections, too?

@thisisryanswift
Author

Do you think this change will apply the fix to the Vertex AI connections, too?

IIRC, the caching is generally by model, not by provider, but there is a mix of both. If you use Anthropic Claude models via Vertex, I think the current system will already cache correctly, though I didn't test this myself. If you use Gemini via Vertex, my assumption would be that server-side implicit caching is already working automatically, but it suffers the same fate this fixes, where OpenCode doesn't record it. Not 100% sure.
