Use structured JSON output for local model classification by jdcodes1 · Pull Request #23 · imbue-ai/bouncer

jdcodes1 · 2026-04-12T01:45:14Z

Summary

Use WebLLM's response_format: { type: 'json_object' } to force local models to output valid JSON instead of freeform text
Update parseLocalModelResponse() to try JSON parsing first, falling back to the existing regex-based parsing for backward compatibility
Simplify LOCAL_SYSTEM_PROMPT from 21 lines to 4 — format constraints are now enforced by the engine, not by prompt instructions
Pass responseFormat as an optional parameter to generate() so non-classification callers (e.g. suggestAnnoyingReasons) still get freeform text
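The optional `responseFormat` parameter described above could look roughly like the sketch below. The `ChatEngine`, `ChatRequest`, and `buildChatRequest` names are hypothetical stand-ins for the small slice of the OpenAI-style chat API that WebLLM exposes; the PR's real `generate()` signature is not shown on this page, so treat this as an illustration of the idea rather than the actual diff.

```typescript
// Hypothetical stand-ins for the slice of an OpenAI-style chat API used here;
// in the real code these types would come from the vendored WebLLM build.
type ResponseFormat = { type: "json_object" } | { type: "text" };

interface ChatMessage {
  role: "system" | "user";
  content: string;
}

interface ChatRequest {
  messages: ChatMessage[];
  response_format?: ResponseFormat;
}

interface ChatEngine {
  chat: {
    completions: {
      create(request: ChatRequest): Promise<{
        choices: { message: { content: string | null } }[];
      }>;
    };
  };
}

// Constant name taken from the PR's change table; the value follows the summary.
const CLASSIFICATION_RESPONSE_FORMAT: ResponseFormat = { type: "json_object" };

// Hypothetical helper: builds the request and attaches response_format only
// when the caller supplied one, so freeform callers are unaffected.
function buildChatRequest(
  systemPrompt: string,
  userPrompt: string,
  responseFormat?: ResponseFormat,
): ChatRequest {
  const request: ChatRequest = {
    messages: [
      { role: "system", content: systemPrompt },
      { role: "user", content: userPrompt },
    ],
  };
  if (responseFormat) {
    request.response_format = responseFormat;
  }
  return request;
}

// Sketch of generate(): classification callers pass
// CLASSIFICATION_RESPONSE_FORMAT, other callers omit the last argument.
async function generate(
  engine: ChatEngine,
  systemPrompt: string,
  userPrompt: string,
  responseFormat?: ResponseFormat,
): Promise<string> {
  const reply = await engine.chat.completions.create(
    buildChatRequest(systemPrompt, userPrompt, responseFormat),
  );
  return reply.choices[0]?.message.content ?? "";
}
```

Keeping the format decision with the caller means a classification path can opt in while a caller such as suggestAnnoyingReasons simply leaves the argument out.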

Problem: Local model responses are parsed with regex looking for "Matches X" or "No match" in freeform text. When the model phrases things differently, the regex misses and the post silently gets classified as "no match" — a false negative.

Fix: Constrained decoding guarantees syntactically valid JSON output, and the system prompt asks for the shape {"reasoning": "...", "match": "..."}, so parsing no longer depends on the model's exact phrasing. The vendored WebLLM (0.2.82-custom) already supports the json_object response format; it just wasn't being used.
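A minimal sketch of the JSON-first, regex-fallback parsing described above. The function name `parseClassification`, the `Classification` return shape, and the exact fallback regex are assumptions made for illustration; the PR's actual `parseLocalModelResponse()` body is not shown on this page.

```typescript
// Assumed result shape: match is null when the post hits no filter category.
interface Classification {
  reasoning: string;
  match: string | null;
}

// Hypothetical parser: try JSON first, then fall back to freeform text.
function parseClassification(raw: string): Classification {
  const text = raw.trim();

  try {
    const parsed: unknown = JSON.parse(text);
    if (typeof parsed === "object" && parsed !== null && !Array.isArray(parsed)) {
      const obj = parsed as Record<string, unknown>;
      const reasoning = typeof obj.reasoning === "string" ? obj.reasoning : "";
      const candidate = typeof obj.match === "string" ? obj.match.trim() : "";
      // Treat JSON null, the empty string, and the literal string "null" as no match.
      const isNoMatch = candidate === "" || candidate.toLowerCase() === "null";
      return { reasoning, match: isNoMatch ? null : candidate };
    }
  } catch {
    // Not JSON: fall through to the freeform path below.
  }

  // Fallback: look for a trailing "Matches <topic>" statement (assumed pattern).
  const hit = /\bMatches\s+(.+?)\s*\.?\s*$/i.exec(text);
  return { reasoning: text, match: hit ? hit[1] : null };
}
```

With this shape, a reply such as `{"reasoning": "about elections", "match": "politics"}` takes the JSON path, while older freeform replies ending in "Matches politics." still resolve through the fallback.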

Changes

| File | What |
| --- | --- |
| `src/background/local-model.ts` | Add `CLASSIFICATION_RESPONSE_FORMAT` constant, make `responseFormat` optional on `generate()`, pass it from `callLocalInference`, guard against `"null"` string match |
| `src/shared/prompts.ts` | Simplify `LOCAL_SYSTEM_PROMPT` (21 lines → 4 lines) |
| `tests/background/local-model.test.ts` | 6 new tests: JSON match, no-match, empty string, "null" string, regex fallback, malformed JSON fallback |

Test plan

6 new tests for JSON parsing + edge cases
All 205 tests pass (existing regex-based tests still pass via fallback path)
response_format only applied to classification calls, not suggestAnnoyingReasons
Backward compatible — non-JSON model output still parsed via regex fallback

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…" string match

- response_format is now passed by callers (callLocalInference) rather than hardcoded in generate(), so non-classification callers like suggestAnnoyingReasons still get freeform text output
- Treat the literal string "null" as no-match in JSON parsing
- Add test for "null" string edge case

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

imperion402 · 2026-04-12T02:01:13Z

+    // Not JSON — fall through to regex parsing
+  }
+
+  // Fallback: freeform text parsing (backward compatibility)


what is this backward compatibility you speak of?

imperion402 · 2026-04-12T02:02:14Z

-You will be provided with a post (<post>) and a list of filter categories (<filter_categories>).
-Assess whether the topic of the post relates to any of the topics in the filter categories list.
-Your reasoning must be AT MOST 15 words, and MUST end with a statement of "Matches <topic>" or "No match".
+Respond with JSON: {"reasoning": "<10-15 words about what the post is about>", "match": "<matched category or null>"}


What evals did you perform to validate that this achieves similar metrics? What are the F1 score, accuracy, precision, etc.?

gnguralnick · 2026-04-13

This is the key point: we experimented previously with structured output like this and found that a prompt this simple actually leads to far worse classification performance, which is why local models now receive approximately the same prompt as API ones. I'd love to be able to use a much shorter prompt if it actually worked as well, since it would certainly process much faster.
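For reference, the metrics requested in this thread could be computed from a labeled set of posts along the lines below. This is only a sketch under stated assumptions: the `LabeledPrediction` shape and `scoreClassifier` name are hypothetical, the scoring is binary (was any category matched or not), and nothing here reflects an eval harness that exists in the repository.

```typescript
// Hypothetical row: the expected label and the model's prediction for one
// post; null means "no match".
interface LabeledPrediction {
  expected: string | null;
  predicted: string | null;
}

interface Scores {
  accuracy: number;
  precision: number;
  recall: number;
  f1: number;
}

// Binary scoring: a post counts as positive when any category is matched.
function scoreClassifier(rows: LabeledPrediction[]): Scores {
  let tp = 0;
  let fp = 0;
  let fn = 0;
  let tn = 0;
  for (const row of rows) {
    const shouldMatch = row.expected !== null;
    const didMatch = row.predicted !== null;
    if (shouldMatch && didMatch) tp++;
    else if (!shouldMatch && didMatch) fp++;
    else if (shouldMatch && !didMatch) fn++;
    else tn++;
  }
  // Guard every ratio against an empty denominator.
  const ratio = (num: number, den: number): number => (den === 0 ? 0 : num / den);
  const precision = ratio(tp, tp + fp);
  const recall = ratio(tp, tp + fn);
  return {
    accuracy: ratio(tp + tn, rows.length),
    precision,
    recall,
    f1: ratio(2 * precision * recall, precision + recall),
  };
}
```

Running the same labeled posts through the old and new prompts and comparing the two `Scores` objects would give a like-for-like answer to the question above. A stricter variant would also check that the predicted category equals the expected one.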

The 'Check upstream' example implied local models always use a reasoning prompt (per PR imbue-ai#23). That was Qwen-era: after migrating local inference to LiteRT/Gemma, upstream itself adopted the terse table_yesno prompt. Note the lesson is model-specific + eval-gated, and that PR imbue-ai#23 was a third-party, unmerged JSON-output PR on the old codebase.

jdcodes1 and others added 4 commits April 11, 2026 21:43

test: add structured JSON output tests for parseLocalModelResponse

b705e89

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat: support structured JSON output in parseLocalModelResponse with …

18666d6

…regex fallback Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat: use WebLLM json_object response_format for structured classific…

0a3291f

…ation output Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

imperion402 reviewed Apr 12, 2026

rishabgit mentioned this pull request May 25, 2026

feat: dual local inference engine (WebLLM/Qwen + LiteRT-LM/Gemma) rishabgit/bouncer#5

Merged

jdcodes1 wants to merge 4 commits into imbue-ai:main from jdcodes1:feat/structured-json-output
