
Conversation


@JamesMustafa JamesMustafa commented Sep 20, 2025

Description of the feature

  • Updated the ChatOllamaInput thinking property to cover the recent additions to Ollama's thinking support; specifically, thinking intensity levels (low, medium, high) were added. This change is already present in the LangChain Python library.

  • Separated the thinking content from the actual answer content. This was very confusing for me initially, as I couldn't distinguish the model's thoughts from its actual response. The Ollama API provides message.content and message.thinking attributes, which were already consumed by LangChain JS; however, they were combined into a single token, which led to one unified final content:

// Pre-PR behavior: with thinking enabled, thinking and answer text flow
// through the same token, so the final content ends up merged.
const token = this.think
  ? responseMessage.thinking ?? responseMessage.content ?? ""
  : responseMessage.content ?? "";

In this PR, a thinking_content attribute is added to additional_kwargs, isolating the thinking content. This attribute is available whenever thinking is enabled, and the content attribute now holds only the actual answer.
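
For contrast, here is a minimal sketch of that separation (illustrative only, not the actual diff in this PR):

// Illustrative sketch, not the PR's actual change: keep the answer and the
// reasoning in separate fields instead of merging them into one token.
const token = responseMessage.content ?? "";
const additionalKwargs = this.think
  ? { thinking_content: responseMessage.thinking ?? "" }
  : {};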

Results

Attaching an example AIMessage output payload I got while testing with a local Ollama gpt-oss:20b model:

{
  "AIMessage": {
    "content": "There are **3** occurrences of the letter “r” in the word *strawberry*.",
    "additional_kwargs": {
      "thinking_content": "The user asks: \"how many r in the word strawberry?\" They want the count of letter 'r' in the word \"strawberry\". The word \"strawberry\" has letters: s t r a w b e r y. Count r's: there are 3 r's? Let's check: s(1), t(2), r(3), a(4), w(5), b(6), e(7), r(8), r(9), y(10). So r appears at positions 3, 8, 9. That's 3 r's. So answer: 3. But maybe they want explanation. The user just asked \"how many r in the word strawberry?\" So answer: 3. Provide a short answer."
    },
    "response_metadata": {
      "model": "gpt-oss:20b",
      "created_at": "2025-09-20T10:11:59.4606692Z",
      "done": true,
      "done_reason": "stop",
      "total_duration": 5971396100,
      "load_duration": 4384213800,
      "prompt_eval_count": 121,
      "prompt_eval_duration": 234847800,
      "eval_count": 191,
      "eval_duration": 1329256500
    },
    "tool_calls": [],
    "invalid_tool_calls": [],
    "usage_metadata": {
      "input_tokens": 121,
      "output_tokens": 191,
      "total_tokens": 312
    }
  }
}
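
For reference, a minimal usage sketch that would produce a payload like the one above; the think option and the thinking_content key are the ones described in this PR, and the @langchain/ollama entry point is assumed:

import { ChatOllama } from "@langchain/ollama";
import { HumanMessage } from "@langchain/core/messages";

// "low" | "medium" | "high" are the new thinking intensity values;
// true/false still toggle thinking on and off.
const ollama = new ChatOllama({ model: "gpt-oss:20b", think: "high" });

const res = await ollama.invoke([
  new HumanMessage("How many r in the word strawberry?"),
]);

console.log(res.content);                            // the answer only
console.log(res.additional_kwargs.thinking_content); // the isolated reasoning text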

Tests

Comprehensive integration tests for this feature have been added.

Docs

TODO...


changeset-bot bot commented Sep 20, 2025

⚠️ No Changeset found

Latest commit: 5bbc81e

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types
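
For reference, a changeset is just a small markdown file under .changeset/; a hypothetical one for this PR (package name and semver bump type are assumptions) could look like:

---
"@langchain/ollama": minor
---

Support thinking intensity levels and expose thinking_content in additional_kwargs.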



vercel bot commented Sep 20, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

Project          | Deployment | Preview | Comments | Updated (UTC)
langchainjs-docs | Ready      | Ready   | Preview  | Sep 20, 2025 11:23am

1 Skipped Deployment

Project              | Deployment | Preview | Updated (UTC)
langchainjs-api-refs | Ignored    | Ignored | Sep 20, 2025 11:23am

Comment on lines +95 to +102
// Quick test that string values work in practice
const res = await ollamaHigh.invoke([
  new HumanMessage({ content: "How many r in the word strawberry?" })
]);

expect(res).toBeDefined();
expect(typeof res.content).toBe("string");
expect(res.content.length).toBeGreaterThan(0);
christian-bromann (Member) commented:

There's no reason to use the model if we just do type checks. In the v1 branch we have migrated to Vitest, which lets us write type tests.

JamesMustafa (Author) commented:

Hi @christian-bromann and thanks a lot for your review and feedback! :))

Can you please clarify what, in your opinion, would be the best approach here? Since I already test model invocation in another test (test string thinking parameter '$thinkLevel'), I removed it from this one, as you suggested.

I read a bit about Vitest, and I guess we could do something like this here:

test("test type safety for thinking parameter values", () => {
  expectTypeOf<ChatOllamaInput['think']>().toEqualTypeOf<boolean | ThinkingIntensity | undefined>()

  expectTypeOf<'high'>().toExtend<ThinkingIntensity>()
  expectTypeOf<'medium'>().toExtend<ThinkingIntensity>()
  expectTypeOf<'low'>().toExtend<ThinkingIntensity>()

  expectTypeOf<{ think: ThinkingIntensity }>().toExtend<Partial<ChatOllamaInput>>()
  expectTypeOf<{ think: boolean }>().toExtend<Partial<ChatOllamaInput>>()
});

However, Vitest is still not present in the package.json of the langchain-ollama package. I see that it's already used in some of your internal libraries and that you plan to migrate to it, but what would be the best option right now? On one hand, I'm not sure whether adding Vitest to package.json would do any harm; on the other hand, if I do some workaround for type checking with Jest, that will create technical debt for you in the future. What do you suggest?
