Fix Qwen3 thinking model params overwritten by instruct defaults by conorbronsdon · Pull Request #15 · rungalileo/agent-leaderboard

conorbronsdon · 2026-03-13T06:07:43Z

User description

Summary

Adds missing else clause so the Qwen3 instruct-model parameters don't unconditionally overwrite the thinking-model parameters

Context

The instruct-model parameter block (Temperature=0.7, TopP=0.8) ran after the if "thinking" block without an else, silently overwriting the thinking-model's recommended parameters (Temperature=0.6, TopP=0.95) for models like Qwen3-235B-A22B-Thinking-2507. Both sets of parameters reference Qwen's official best-practices docs on HuggingFace.

Test plan

Verify thinking models (e.g. Qwen3-235B-A22B-Thinking-2507) get Temperature=0.6, TopP=0.95
Verify instruct models (e.g. Qwen3-235B-A22B-Instruct-2507) get Temperature=0.7, TopP=0.8

🤖 Generated with Claude Code

Generated description

Below is a concise technical summary of the changes proposed in this PR:
Ensure the LLMHandler.get_llm method applies the correct Qwen3 best-practice temperature, TopP, TopK, and MinP values based on whether a thinking or instruct variant is requested. Update the model-name checks to normalize casing so the correct branch is chosen before the instruct defaults run.

Latest Contributors(1)

User	Commit	Date
pratik@galileo.ai	changes-for-kimi-thinking	November 18, 2025

This pull request is reviewed by Baz. Review like a pro on (Baz).

The instruct-model parameter block ran unconditionally after the `if "thinking"` block, silently overwriting the thinking-model's recommended parameters (Temperature=0.6, TopP=0.95) with the instruct defaults (Temperature=0.7, TopP=0.8). Add the missing `else` clause so thinking and instruct models each get their correct parameters per Qwen's best-practices docs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The "thinking" and "qwen3" substring checks were case-sensitive, but model names use capitalized forms (e.g. Qwen3-235B-A22B-Thinking-2507). This caused thinking models to fall through to instruct defaults. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

galileo-automation · 2026-05-05T02:25:54Z

No activity for 30 days — this PR will be closed in 5 days unless updated.

conorbronsdon · 2026-05-05T16:37:44Z

@bhavsarpratik ?

conorbronsdon marked this pull request as ready for review March 13, 2026 06:13

baz-reviewer Bot reviewed Mar 13, 2026

View reviewed changes

Comment thread v2/evaluate/llm_handler.py

baz-reviewer Bot approved these changes Mar 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Qwen3 thinking model params overwritten by instruct defaults#15

Fix Qwen3 thinking model params overwritten by instruct defaults#15
conorbronsdon wants to merge 2 commits into
rungalileo:mainfrom
conorbronsdon:fix/qwen3-thinking-params

conorbronsdon commented Mar 13, 2026 •

edited by baz-reviewer Bot

Loading

Uh oh!

Uh oh!

galileo-automation commented May 5, 2026

Uh oh!

conorbronsdon commented May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

conorbronsdon commented Mar 13, 2026 • edited by baz-reviewer Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

User description

Summary

Context

Test plan

Generated description

Uh oh!

Uh oh!

galileo-automation commented May 5, 2026

Uh oh!

conorbronsdon commented May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

conorbronsdon commented Mar 13, 2026 •

edited by baz-reviewer Bot

Loading