██╗ ██╗███████╗██████╗ ███╗ ███╗███████╗███████╗ █████╗ ██████╗ ███████╗███╗ ██╗████████╗
██║ ██║██╔════╝██╔══██╗████╗ ████║██╔════╝██╔════╝ ██╔══██╗██╔════╝ ██╔════╝████╗ ██║╚══██╔══╝
███████║█████╗ ██████╔╝██╔████╔██║█████╗ ███████╗█████╗███████║██║ ███╗█████╗ ██╔██╗ ██║ ██║
██╔══██║██╔══╝ ██╔══██╗██║╚██╔╝██║██╔══╝ ╚════██║╚════╝██╔══██║██║ ██║██╔══╝ ██║╚██╗██║ ██║
██║ ██║███████╗██║ ██║██║ ╚═╝ ██║███████╗███████║ ██║ ██║╚██████╔╝███████╗██║ ╚████║ ██║
╚═╝ ╚═╝╚══════╝╚═╝ ╚═╝╚═╝ ╚═╝╚══════╝╚══════╝ ╚═╝ ╚═╝ ╚═════╝ ╚══════╝╚═╝ ╚═══╝ ╚═╝
╭────────────────────────────────── Hermes Agent v0.15.1 (2026.5.29) · upstream 6110aed9 ──────────────────────────────────╮
│ Available Tools │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⣀⡀⠀⣀⣀⠀⢀⣀⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ browser: browser_back, browser_click, ... │
│ ⠀⠀⠀⠀⠀⠀⢀⣠⣴⣾⣿⣿⣇⠸⣿⣿⠇⣸⣿⣿⣷⣦⣄⡀⠀⠀⠀⠀⠀⠀ browser-cdp: browser_cdp, browser_dialog │
│ ⠀⢀⣠⣴⣶⠿⠋⣩⡿⣿⡿⠻⣿⡇⢠⡄⢸⣿⠟⢿⣿⢿⣍⠙⠿⣶⣦⣄⡀⠀ clarify: clarify │
│ ⠀⠀⠉⠉⠁⠶⠟⠋⠀⠉⠀⢀⣈⣁⡈⢁⣈⣁⡀⠀⠉⠀⠙⠻⠶⠈⠉⠉⠀⠀ code_execution: execute_code │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣴⣿⡿⠛⢁⡈⠛⢿⣿⣦⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ computer_use: computer_use │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠿⣿⣦⣤⣈⠁⢠⣴⣿⠿⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ cronjob: cronjob │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠈⠉⠻⢿⣿⣦⡉⠁⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ delegation: delegate_task │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠘⢷⣦⣈⠛⠃⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ discord: discord │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢠⣴⠦⠈⠙⠿⣦⡄⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ (and 21 more toolsets...) │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠸⣿⣤⡈⠁⢤⣿⠇⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠉⠛⠷⠄⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ Available Skills │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⣀⠑⢶⣄⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ autonomous-ai-agents: claude-code, codex, hermes-agent, kanban-codex-... │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⣿⠁⢰⡆⠈⡿⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ creative: architecture-diagram, ascii-art, ascii-video, b... │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠈⠳⠈⣡⠞⠁⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ data-science: jupyter-live-kernel │
│ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠈⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀ devops: enterprise-hardware-sourcing, imessage-bluebubb... │
│ email: himalaya │
│ vLLM · Nous Research gaming: minecraft-modpack-server, pokemon-player │
│ /home/faisal general: browser-setup, dogfood, yuanbao │
│ Session: 20260610_185917_132f6f github: codebase-inspection, github-auth, github-code-r... │
│ mcp: native-mcp │
│ media: gif-search, heartmula, songsee, spotify, youtub... │
│ mlops: audiocraft-audio-generation, dspy, evaluating-l... │
│ note-taking: obsidian │
│ productivity: airtable, google-workspace, linear, maps, nano-... │
│ red-teaming: godmode │
│ research: arxiv, blogwatcher, ecosystem-research, llm-lan... │
│ smart-home: openhue │
│ social-media: xurl │
│ software-development: debugging-hermes-state, debugging-hermes-tui-co... │
│ │
│ Profile: sami │
│ 29 tools · 94 skills · /help for commands │
│ ⚠ 1284 commits behind — run hermes update to update │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
Welcome to Hermes Agent! Type your message or /help for commands.
✦ Tip: The TUI renders LaTeX inline — $E=mc^2$ becomes Unicode math instead of raw TeX.
────────────────────────────────────────
● Look up KVarN by Huawei CSL. Share what information you can about it, please
Initializing agent...
────────────────────────────────────────
┌─ Reasoning ──────────────────────────────────────────────────────────────────────────────────────────────────────────────┐
The user is asking me to look up information about "KVarN by Huawei CSL." This seems
to be a search/research task. Let me use the search tool to find information about
this.
CSL likely refers to China Software Laboratory, which is the research lab of Huawei
.
Let me search for this.
└──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┘
╭─ ⚕ Hermes ───────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
<arg_key>query</arg_key>
<arg_value>KVarN Huawei CSL China Software Laboratory</arg_value>
<arg_key>query</arg_key>
<arg_value>KVarN Huawei CSL China Software Laboratory</arg_value>
<arg_key>query</arg_key>
<arg_value>KVarN by Huawei CSL ontological reasoning</arg_value>
"KVarN by Huawei CSL ontological reasoning"
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
⚠ Auxiliary title generation failed: Request timed out.
⚕ vLLM │ 17.4K/65.5K │ [███░░░░░░░] 27% │ 46m │ ⏲ 15s
────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
sami ❯
────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
Resume this session with:
hermes --resume 20260610_185917_132f6f -p sami
Session: 20260610_185917_132f6f
Duration: 46m 28s
Messages: 2 (1 user, 0 tool calls)
Your current environment
Hardware/Software:
Initial Feedback:
Switching the KV cache from fp16 to kvarn_k4v2_g128 increased the GPU KV cache size from 861,434 tokens to 2,754,939 tokens, about 3.2x.
That's just amazing. Great job!
Problem:
When the Hermes agent is asked to perform a task that requires tool calling, the tool call is emitted as plain text inside the agent reply (the `<arg_key>`/`<arg_value>` lines in the CLI chat log) instead of triggering the actual tool.
The agent then stops and takes no further action.
The vLLM logs still show ongoing activity, visible as generation throughput.
Several minutes later, vLLM activity stops and the agent has produced nothing.
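To help narrow this down outside the agent, here is a minimal sketch that checks whether a reply string contains tool-call arguments leaked as raw text. It assumes the leaked markup uses the `<arg_key>`/`<arg_value>` tag pair seen in the CLI chat log. The function name `find_leaked_tool_args` is hypothetical and is not part of vLLM or Hermes.

```python
import re

# Tag pair as it appears in the CLI chat log (assumption: other chat
# templates or tool-call parsers may use different markup).
ARG_PAIR = re.compile(
    r"<arg_key>(.*?)</arg_key>\s*<arg_value>(.*?)</arg_value>",
    re.DOTALL,
)


def find_leaked_tool_args(reply_text: str) -> list[tuple[str, str]]:
    """Return (key, value) pairs that show up as raw text in a reply."""
    return [(k.strip(), v.strip()) for k, v in ARG_PAIR.findall(reply_text)]


# Sample taken from the agent reply in the CLI chat log.
reply = """<arg_key>query</arg_key>
<arg_value>KVarN Huawei CSL China Software Laboratory</arg_value>"""

leaked = find_leaked_tool_args(reply)
print(leaked)
```

A non-empty result means the model's tool-call arguments reached the client as message text rather than as a structured tool call, which matches the "0 tool calls" count in the session summary.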
1st vLLM Launch Command Attempted:
2nd vLLM Launch Command:
CLI Chat
How would you like to use vllm
I expect tools to be called normally, as they are when KVarN is not used.