extend server/public_simplechat with simple minded interactive browser-client side based toolcalling - base logic #16563

hanishkvc · 2025-10-13T14:10:21Z

Extends my earlier simple minded tools/server/public_simplechat web/browser ui for llama.cpp to include support for a simple minded interactive tool calling which uses the javascript environment of the browser to provide some basic tool / function calls.

Currently it provides simple_calculator and run_javascript_function_code tool calls.

If ToolCalling is enabled in ui settings, meta data about these tools is handshaked with the GenAi/LLM model. Inturn if the ai model used is aware of tool calling and makes a tool_calls request, the user is shown the tool name and the argument being passed to it. User can verify the same and trigger the tool call as is or make changes as needed before triggering the tool call.

The result of the tool call is automatically placed into the user query chat area, with tool_response tag surrounding it. The user can submit the response as is or make suitable changes to the tool response contents before submitting the same to the ai model.

NOTE: This is for a simple minded exploration of tool calling support in newer ai models and some fun along the way as well as occasional practical use like verifying mathematical or logical statements/reasoning made by the ai model during chat sessions by getting it to also create and execute code to verify such stuff and so.

Later I may also add a web_fetch tool call which will work with a local web proxy/cache server (may implement a simple minded one with white list or so) to access content from internet and thus allow ai model to augment its context with additional data as needed, when it is generating its response.

[[OLD NOTE: The ai model created code is currently run in the browser's global scope, so always cross check the tool call before allowing/running it. In a later version will be updating the logic so that the generated tool call is run within a web worker scope, to limit its powers a little bit, but always be careful when using this. OLD]]

The ai model created code is run from within a web worker context in the browser, to try and isolate it from the main browser context. However any shared web worker context, if any, is not isolated. Always cross check the tool call before allowing/running it.

Bit more details about this feature is in the updated readme.md within public_simplechat.

NOTE: The tool calling has been implemented for the chat streaming mode for now. Will add support for oneshot mode later. Tool calling with this logics current simple minded ideosynchronusy (noted in readme.md) has been tested with Gemma3N model for now.

Enable streaming by default, to check the handshake before going on to change the code, given that havent looked into this for more than a year now and have been busy with totally different stuff. Also updated the user messages used for testing a bit

Define the meta that needs to be passed to the GenAi Engine. Define the logic that implements the tool call, if called. Implement the flow/structure such that a single tool calls implementation file can define multiple tool calls.

Make tooljs structure and flow more generic Add a simple_calculator tool/function call logic Add initial skeleton wrt the main tools.mjs file.

Changed latestResponse type to an object instead of a string. Inturn it contains entries for content, toolname and toolargs. Added a custom clear logic due to the same and used it to replace the previously simple assigning of empty string to latestResponse. For now in all places where latestReponse is used, I have replaced with latestReponse.content. Next need to handle identifying the field being streamed and inturn append to it. Also need to add logic to call tool, when tool_call triggered by genai.

Update response_extract_stream to check for which field is being currently streamed ie is it normal content or tool call func name or tool call func args and then return the field name and extracted value. Previously it was always assumed that only normal content will be returned. Currently it is assumed that the server will only stream one of the 3 supported fields at any time and not more than one of them at the same time. TODO: Have to also add logic to extract the reasoning field later, ie wrt gen ai models which give out their thinking. Have updated append_response to expect both the key and the value wrt the latestResponse object, which it will be manipualted. Previously it was always assumed that content is what will be got and inturn appended.

I was wrongly checking for finish_reason to be non null, before trying to extract the genai content/toolcalls, have fixed this oversight with the new flow in progress. I had added few debug logs to identify the above issue, need to remove them later. Note: given that debug logs are disabled by replacing the debug function during this program's initialisation, which I had forgotten about, I didnt get the debug messages and had to scratch my head a bit, before realising this and the other issue ;) Also either when I had originally implemented simplechat 1+ years back, or later due to changes on the server end, the streaming flow sends a initial null wrt the content, where it only sets the role. This was not handled in my flow on the client side, so a null was getting prepended to the chat messages/responses from the server. This has been fixed now in the new generic flow.

Make latestResponse into a new class based type instance wrt ai assistant response, which is what it represents. Move clearing, appending fields' values and getting assistant's response info (irrespective of a content or toolcall response) into this new class and inturn use the same.

Switch oneshot handler to use AssistantResponse, inturn currenlty only handle the normal content in the response. TODO: If any tool_calls in the oneshot response, it is currently not handled. Inturn switch the generic/toplevel handle response logic to use AssistantResponse class, given that both oneshot and the multipart/streaming flows use/return it. Inturn add trimmedContent member to AssistantResponse class and make the generic handle response logic to save the trimmed content into this. Update users of trimmed to work with this structure.

As there could be failure wrt getting the response from the ai server some where in between a long response spread over multiple parts, the logic uses the latestResponse to cache the response as it is being received. However once the full response is got, one needs to transfer it to a new instance of AssistantResponse class, so that latestResponse can be cleared, while the new instance can be used in other locations in the flow as needed. Achieve the same now.

Previously if content was empty, it would have always sent the toolcall info related version even if there was no toolcall info in it. Fixed now to return empty string, if both content and toolname are empty.

The implementations of javascript and simple_calculator now use provided helpers to trap console.log messages when they execute the code / expression provided by GenAi and inturn store the captured log messages in the newly added result key in tc_switch This should help trap the output generated by the provided code or expression as the case maybe and inturn return the same to the GenAi, for its further processing.

Checks for toolname to be defined or not in the GenAi's response If toolname is set, then check if a corresponding tool/func exists, and if so call the same by passing it the GenAi provided toolargs as a object. Inturn the text generated by the tool/func is captured and put into the user input entry text box, with tool_response tag around it.

As output generated by any tool/function call is currently placed into the TextArea provided for End user (for their queries), bcas the GenAi (engine/LLM) may be expecting the tool response to be sent as a user role data with tool_response tag surrounding the results from the tool call. So also now at the end of submit btn click handling, the end user input text area is not cleared, if there was a tool call handled, for above reasons. Also given that running a simple arithmatic expression in itself doesnt generate any output, so wrap them in a console.log, to help capture the result using the console.log trapping flow that is already setup.

and inform the GenAi/LLM about the same

Should hopeful ensure that the GenAi/LLM will generate appropriate code/expression as the argument to pass to these tool calls, to some extent.

ie in vs code with ts-check

Move tool calling logic into tools module. Try trap async promise failures by awaiting results of tool calling and putting full thing in an outer try catch. Have forgotten the nitty gritties of JS flow, this might help, need to check.

So that when tool handler writes the result to the tc_switch, it can make use of the same, to write to the right location. NOTE: This also fixes the issue with I forgetting to rename the key in js_run wrt writing of result.

to better describe how it will be run, so that genai/llm while creating the code to run, will hopefully take care of any naunces required.

Also as part of same, wrap the request details in the assistant block using a similar tagging format as the tool_response in user block.

Instead of automatically calling the requested tool with supplied arguments, rather allow user to verify things before triggering the tool. NOTE: User already provided control over tool_response before submitting it to the ai assistant.

Instead of automatically calling any requested tool by the GenAi / llm, that is from the tail end of the handle user submit btn click, Now if the GenAi/LLM has requested any tool to be called, then enable the Tool Run related UI elements and fill them with the tool name and tool args. In turn the user can verify if they are ok with the tool being called and the arguments being passed to it. Rather they can even fix any errors in the tool usage like the arithmatic expr to calculate that is being passed to simple_calculator or the javascript code being passed to run_javascript_function_code If user is ok with the tool call being requested, then trigger the same. The results if any will be automatically placed into the user query text area. User can cross verify if they are ok with the result and or modify it suitabley if required and inturn submit the same to the GenAi/LLM.

Also avoid showing Tool calling UI elements, when not needed to be shown.

Dont forget to map members of got entity from fetch to things from saved original promise, bcas remember what is got is a promise. also add some comments around certain decisions and needed exploration

hanishkvc · 2025-10-15T18:18:58Z

Code/flow has been updated to ensure to trap promises including those from fetch in the generated code if any

A basic go at it

Declare the result of UrlReq as a DataClass, so that one doesnt goof up wrt updating and accessing members. Duplicate UrlRaw into UrlText, need to add Text extracting from html next for UrlText

As _UrlopenRet not exposed for use outside urllib, so decode and encode the data. Add skeleton to try get the html/xml tree top elements

As html can be malformed, xml ElementTree XMLParser cant handle the same properly, so switch to the HtmlParser helper class that is provided by python and try extend it. Currently a minimal skeleton to just start it out, which captures only the body contents.

First identify lines which have only whitespace and replace them with lines with only newline char in them. Next strip out adjacent lines, if they have only newlines

Ensures that if the url being requested as any query strings in them then things dont get messed up, when the url to get inc its query is extracted from the proxy request's query string

also now track header, footer and nav so that they arent captured

Add a new send headers common helper and use the same wrt the overridden send_error as well as do_OPTIONS This ensures that if there is any error during proxy opertions, the send_error propogates to the fetch from any browser properly without browser intercepting it with a CORS error

So that the same error path is used for logical error wrt http req also, without needing a different path for it. Dont forget to return the resp text/json/..., so that the contents are passed along the promise then chain

hanishkvc added 30 commits October 13, 2025 18:51

SimpleChatToolCalling: Test/Explore srvr initial hs using cmdline

fa23e9d

SimpleChatTools: Add boolean to allow user control of tools use

75ce9e4

SimpleChatTC: More generic tooljs, SimpCalc, some main skeleton

bbaae70

Make tooljs structure and flow more generic Add a simple_calculator tool/function call logic Add initial skeleton wrt the main tools.mjs file.

SimpleChatTC: Bring in the tools meta into the main flow

f091568

SimpleChatTC: use tcpdump to dbg hs; check if ai aware of tools

9d8be85

SimpleChatTC: Show toolcall being generated by ai - Temp

4cbe1d2

SimpleChatTC: Saner/Robust AssistantResponse content_equiv

d7f612f

Previously if content was empty, it would have always sent the toolcall info related version even if there was no toolcall info in it. Fixed now to return empty string, if both content and toolname are empty.

SimpleChatTC: Trap any exception raised during tool call

7a2bcfb

and inform the GenAi/LLM about the same

SimpleChatTC: More clearer description of toolcalls execution env

f10ab96

Should hopeful ensure that the GenAi/LLM will generate appropriate code/expression as the argument to pass to these tool calls, to some extent.

SimpleChatTC: Clarify some type definitions to avoid warnings

a1f1776

ie in vs code with ts-check

SimpleChatTC: Pass toolname to the tool handler

3796306

So that when tool handler writes the result to the tc_switch, it can make use of the same, to write to the right location. NOTE: This also fixes the issue with I forgetting to rename the key in js_run wrt writing of result.

SimpleChatTC: Cleanup the function description a bit

0ed8329

to better describe how it will be run, so that genai/llm while creating the code to run, will hopefully take care of any naunces required.

SimpleChatTC: Update the readme.md wrt tool calling a bit

aa81f51

SimpleChatTC: ToolCall hs info in normal assistant-user chat flow

5ed2bc3

Also as part of same, wrap the request details in the assistant block using a similar tagging format as the tool_response in user block.

SimpleChatTC: Update readme with bit more details, Cleaner UI

2aabca2

Also avoid showing Tool calling UI elements, when not needed to be shown.

SimpleChatTC: Tool Calling UI elements use up horizontal space

90b2491

hanishkvc added 3 commits October 15, 2025 22:52

SimpleChatTC: Allow await in generated code that will be evald

788a9e4

SimpleChatTC:Ensure fetch's promise chain is also trapped

c155bd3

Dont forget to map members of got entity from fetch to things from saved original promise, bcas remember what is got is a promise. also add some comments around certain decisions and needed exploration

SimpleChatTC: update readme wrt promise related trapping

1e1bb9a

hanishkvc added 21 commits October 16, 2025 11:52

SimpleChatTC:WebFetchThroughProxy:Initial go creating request

9b1ed1f

SimpleChatTC:SimpleProxy:Process args --port

1b361ee

SimpleChatTC:SimpleProxy: Start server, Show requested path

9ec3d45

SimpleChatTC:SimpleProxy: Cleanup for basic run

620da45

SimpleChatTC:SimpleProxy: Extract and check path, route to handlers

bdd054b

SimpleChatTC:SimpleProxy:implement handle_urlraw

5ac60a8

A basic go at it

SimpleChatTC:SimpleProxy:UrlRaw: Fixup basic oversight wrt 1st go

ac448c3

SimpleChatTC:WebFetch: Update to use internal SimpleProxy.py

2ca2b98

SimpleChatTC:SimpleProxy: Cleanup few messages

04a5f95

SimpleChatTC:SimpleProxy:Common UrlReq helper for UrlRaw & UrlText

295893c

Declare the result of UrlReq as a DataClass, so that one doesnt goof up wrt updating and accessing members. Duplicate UrlRaw into UrlText, need to add Text extracting from html next for UrlText

SimpleChatTC:SimpleProxy: ElementTree, No _UrlopenRet

4694537

As _UrlopenRet not exposed for use outside urllib, so decode and encode the data. Add skeleton to try get the html/xml tree top elements

SimpleChatTC:SimpleProxy:UrlText: Capture body except for scripts

2e6a386

SimpleChatTC:SimpleProxy:UrlText: Avoid style blocks also

0a3b4c2

SimpleChatTC:WebUrl FetchStrip through simple proxy

61b9371

SimpleChatTC:SimpleProxy:UrlText: Try strip empty lines some what

378bde5

SimpleChatTC:SimpleProxy:UrlText: Slightly better trimming

e7277cb

First identify lines which have only whitespace and replace them with lines with only newline char in them. Next strip out adjacent lines, if they have only newlines

SimpleChatTC:WebUrlText:Update name and desc to see if prefered

9db2f27

SimpleChatTC:SimpleProxy:Options just in case

c5a602f

SimpleChatTC:WebFetch:UrlEnc url2fetch b4Passing toProxy asQuery

6efbd12

Ensures that if the url being requested as any query strings in them then things dont get messed up, when the url to get inc its query is extracted from the proxy request's query string

SimpleChatTC: Update readme wrt web fetch and related simple proxy

8f0700a

github-actions bot added the python python script changes label Oct 17, 2025

hanishkvc added 4 commits October 17, 2025 18:02

SimpleChatTC:SimpleProxy:HtmlParser more generic and flexible

514cec0

also now track header, footer and nav so that they arent captured

SimpleChatTC:WebFetch: Cleanup the names and descriptions a bit

e124317

SimpleChatTC:WebFetch:Trap Non Ok status and raise error

4ffed94

So that the same error path is used for logical error wrt http req also, without needing a different path for it. Dont forget to return the resp text/json/..., so that the contents are passed along the promise then chain

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

extend server/public_simplechat with simple minded interactive browser-client side based toolcalling - base logic #16563

extend server/public_simplechat with simple minded interactive browser-client side based toolcalling - base logic #16563

hanishkvc commented Oct 13, 2025 •

edited

Loading

Uh oh!

hanishkvc commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

extend server/public_simplechat with simple minded interactive browser-client side based toolcalling - base logic #16563

Are you sure you want to change the base?

extend server/public_simplechat with simple minded interactive browser-client side based toolcalling - base logic #16563

Conversation

hanishkvc commented Oct 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hanishkvc commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hanishkvc commented Oct 13, 2025 •

edited

Loading