
@MaxRobinsonTheGreat (Collaborator)

No description provided.

@Ninot1Quyi (Contributor) commented Oct 20, 2025

MindCraft Models Tool API Support Status

| Model Name | Implementation | Test Status |
| --- | --- | --- |
| Azure | Inherits GPT | [v] |
| Cerebras | Independent | [v] |
| Claude | Independent | [v] |
| DeepSeek | Inherits GPT | [v] |
| Gemini | Independent | [v] |
| GLHF | Inherits GPT | [ ] Pending |
| GPT | Base Class | [ ] Pending |
| Grok | Inherits GPT | [v] |
| Groq | Independent | [v] |
| HuggingFace | Independent | [v] |
| Hyperbolic | Independent | [x] |
| Mercury | Inherits GPT | [v] |
| Mistral | Independent | [v] |
| Novita | Inherits GPT | [v] |
| Ollama | Independent | [-] |
| OpenRouter | Inherits GPT | [v] |
| Qwen | Inherits GPT | [-] |
| Replicate | Conditional Switch | [x] |
| VLLM | Inherits GPT | [ ] Pending |

Total: 19 models/providers

Status Legend:
- [ ] Pending
- [v] Supported
- [-] Imperfect support
- [x] Not supported

Implementation Categories

| Implementation Type | Model Count | Model List |
| --- | --- | --- |
| Inherits GPT Base | 9 | Azure, DeepSeek, GLHF, Grok, Mercury, Novita, OpenRouter, Qwen, VLLM |
| Independent Implementation | 9 | Cerebras, Claude, Gemini, Groq, HuggingFace, Hyperbolic, Mistral, Ollama, GPT |
| Conditional API Switch | 1 | Replicate |

@Ninot1Quyi (Contributor) commented Oct 26, 2025

Can someone help me?🤯

If anyone has API credits on these untested platforms, I'd really appreciate your help testing them.
Untested platforms: GLHF, GPT, VLLM.

I've never used GLHF, and I don't want to add credits on that platform.
For GPT, I can't pay for credits with a Chinese bank card.
For VLLM, it's hard to test because my computer's performance is insufficient.

I hope someone can help me test the remaining models. I've already completed the testing and fixes for the ones I can test.

After the testing is done, I will clean up the code and submit the PR.

If you can help me, I will be extremely grateful.

@Ninot1Quyi (Contributor) commented Oct 27, 2025

1. How to Test?

There are screenshot examples in the attached PDF: enhaced-coding-system-en.pdf

  • Step 1: In settings.js, add the path to the profile you want to test. (Qwen's tool-calling API has some limitations, so currently we can't force Qwen to call tools.)

  • Step 2: In settings.js, set allow_insecure_coding to true.

  • Step 3: Start the game, open it on port 55916, and run npm start to launch MindCraft.

  • Step 4: Enter the following command and watch whether the bot works. Scroll through the terminal and check for colored logs indicating tool calls.

    !newAction("Don't give up, the goal is to beat Minecraft and defeat the Ender Dragon. You can use 'inventory' to check your items")

  • If the tool call fails: look for "Response is not in Tool format. Please use Tool command format." in the terminal. If this appears, the tool call failed this time.

  • If the tool call succeeds: look for "[ToolManager] Executing" in the terminal. If this appears, the tool call was successful, and the related prompts are printed in color.
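The settings changes in Steps 1–3 can be sketched as follows. Only allow_insecure_coding and port 55916 are named in this PR; the other key names and the profile path are illustrative assumptions, not verified against the repo.

```javascript
// Hypothetical sketch of the settings.js values the steps above refer to.
// The profile path and exact key layout are assumptions for illustration.
const settings = {
    profiles: ['./profiles/gpt.json'], // Step 1: profile under test (illustrative path)
    allow_insecure_coding: true,       // Step 2: required for the enhanced coding system
    port: 55916,                       // Step 3: the port the Minecraft world is opened on
};
```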

@Ninot1Quyi (Contributor) commented Nov 8, 2025

1. Effects

Survival Mode

It can work stably, plan todos reasonably and update them in a timely manner, quickly acquire materials, and convert part of the code into reusable skill functions.

Its disadvantage is that it is relatively slow and consumes a large number of tokens.

Its advantage, however, is that it only activates when the agent uses !newAction(task)—and since the task is usually a relatively complex one, its cost-effectiveness is quite good.

Construction Capability

Instead of completing an entire building in one step as before, it can now first generate a todo plan, then acquire resources, and then build the foundation, walls, and roof in sequence.

Its disadvantage is that the resulting building is not as stunning as one generated in a single step previously.

Its advantage is that it demonstrates the ability to modify and edit buildings. I have added a function world.getBuildingStructure(bot, corner1, corner2), which returns the blocks in the area layer by layer as JSON text, allowing the agent to quickly understand the 3D structure through text. I believe that with further enhancements, the agent will be able to modify buildings accurately.
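A minimal standalone sketch of what a layer-by-layer dump like world.getBuildingStructure(bot, corner1, corner2) could produce. The real signature and output format live in the PR; this version takes a blockAt(x, y, z) callback instead of a bot so it can run anywhere.

```javascript
// Hedged sketch: serialize a cuboid of blocks as one JSON entry per horizontal slice,
// so an LLM can read the 3D structure as text. Output shape is an assumption.
function getBuildingStructure(blockAt, corner1, corner2) {
    const x1 = Math.min(corner1.x, corner2.x), x2 = Math.max(corner1.x, corner2.x);
    const y1 = Math.min(corner1.y, corner2.y), y2 = Math.max(corner1.y, corner2.y);
    const z1 = Math.min(corner1.z, corner2.z), z2 = Math.max(corner1.z, corner2.z);
    const layers = [];
    for (let y = y1; y <= y2; y++) {        // one entry per horizontal layer
        const rows = [];
        for (let z = z1; z <= z2; z++) {
            const row = [];
            for (let x = x1; x <= x2; x++) row.push(blockAt(x, y, z));
            rows.push(row);
        }
        layers.push({ y, rows });
    }
    return JSON.stringify(layers);
}

// Tiny fake world: stone floor at y=0, air above.
const fakeBlockAt = (x, y, z) => (y === 0 ? 'stone' : 'air');
const structureJson = getBuildingStructure(fakeBlockAt, { x: 0, y: 0, z: 0 }, { x: 1, y: 1, z: 1 });
```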

2. Detailed Implementation Documentation

The enhanced coding system provides these features:

  • Allows the AI to generate code (already available before).
  • Enables the AI to write files via tool calls and precisely fix specific errors in code instead of regenerating it from scratch.
  • Permits the AI to save generated code to the learnSkills folder and reuse learned skills in subsequent code generation.

The above operations can only be performed in the defined workspace, which uses absolute path validation to ensure security. Code execution is done in a sandbox, where the code is wrapped and line/column numbers are remapped to correctly point to the actual location of errors in the code.

Why do this?

To enable the AI to evolve as it plays, learning new things and skills.

To later add a !saveNewCommand command, which can save generated code as instructions usable in normal mode.

To allow the AI to autonomously fix erroneous code on the skill's first use and whenever errors occur in subsequent uses.

2.1 What Was Updated?

  • Added the tools system.

    • Related code is located in the src/agent/tools folder.
    • src/agent/tools/toolManager.js serves as the manager and executor of tools, responsible for connecting various tools to the agent.
    • src/agent/tools/tools-prompt.md is the prompt for using tools, placed in the tools folder. When sending to the LLM, it will be injected by replacing $TOOLS in the coding.md file.
    • There are 10 tool definition files in total: edit.js, execute.js, finishCoding.js, glob.js, grep.js, lint.js, ls.js, multiEdit.js, read.js, todoWrite.js.
  • Improved prompt readability and editability.

    • Modified profiles/defaults/prompts to move prompts from JSON to MD files, making them easier to read and edit. Changed the original locations to paths of MD files.
  • Refactored the original coding prompt, adding many prompt instructions based on task needs to standardize and constrain the agent's behavior. Note that src/agent/tools/tools-prompt.md is injected as tools_manual into profiles/defaults/prompts/coding.md. This makes the coding prompt very long; I'm considering whether we can streamline it and remove some parts, as the current cost is quite high.
    Relevant code: src/models/prompter.js, lines 22–51 and 309–391.

  • Added required return values for all !Command instructions.

    • src/agent/commands/actions.js. Previously, some commands wouldn't return any message on failure, not providing enough feedback to the agent, and empty returns could cause other issues. Added required success and failure return values for all commands.
  • Added more comprehensive action interruption checks.

    • src/agent/library/skills.js, src/agent/library/world.js. Added !bot.interrupt_code checks to all for and while loops as well as timer code in these two files, so the bot can stop any action promptly. For example:
    for (let i = 0; i < empty_pos.length && !bot.interrupt_code; i++) {
        let empty = true;
        for (let x = 0; x < size && !bot.interrupt_code; x++) {
            for (let z = 0; z < size && !bot.interrupt_code; z++) {
                let top = bot.blockAt(empty_pos[i].offset(x, 0, z));
                let bottom = bot.blockAt(empty_pos[i].offset(x, -1, z));
                // A spot is usable only if the top block is air and the bottom
                // block is diggable and has drops. (Fixed: the original
                // `!top.name == 'air'` compared a boolean to a string.)
                if (!top || top.name !== 'air' || !bottom || !bottom.drops || bottom.drops.length === 0 || !bottom.diggable) {
                    empty = false;
                    break;
                }
            }
            if (!empty) break;
        }
        if (empty) {
            return empty_pos[i];
        }
    }
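The "required success and failure return values" change above can be illustrated with a small wrapper. This is a hedged sketch; withRequiredResult and the command name are invented for illustration, not the PR's actual API in src/agent/commands/actions.js.

```javascript
// Hedged sketch: wrap a command so it always yields a non-empty feedback
// message for the agent, whether it succeeds, returns nothing, or throws.
function withRequiredResult(name, perform) {
    return async (...args) => {
        try {
            const msg = await perform(...args);
            return msg || `Success: ${name} completed.`; // never return an empty message
        } catch (err) {
            return `Failed: ${name} errored: ${err.message}`;
        }
    };
}

// Usage: a command that used to return undefined on success now reports it.
const collect = withRequiredResult('collectBlocks', async (type, n) => {
    if (n <= 0) throw new Error('count must be positive');
    // ...actual collection would happen here...
});
```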

2.2 Execution Flow

2.2.1 When is coding triggered?

Currently, coding mode is only entered when using !newAction("task_text"), just like before.

The new coding system will add task_text as coding_goal to profiles/defaults/prompts/coding.md.

The AI will determine on its own if the task is complete, and when it is, it will call the FinishCoding tool to exit coding mode.

  • Step 1: Use !newAction("task_text") --> Enter coding mode -->

  • Step 2: Replace content in coding.md with current information --> Send to LLM --> LLM returns JSON result with tool calls -->

  • Step 3: ToolManager executes and adds the result to messages --> Back to Step 2

When the LLM calls FinishCoding, exit coding mode and return summary.
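The Step 1–3 loop above can be sketched as follows. sendToLLM and toolManager are stand-ins for the PR's real prompter and ToolManager components; the message shapes are assumptions.

```javascript
// Hedged sketch of the coding-mode loop: send context, execute returned tool
// calls, feed results back, and exit when the LLM calls FinishCoding.
async function codingLoop(codingGoal, sendToLLM, toolManager) {
    const messages = [{ role: 'user', content: `coding_goal: ${codingGoal}` }];
    for (;;) {
        const reply = await sendToLLM(messages);            // Step 2: LLM returns tool calls
        for (const call of reply.toolCalls) {
            if (call.name === 'FinishCoding')
                return call.args.summary;                   // exit coding mode with a summary
            const result = await toolManager.execute(call); // Step 3: run the tool
            messages.push({ role: 'tool', name: call.name, content: result });
        }
    }
}

// Usage with fakes: the first turn runs a tool, the second turn finishes.
let turn = 0;
const fakeLLM = async () => (++turn === 1
    ? { toolCalls: [{ name: 'read', args: { file: 'a.js' } }] }
    : { toolCalls: [{ name: 'FinishCoding', args: { summary: 'done' } }] });
const fakeTools = { execute: async () => 'file contents' };
const summaryPromise = codingLoop('build a hut', fakeLLM, fakeTools);
```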


3. Tool support status across all models/providers

Two tool invocation methods are implemented: one uses the native tool API, and the other uses prompt engineering. The invocation method is preconfigured in the profiles of models/providers that either do not support native tool calls or support them incompletely.

Switch between the two methods by modifying the "use_native_tools" parameter in the profile. Each profile has independent settings, and the native tool API is used by default.
For example:

{
    "name": "gpt",

    "model": {
        "model": "gpt-5",
        "params": {
            "reasoning": {
                "effort": "low"
            }
        }
    },

    "use_native_tools": true
}
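One way the "use_native_tools" flag could select between the two invocation paths is sketched below. This is an assumption about the dispatch, not the PR's actual code; the request shapes are illustrative.

```javascript
// Hedged sketch: build the provider request differently depending on whether
// the profile uses the native tool API or prompt-engineered tool calls.
function buildToolRequest(profile, messages, toolDefs) {
    if (profile.use_native_tools) {
        // Native path: pass structured tool definitions to the provider API.
        return { messages, tools: toolDefs };
    }
    // Prompt-engineering path: describe the tools in text and prepend them,
    // then parse tool calls out of the model's plain-text reply later.
    const manual = toolDefs.map(t => `- ${t.name}: ${t.description}`).join('\n');
    return { messages: [{ role: 'system', content: manual }, ...messages] };
}
```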

4. Miscellaneous Updates

Modified the function comments in parts of src/agent/library/skills.js and src/agent/library/world.js.
Changed the format to start with /** and end with **/ so skillDocs builds correctly.
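To illustrate why the `/** ... **/` delimiters matter, here is a minimal doc extractor that splits on exactly those markers. This is an assumption about how skillDocs-style parsing could work, not the repo's actual builder.

```javascript
// Sample source in the required comment format: opens with /** and closes with **/.
const sampleSource = `
/**
 * Collects nearby blocks of the given type.
 **/
async function collectBlock(bot, type) {}
`;

// Hedged sketch: pull every /** ... **/ block out of a source string so the
// docs can be assembled into a skill manual for the agent.
function extractDocs(code) {
    const docs = [];
    const re = /\/\*\*([\s\S]*?)\*\*\//g;
    let m;
    while ((m = re.exec(code)) !== null) docs.push(m[1].trim());
    return docs;
}
```

A comment ending with the standard `*/` instead of `**/` would not match this pattern, which is why the format change was needed.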

5. Important

Untested platforms: GLHF, GPT, VLLM.

@Ninot1Quyi Ninot1Quyi marked this pull request as ready for review November 8, 2025 15:49
@Ninot1Quyi Ninot1Quyi marked this pull request as draft November 9, 2025 04:03