feat: Support GLM-series models (Zhipu AI) as an inference backend by hanzhcn · Pull Request #221 · Conway-Research/automaton

hanzhcn · 2026-02-24T20:14:10Z

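Overview

This PR adds support for Zhipu AI's GLM-series models (such as GLM-5) as an inference backend, so that Conway Automaton can run on Chinese domestic large language models.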

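Background

Zhipu AI's GLM series is a leading family of Chinese domestic large language models, with the following advantages:

- Cost-effectiveness: GLM-5's API cost is significantly lower than that of OpenAI GPT-4
- Localization and compliance: data does not leave China, which meets compliance requirements for the Chinese market
- Chinese-language optimization: better understanding of Chinese-language contexts and tasks
- Stable availability: the GLM Coding Plan provides a stable, enterprise-grade API service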

Technical changes

1. New configuration option (`src/types.ts`)

/** Custom OpenAI API base URL (e.g., for GLM Coding Plan: https://open.bigmodel.cn/api/coding/paas/v4) */
openaiApiBaseUrl?: string;

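2. Configuration bridging fix (`src/agent/loop.ts`) ⭐ Key fix

Problem: the top-level `inferenceModel` setting was not passed through to `modelStrategyConfig`, so the user's configuration was ignored.

Fix: bridge the top-level configuration fields into `modelStrategyConfig`: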

const modelStrategyConfig: ModelStrategyConfig = {
  ...DEFAULT_MODEL_STRATEGY_CONFIG,
  // Bridge top-level config fields for backward compatibility
  ...(config.inferenceModel ? { inferenceModel: config.inferenceModel } : {}),
  ...(config.maxTokensPerTurn ? { maxTokensPerTurn: config.maxTokensPerTurn } : {}),
  // Nested modelStrategy takes highest priority
  ...(config.modelStrategy ?? {}),
};
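The precedence that results from this spread order can be checked in isolation. The following is a minimal sketch, not the repository's code: the interfaces are trimmed down, `buildModelStrategyConfig` is a hypothetical wrapper, and the default values are assumptions chosen only for illustration.

```typescript
// Hypothetical, trimmed-down shapes; the real interfaces have more fields.
interface ModelStrategyConfig {
  inferenceModel: string;
  maxTokensPerTurn: number;
}

interface TopLevelConfig {
  inferenceModel?: string;
  maxTokensPerTurn?: number;
  modelStrategy?: Partial<ModelStrategyConfig>;
}

// Assumed defaults, for illustration only.
const DEFAULT_MODEL_STRATEGY_CONFIG: ModelStrategyConfig = {
  inferenceModel: "gpt-5.2",
  maxTokensPerTurn: 4096,
};

// Later spreads win: defaults < top-level fields < nested modelStrategy.
function buildModelStrategyConfig(config: TopLevelConfig): ModelStrategyConfig {
  return {
    ...DEFAULT_MODEL_STRATEGY_CONFIG,
    ...(config.inferenceModel ? { inferenceModel: config.inferenceModel } : {}),
    ...(config.maxTokensPerTurn ? { maxTokensPerTurn: config.maxTokensPerTurn } : {}),
    ...(config.modelStrategy ?? {}),
  };
}

// A top-level inferenceModel overrides the default model.
const bridged = buildModelStrategyConfig({ inferenceModel: "glm-5" });
console.log(bridged.inferenceModel);
```

Because the nested `modelStrategy` is spread last, a config that sets `inferenceModel` in both places ends up with the nested value.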

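3. Model selection priority fix (`src/inference/router.ts`) ⭐ Key fix

Problem: the routing matrix took precedence over the user's configuration, so gpt-5.2 overrode the user-configured glm-5.

Fix: the user-configured `inferenceModel` now has the highest priority: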

selectModel(tier: SurvivalTier, taskType: InferenceTaskType): ModelEntry | null {
  // 1. Try user-configured inferenceModel first (highest priority)
  if (strategy.inferenceModel) {
    const entry = this.registry.get(strategy.inferenceModel);
    if (entry && entry.enabled) {
      const isFree = entry.costPer1kInput === 0 && entry.costPer1kOutput === 0;
      const tierOk = tierRank >= (TIER_ORDER[entry.tierMinimum] ?? 0);
      if (isFree || tierOk) {
        return entry;
      }
    }
  }
  // 2. Try routing-matrix candidates as fallback
  // 3. Fall back to other user-configured models...
}

Priority order:

1. User-configured `inferenceModel` (highest priority)
2. Routing-matrix candidates
3. Other configured models (`lowComputeModel`, `criticalModel`)
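The three-step priority order can be sketched as a single ordered walk over candidate model IDs. This is an illustrative reduction, not the router's actual implementation: the tier names, the trimmed `ModelEntry` shape, and the standalone `selectModel` signature (taking the registry and candidate lists as arguments) are all assumptions.

```typescript
// Assumed tier names; the real SurvivalTier type may differ.
type SurvivalTier = "critical" | "low_compute" | "normal";

interface ModelEntry {
  modelId: string;
  tierMinimum: SurvivalTier;
  costPer1kInput: number;
  costPer1kOutput: number;
  enabled: boolean;
}

const TIER_ORDER: Record<SurvivalTier, number> = { critical: 0, low_compute: 1, normal: 2 };

// An entry is usable if it is enabled and either free or allowed at the current tier.
function isUsable(entry: ModelEntry | undefined, tier: SurvivalTier): entry is ModelEntry {
  if (!entry || !entry.enabled) return false;
  const isFree = entry.costPer1kInput === 0 && entry.costPer1kOutput === 0;
  return isFree || TIER_ORDER[tier] >= TIER_ORDER[entry.tierMinimum];
}

function selectModel(
  registry: Map<string, ModelEntry>,
  tier: SurvivalTier,
  preferred: string | undefined, // 1. user-configured inferenceModel
  matrixCandidates: string[],    // 2. routing-matrix candidates
  fallbacks: string[],           // 3. e.g. lowComputeModel, criticalModel
): ModelEntry | null {
  const ordered = [...(preferred ? [preferred] : []), ...matrixCandidates, ...fallbacks];
  for (const id of ordered) {
    const entry = registry.get(id);
    if (isUsable(entry, tier)) return entry;
  }
  return null;
}
```

Under these assumptions, an entry registered with zero cost (as glm-5 is in this PR) passes the check at every tier, so a configured `inferenceModel` of that kind is always returned first.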

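4. Inference client adaptation (`src/conway/inference.ts`)

Model detection: GLM models are detected automatically and routed to the OpenAI-compatible backend

// GLM models (Zhipu): glm-* use the OpenAI-compatible API
if (keys.openaiApiKey && /^glm/i.test(model)) return "openai";

API endpoint adaptation: the GLM Coding Plan uses /chat/completions rather than the standard OpenAI /v1/chat/completions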

const isGlmApi =
  params.apiUrl.includes("bigmodel.cn") ||
  params.apiUrl.includes("/v4") ||
  params.apiUrl.includes("/paas");
const endpoint = isGlmApi ? "/chat/completions" : "/v1/chat/completions";
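To see what full URL this detection produces, the check can be wrapped in a small helper. This is a sketch under assumptions: `resolveChatCompletionsUrl` is a hypothetical name, and the trailing-slash trimming is an addition that the PR snippet does not show.

```typescript
// Hypothetical helper: joins the configured base URL with the endpoint path
// chosen by the same substring heuristic as the snippet above.
function resolveChatCompletionsUrl(apiUrl: string): string {
  // Trim trailing slashes so the join never produces "//" (an addition for this sketch).
  const base = apiUrl.replace(/\/+$/, "");
  const isGlmApi =
    base.includes("bigmodel.cn") ||
    base.includes("/v4") ||
    base.includes("/paas");
  const endpoint = isGlmApi ? "/chat/completions" : "/v1/chat/completions";
  return base + endpoint;
}

// The GLM Coding Plan base URL already ends in /v4, so only /chat/completions is appended.
console.log(resolveChatCompletionsUrl("https://open.bigmodel.cn/api/coding/paas/v4"));
// A plain OpenAI-style base URL still gets the /v1 prefix.
console.log(resolveChatCompletionsUrl("https://api.openai.com"));
```

One consequence of substring matching worth keeping in mind: any custom base URL whose path happens to contain `/v4` or `/paas` is treated as a GLM endpoint, even on a different host.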

5. Context temperature fix (`src/agent/context.ts`)

GLM-5 requires `temperature` to be within the open interval (0, 1); it cannot be 0:

temperature: 0.1, // GLM-5 requires temperature > 0
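The PR hardcodes 0.1. An alternative that leaves other models untouched would be to clamp the value only for GLM models. The sketch below is not what the PR does: `normalizeTemperature` is a hypothetical helper, and the bounds 0.01 and 0.99 are assumptions picked to stay strictly inside (0, 1).

```typescript
// Assumed bounds: the description only says the interval is open at both ends.
const GLM_MIN_TEMPERATURE = 0.01;
const GLM_MAX_TEMPERATURE = 0.99;

// Hypothetical helper: clamp temperature into (0, 1) for glm-* models only.
function normalizeTemperature(model: string, temperature: number): number {
  if (!/^glm/i.test(model)) return temperature; // other models keep their value
  return Math.min(GLM_MAX_TEMPERATURE, Math.max(GLM_MIN_TEMPERATURE, temperature));
}

console.log(normalizeTemperature("glm-5", 0));   // raised to the assumed lower bound
console.log(normalizeTemperature("gpt-5.2", 0)); // unchanged
```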

6. Model registration (`src/inference/types.ts`)

Add GLM-5 to `STATIC_MODEL_BASELINE`:

{
  modelId: "glm-5",
  provider: "openai", // Uses OpenAI-compatible API
  displayName: "GLM-5 (智谱 Coding Plan)",
  tierMinimum: "normal",
  costPer1kInput: 0, // User pays directly to Zhipu
  costPer1kOutput: 0,
  maxTokens: 16384,
  contextWindow: 131072,
  supportsTools: true,
  supportsVision: false,
  parameterStyle: "max_tokens",
  enabled: true,
},

Usage

Configure in automaton.json:

{
  "openaiApiKey": "your-glm-api-key",
  "openaiApiBaseUrl": "https://open.bigmodel.cn/api/coding/paas/v4",
  "inferenceModel": "glm-5"
}

Important: only the top-level `inferenceModel` needs to be set; there is no need to configure the nested `modelStrategy`.
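Before starting the automaton, it can be useful to confirm that the three config values line up with what the endpoint expects. The sketch below builds (but does not send) a chat-completions request from such a config. It assumes the usual OpenAI-compatible conventions (a Bearer `Authorization` header and a `messages` array); `buildChatRequest` and the trimmed config shape are hypothetical and not part of this PR.

```typescript
// Hypothetical, trimmed-down view of automaton.json.
interface GlmConfig {
  openaiApiKey: string;
  openaiApiBaseUrl: string;
  inferenceModel: string;
}

interface ChatRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Builds a request for a GLM-style base URL, where /chat/completions is
// appended directly (no /v1 prefix), as described in section 4.
function buildChatRequest(config: GlmConfig, prompt: string): ChatRequest {
  const base = config.openaiApiBaseUrl.replace(/\/+$/, "");
  return {
    url: `${base}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${config.openaiApiKey}`, // assumed Bearer auth
      },
      body: JSON.stringify({
        model: config.inferenceModel,
        temperature: 0.1, // GLM-5 requires temperature > 0
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}

const request = buildChatRequest(
  {
    openaiApiKey: "your-glm-api-key",
    openaiApiBaseUrl: "https://open.bigmodel.cn/api/coding/paas/v4",
    inferenceModel: "glm-5",
  },
  "ping",
);
console.log(request.url);
```

Passing `request.url` and `request.init` to `fetch` would send the request; that step is left out here so the sketch runs without network access or a real key.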

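Testing

Verified in a production environment:

✅ GLM-5 inference works
✅ Tool calling (function calling) works
✅ Context management works
✅ Heartbeat tasks run normally
✅ The user-configured `inferenceModel` takes precedence over the routing matrix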

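Compatibility

- Fully backward compatible; existing users are not affected
- The new functionality is enabled only when `openaiApiBaseUrl` is configured
- No effect on standard OpenAI API behavior
- The configuration bridging fix applies to all models, not only GLM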

hanzhcn wants to merge 5 commits into Conway-Research:main from hanzhcn:feat/glm-model-support
