

Codegeex4 ERROR: ChatGLM4Tokenizer._pad() got an unexpected keyword argument 'padding_side' #2757

Open
Oaklight opened this issue Jan 12, 2025 · 3 comments


System Info

Managed server, account without sudo privileges. Singularity is available:

$ singularity --version
singularity-ce version 4.1.2-focal

OS version

NAME="Ubuntu"
VERSION="20.04.6 LTS (Focal Fossa)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 20.04.6 LTS"
VERSION_ID="20.04"
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
VERSION_CODENAME=focal
UBUNTU_CODENAME=focal

Running Xinference with Docker?

  • docker
  • pip install
  • installation from source

Version info

v1.1.0

The command used to start Xinference

singularity exec --fakeroot \
  --env XINFERENCE_MODEL_SRC=huggingface \
  --bind xinference/.xinference:/root/.xinference \
  --nv \
  --bind /tmp/.X11-unix:/tmp/.X11-unix \
  xinference/xinference_v1.1.0.sif \
  xinference-local -H 0.0.0.0 --log-level debug

Reproduction

  • Model Engine: Transformers (cached)
  • Model Format: pytorch (cached)
  • Model Size: 9 (cached)
  • Quantization: 8-bit (cached)
  • N-GPU: 1
  • Replica: 1
  • Additional parameters passed to the inference engine (Transformers):
    • key: dtype
    • value: half

Access via localhost:port/v1/chat/completions:

Error handling webview message: {
  "msg": {
    "messageId": "b88dd8cb-da49-4938-83d2-429313093b9c",
    "messageType": "llm/streamChat",
    "data": {
      "messages": [
        {
          "role": "user",
          "content": [
            {
              "type": "text",
              "text": "hi"
            }
          ]
        },
        {
          "role": "assistant",
          "content": ""
        }
      ],
      "title": "CodeGeeX4",
      "completionOptions": {}
    }
  }
}

Error: Malformed JSON sent from server: {"error": "[address=0.0.0.0:34705, pid=3795840] ChatGLM4Tokenizer._pad() got an unexpected keyword argument 'padding_side'"}

This error is returned in response to the request above.
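For context (a hedged reading, not stated in the thread): newer transformers releases forward a `padding_side` keyword to the tokenizer's `_pad()` method, so a custom tokenizer whose `_pad()` override predates that keyword raises exactly this TypeError. A minimal sketch of the signature mismatch, using stand-in classes rather than the real ChatGLM4Tokenizer:

```python
# Minimal sketch of the signature mismatch (stand-in classes, not the real
# transformers/ChatGLM4 code): the caller now passes padding_side=..., but
# the legacy override does not accept that keyword.

class LegacyPad:
    # mirrors an old-style _pad() override with no padding_side parameter
    def _pad(self, encoded_inputs, max_length=None, padding_strategy=None,
             pad_to_multiple_of=None, return_attention_mask=None):
        return encoded_inputs

def pad_like_new_transformers(tokenizer):
    # newer transformers forwards padding_side through to _pad()
    try:
        tokenizer._pad({"input_ids": [1, 2, 3]}, max_length=8,
                       padding_side="left")
        return "ok"
    except TypeError as err:
        return f"TypeError: {err}"

print(pad_like_new_transformers(LegacyPad()))
# prints a TypeError mentioning the unexpected 'padding_side' keyword
```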

Expected behavior

The request should work normally; the llama.cpp version of the model handles the exact same query correctly.

See THUDM/GLM-4#578 for the related discussion in the ChatGLM4 repo.

@XprobeBot XprobeBot added the gpu label Jan 12, 2025
@XprobeBot XprobeBot added this to the v1.x milestone Jan 12, 2025
qinxuye (Contributor) commented Jan 14, 2025

@codingl2k1 Can you help with this?

codingl2k1 (Contributor) commented:

The model tokenizer requires an update, related issue: https://huggingface.co/THUDM/codegeex4-all-9b/discussions/20

Just like this fix on LongWriter-glm4-9b: https://huggingface.co/THUDM/LongWriter-glm4-9b/commit/778b5712634889f5123d6c463ca383bc6dd5c621
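The fix pattern in the linked LongWriter-glm4-9b commit amounts to accepting `padding_side` in the custom `_pad()` override and forwarding it to the base implementation. A hedged sketch of that pattern, with a stand-in base class in place of transformers' `PreTrainedTokenizerBase`:

```python
# Hedged sketch of the fix pattern (stand-in classes, not the actual commit):
# accept padding_side in the custom _pad() override and pass it along,
# instead of rejecting it with a TypeError.

class BasePad:
    # stand-in for the library base class, which already accepts padding_side
    def _pad(self, encoded_inputs, max_length=None, padding_strategy=None,
             pad_to_multiple_of=None, padding_side=None,
             return_attention_mask=None):
        encoded_inputs["padding_side_used"] = padding_side
        return encoded_inputs

class PatchedTokenizer(BasePad):
    def _pad(self, encoded_inputs, max_length=None, padding_strategy=None,
             pad_to_multiple_of=None, padding_side=None,
             return_attention_mask=None):
        # custom pre/post-processing would go here; the key change is that
        # padding_side is now accepted and forwarded rather than rejected
        return super()._pad(encoded_inputs, max_length=max_length,
                            padding_strategy=padding_strategy,
                            pad_to_multiple_of=pad_to_multiple_of,
                            padding_side=padding_side,
                            return_attention_mask=return_attention_mask)

result = PatchedTokenizer()._pad({"input_ids": [1, 2]}, padding_side="left")
print(result["padding_side_used"])  # left
```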


This issue is stale because it has been open for 7 days with no activity.

@github-actions github-actions bot added the stale label Jan 21, 2025