server : clean up built-in template detection by ngxson · Pull Request #11026 · ggml-org/llama.cpp

ngxson · 2024-12-31T11:18:47Z

Bug: when starting the server, templates longer than 2048 bytes is shown as "not supported" in the log output, while it's still being formatted correctly in /v1/chat/completions

slaren · 2024-12-31T13:09:16Z

common/common.cpp

+    static const char * template_key = "tokenizer.chat_template";
+    // call with NULL buffer to get the total size of the string
+    int32_t res = llama_model_meta_val_str(model, template_key, NULL, 0);
+    if (res < 2) {


What's the reason for the 2 here? llama_model_meta_val_str returns either -1 if the key is not found, or the length of the string.

Hmm yeah someone added this in the PR to fix the null terminator. I suppose that it's to make sure model_template.size() - 1 stay positive.

In any case, we can change it to

Suggested change

if (res < 2) {

if (res < 0) {

Then below:

return std::string(model_template.data(), res);

What do you think?

model_template.size() should always be at least 1, since the size is res + 1. So I don't expect the -1 to cause issues, but that should also work.

I ended up just flipping the condition to if (res > 0), it's more readable this way.

* server : clean up built-in template detection * fix compilation * add chat template test * fix condition

server : clean up built-in template detection

c5ac2b8

ngxson requested a review from slaren December 31, 2024 11:18

github-actions bot added examples server labels Dec 31, 2024

ngxson added 2 commits December 31, 2024 12:23

fix compilation

44f998a

add chat template test

c6bd7a7

github-actions bot added the python python script changes label Dec 31, 2024

slaren reviewed Dec 31, 2024

View reviewed changes

slaren approved these changes Dec 31, 2024

View reviewed changes

fix condition

450e47b

ngxson merged commit 45095a6 into ggml-org:master Dec 31, 2024

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025

server : clean up built-in template detection (ggml-org#11026)

4a22968

* server : clean up built-in template detection * fix compilation * add chat template test * fix condition

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

server : clean up built-in template detection#11026

server : clean up built-in template detection#11026
ngxson merged 4 commits intoggml-org:masterfrom
ngxson:xsn/server_chat_template_detect

ngxson commented Dec 31, 2024

Uh oh!

slaren Dec 31, 2024 •

edited

Loading

Uh oh!

ngxson Dec 31, 2024 •

edited

Loading

Uh oh!

slaren Dec 31, 2024

Uh oh!

ngxson Dec 31, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ngxson commented Dec 31, 2024

Uh oh!

slaren Dec 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ngxson Dec 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

slaren Dec 31, 2024

Choose a reason for hiding this comment

Uh oh!

ngxson Dec 31, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

slaren Dec 31, 2024 •

edited

Loading

ngxson Dec 31, 2024 •

edited

Loading