Support InfiniAI Megrez 3b #10893

dixyes · 2024-12-19T09:06:44Z

This PR adds InfiniAI Megrez support to llama.cpp.

The model as of commit 58f1df16523cb2a9acb225aa808146e052f2b5b2 seems to have the wrong eos_token set in its tokenizer_config.json: the chat template uses <|turn_end|>, while the JSON has <|turn_end>. I'm not sure whether this is on purpose. This is also mentioned here

As a result, the converted model will not stop generating in chat mode. Changing eos_token to <|turn_end|> in tokenizer_config.json before conversion makes the generated GGUF work.
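For anyone who prefers to script that edit, here is a minimal sketch of the workaround, not part of this PR. It assumes eos_token in tokenizer_config.json is either a plain string or an object with a "content" field. The helper names fix_eos_token and patch_file are hypothetical.

```python
import json
from pathlib import Path

WRONG_EOS = "<|turn_end>"   # value reported in tokenizer_config.json
FIXED_EOS = "<|turn_end|>"  # value used by the chat template


def fix_eos_token(config: dict) -> bool:
    """Correct eos_token in place; return True if something changed.

    Assumption: eos_token is either a string or a dict whose
    "content" key holds the token text.
    """
    eos = config.get("eos_token")
    if isinstance(eos, dict):
        if eos.get("content") == WRONG_EOS:
            eos["content"] = FIXED_EOS
            return True
        return False
    if eos == WRONG_EOS:
        config["eos_token"] = FIXED_EOS
        return True
    return False


def patch_file(path: Path) -> bool:
    """Hypothetical helper: rewrite the file only when a fix was applied."""
    config = json.loads(path.read_text(encoding="utf-8"))
    changed = fix_eos_token(config)
    if changed:
        text = json.dumps(config, ensure_ascii=False, indent=2) + "\n"
        path.write_text(text, encoding="utf-8")
    return changed
```

Calling patch_file(Path("tokenizer_config.json")) in the downloaded model directory before converting should be enough. The function leaves the file untouched if the token is already correct.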

ngxson

I'm not sure about the tokenizer_pre == "megrez" part (if other collaborators know, please feel free to review this PR).

The template part looks good to me.

arch-btw · 2024-12-20T04:37:42Z

Thanks for doing this. I was trying it myself but didn't finish it. Just so you know, they fixed the eos_token 30 minutes ago.

Support InfiniAI Megrez 3b

a02c63d

github-actions bot added the testing and python labels on Dec 19, 2024

ngxson reviewed Dec 19, 2024

slaren reviewed Dec 20, 2024

src/llama.cpp (outdated, resolved)

dixyes force-pushed the megrez branch 2 times, most recently from 048d345 to 73f3d01, on December 22, 2024 06:55

Fix tokenizer_clean_spaces for megrez

01a0c36

dixyes force-pushed the megrez branch from 73f3d01 to 01a0c36 on December 22, 2024 06:56
