
Conversation

mlinmg
Contributor

@mlinmg mlinmg commented Mar 31, 2025

FIX #13251
FIX #13317
FIX #13441
FIX #14346

With this PR I want to add the Ovis architecture to vLLM, continuing the discussion at AIDC-AI/Ovis#70.


👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, covering a small, essential subset of tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

@mergify mergify bot added the frontend label Mar 31, 2025
@mlinmg mlinmg force-pushed the Ovis-model-addition branch from 5859227 to 96f0c10 Compare March 31, 2025 15:50
@mergify mergify bot added documentation Improvements or additions to documentation ci/build v1 labels Mar 31, 2025
@mlinmg mlinmg force-pushed the Ovis-model-addition branch from 96f0c10 to 1976b0d Compare March 31, 2025 15:55
@Isotr0py
Member

I think you can create an aimv2.py file under vllm/model_executor/models and put them there (just like clip.py, since this is the ViT part).

@mlinmg
Contributor Author

mlinmg commented Mar 31, 2025

OK, cool. Also, since they use the Qwen2 image preprocessor and I can't modify it to accept new kwargs, how would you expose the mm_kwargs to the image_processor call? @JumpingRain, how have you implemented it?

@mlinmg
Contributor Author

mlinmg commented Mar 31, 2025

I've added the config and the processor classes to the modeling file, since they are not present in transformers.

@Isotr0py
Member

It seems some of these changes have already landed on main. Can you try to rebase the PR onto the main branch?

@JumpingRain

@mlinmg @Isotr0py Hello, I'm currently using the following method to set max_partition as an initialization parameter for OvisProcessor:

import PIL.Image
from transformers.processing_utils import ProcessorMixin


class OvisProcessor(ProcessorMixin):
    attributes = ["image_processor", "tokenizer"]
    valid_kwargs = ["chat_template"]

    image_processor_class = "AutoImageProcessor"
    tokenizer_class = ("Qwen2Tokenizer", "Qwen2TokenizerFast")

    def __init__(self, image_processor=None, tokenizer=None, chat_template=None, **kwargs):
        self.image_token = "<|image_pad|>" if not hasattr(tokenizer, "image_token") else tokenizer.image_token
        self.video_token = "<|video_pad|>" if not hasattr(tokenizer, "video_token") else tokenizer.video_token
        # Instance-level defaults, overridable at init time via kwargs.
        self.max_partition = kwargs.get('max_partition', 9)
        self.covering_threshold = kwargs.get('covering_threshold', 0.9)
        self.convert_to_rgb = kwargs.get('convert_to_rgb', True)
        self.return_tensors = kwargs.get('return_tensors', 'pt')
        super().__init__(image_processor, tokenizer, chat_template=chat_template, **kwargs)

    def preprocess_image(self, image: PIL.Image.Image, max_partition=None, covering_threshold=None, convert_to_rgb=None, return_tensors=None):
        # Per-call arguments take precedence over the instance-level defaults.
        max_partition = max_partition if max_partition is not None else self.max_partition
        covering_threshold = covering_threshold if covering_threshold is not None else self.covering_threshold
        convert_to_rgb = convert_to_rgb if convert_to_rgb is not None else self.convert_to_rgb
        return_tensors = return_tensors if return_tensors is not None else self.return_tensors
        # other code
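
For context, a hypothetical per-call override (assuming processor is an instantiated OvisProcessor; the image path is illustrative) would look like:

from PIL import Image

image = Image.open("example.jpg")
# Override the init-time default (max_partition=9) for this call only.
patches = processor.preprocess_image(image, max_partition=16)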

@runninglsy

Thanks for the valuable contributions to Ovis. I kindly suggest using 'ovis2' or 'Ovis2' for the model_type and in the code, to help differentiate it from previous versions like Ovis, Ovis1.5, and Ovis1.6. This would also facilitate future versioning, such as Ovis2.X or Ovis3.

@mlinmg
Contributor Author

mlinmg commented Apr 1, 2025

Thanks for the valuable contributions to Ovis. I kindly suggest using 'ovis2' or 'Ovis2' for the model_type and in the code, to help differentiate it from previous versions like Ovis, Ovis1.5, and Ovis1.6. This would also facilitate future versioning, such as Ovis2.X or Ovis3.

I've modified it, but you'll also need to modify the configuration/processing files and the auto-config registration to use the corrected naming (Ovis2).
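
For illustration, a minimal sketch of what the renamed auto-config registration could look like (the Ovis2Config class here is a hypothetical stub, not the PR's actual file):

from transformers import AutoConfig, PretrainedConfig

class Ovis2Config(PretrainedConfig):
    # Hypothetical minimal config; the real class carries the full
    # set of Ovis2 model fields.
    model_type = "ovis2"

# Register so AutoConfig.from_pretrained can resolve "ovis2" checkpoints.
AutoConfig.register("ovis2", Ovis2Config)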


mergify bot commented Apr 1, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @mlinmg.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Apr 1, 2025
@mlinmg
Contributor Author

mlinmg commented Apr 2, 2025 via email

@mlinmg mlinmg force-pushed the Ovis-model-addition branch from 23476d8 to 64985c1 Compare April 2, 2025 09:30
@mergify mergify bot removed the needs-rebase label Apr 2, 2025
@JumpingRain

@mlinmg @Isotr0py Thank you for your outstanding work; it seems Ovis is on track to integrate smoothly with vLLM! Is there anything I can assist with on my end?

Additionally, you previously mentioned the need to modify the Ovis Hugging Face files. Since we want the released weights to remain compatible with the historical code, we hope to achieve the Ovis HF file modification with minimal changes. In my local tests, I found this can be accomplished by modifying config.json and the tokenizer config. Could you specify which parts of the Ovis HF code need to be modified to support vLLM? I can make the necessary adjustments quickly.

@Isotr0py
Member

In my local tests, I found this can be accomplished by modifying config.json and the tokenizer config. Could you specify which parts of the Ovis HF code need to be modified to support vLLM? I can make the necessary adjustments quickly.

Currently, this PR requires passing a modified tokenizer hosted on HF (tokenizer = "Isotr0py/Ovis2-tokenizer") when initializing the vLLM instance, so that users don't need to make any modifications locally. But it would be best if it could be upstreamed to the Ovis2 model repos!
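
For example, a minimal sketch of such an initialization (the model checkpoint name is illustrative):

from vllm import LLM

# Point vLLM at the modified tokenizer so no local changes are needed.
llm = LLM(
    model="AIDC-AI/Ovis2-1B",
    tokenizer="Isotr0py/Ovis2-tokenizer",
    trust_remote_code=True,
)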

Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Isotr0py <[email protected]>

mergify bot commented Apr 24, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @mlinmg.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Apr 24, 2025
@mergify mergify bot removed the needs-rebase label Apr 24, 2025
Signed-off-by: Isotr0py <[email protected]>
@sukkritsharmaofficial

Eagerly waiting for this support, @Isotr0py @DarkLight1337. Let me know if I can help in any way.

Member

@Isotr0py Isotr0py left a comment


Tests still pass locally on my side, so let's move this forward!

@Isotr0py Isotr0py enabled auto-merge (squash) April 29, 2025 16:35
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 29, 2025
@Isotr0py Isotr0py merged commit 54072f3 into vllm-project:main Apr 30, 2025
63 checks passed
radeksm pushed a commit to radeksm/vllm that referenced this pull request May 2, 2025
Signed-off-by: Marco <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
Signed-off-by: Marco <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Signed-off-by: Mu Huai <[email protected]>
zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025
Signed-off-by: Marco <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Signed-off-by: Yuqi Zhang <[email protected]>