[Model] add colqwen2_vl code & inference #14291

BloomBerry · 2025-03-05T14:35:37Z

Add support for ColQwen2VL model
Description
This PR adds support for the ColQwen2VL model to vLLM. ColQwen2VL is an efficient document retrieval vision language model based on Qwen2VL, as described in the paper "ColPali: Efficient Document Retrieval with Vision Language Models". The model is designed to generate embeddings rather than text outputs, making it suitable for document retrieval applications.
Key implementation details:
Extended the existing Qwen2VL implementation for ColQwen2VL compatibility
Implemented custom text projection layer and L2 normalization for embedding generation
Added appropriate processing utilities for image and video inputs
Overrode forward, compute_logits and sample methods to optimize for embedding output
This implementation enables users to leverage ColQwen2VL's multimodal document retrieval capabilities through vLLM's efficient serving infrastructure.
Testing
Tested with sample image inputs
Verified embedding output format and dimensions
Confirmed compatibility with HuggingFace ColQwen2VL models

github-actions · 2025-03-05T14:35:51Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: BloomBerry <[email protected]>

DarkLight1337 · 2025-03-05T14:59:08Z

Thanks for implementing this! Can you update the following files as well?

Supported Models page
Test registry tests/models/test_registry.py
Model correctness tests tests/models/embedding/vision_language
Processor correctness tests tests/models/multimodal/processing/test_common.py

Signed-off-by: BloomBerry <[email protected]>

add colqwen2_vl code & inference

67574c0

mergify bot added the documentation Improvements or additions to documentation label Mar 5, 2025

BloomBerry added 2 commits March 5, 2025 23:55

Add ColQwen2VL model implementation

acf3d8c

Signed-off-by: BloomBerry <[email protected]>

Merge branch 'vllm-project:main' into colqwen2_vl

1e26ffc

DarkLight1337 changed the title ~~add colqwen2_vl code & inference~~ [Model] add colqwen2_vl code & inference Mar 5, 2025

BloomBerry added 4 commits March 6, 2025 00:22

colqwen2_vl inference code

6f130dd

Signed-off-by: BloomBerry <[email protected]>

add supported models md

919c9e9

Signed-off-by: BloomBerry <[email protected]>

add test_registry

f175e5b

Signed-off-by: BloomBerry <[email protected]>

add test_colqwen2vl code

1df7a7e

Signed-off-by: BloomBerry <[email protected]>

BloomBerry requested review from DarkLight1337 and ywang96 as code owners March 5, 2025 15:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] add colqwen2_vl code & inference #14291

[Model] add colqwen2_vl code & inference #14291

BloomBerry commented Mar 5, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Mar 5, 2025

DarkLight1337 commented Mar 5, 2025

[Model] add colqwen2_vl code & inference #14291

Are you sure you want to change the base?

[Model] add colqwen2_vl code & inference #14291

Conversation

BloomBerry commented Mar 5, 2025 • edited by github-actions bot Loading

github-actions bot commented Mar 5, 2025

DarkLight1337 commented Mar 5, 2025

BloomBerry commented Mar 5, 2025 •

edited by github-actions bot

Loading