Add ImageProcessorFast to Qwen2.5-VL processor #36164

Isotr0py · 2025-02-13T09:16:21Z

What does this PR do?

Add Qwen2_5_VLImageProcessorFast to Qwen2.5-VL modular file

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@yonigozlan @ArthurZucker

Signed-off-by: isotr0py <[email protected]>

ArthurZucker

🔥

src/transformers/models/auto/image_processing_auto.py

ArthurZucker · 2025-02-13T10:52:06Z

src/transformers/models/qwen2_5_vl/modular_qwen2_5_vl.py

@@ -900,7 +943,7 @@ class Qwen2_5_VLProcessor(Qwen2VLProcessor):
            in a chat into a tokenizable string.
    """

-    image_processor_class = "Qwen2_5_VLImageProcessor"
+    image_processor_class = "AutoImageProcessor"


most needed!

src/transformers/models/qwen2_5_vl/image_processing_qwen2_5_vl.py

HuggingFaceDocBuilderDev · 2025-02-13T11:16:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Signed-off-by: isotr0py <[email protected]>

ArthurZucker

Nice thanks 🤗

ArthurZucker · 2025-02-13T15:39:44Z

src/transformers/__init__.py

@@ -6442,7 +6441,6 @@
            PoolFormerImageProcessor,
        )
        from .models.pvt import PvtImageProcessor
-        from .models.qwen2_5_vl import Qwen2_5_VLImageProcessor


Not breaking because we added it in this release!

hiyouga · 2025-02-14T10:20:35Z

Hi @Isotr0py could you please update https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct/blob/main/preprocessor_config.json according to this PR's modification? I use the latest version of transformers and it gives error:

processor = AutoProcessor.from_pretrained("Qwen/Qwen2.5-VL-7B-Instruct")
Using a slow image processor as `use_fast` is unset and a slow processor was saved with this model. `use_fast=True` will be the default behavior in v4.48, even if the model was saved with a slow processor. This will result in minor differences in outputs. You'll still be able to use a slow processor with `use_fast=False`.
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python3.10/dist-packages/transformers/models/auto/processing_auto.py", line 334, in from_pretrained
    return processor_class.from_pretrained(
  File "/usr/local/lib/python3.10/dist-packages/transformers/processing_utils.py", line 1043, in from_pretrained
    args = cls._get_arguments_from_pretrained(pretrained_model_name_or_path, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/transformers/processing_utils.py", line 1089, in _get_arguments_from_pretrained
    args.append(attribute_class.from_pretrained(pretrained_model_name_or_path, **kwargs))
  File "/usr/local/lib/python3.10/dist-packages/transformers/models/auto/image_processing_auto.py", line 569, in from_pretrained
    raise ValueError(
ValueError: Unrecognized image processor in Qwen/Qwen2.5-VL-7B-Instruct. Should have a `image_processor_type` key in its preprocessor_config.json of config.json, or one of the following `model_type` keys in its config.json: align, aria, beit, bit, blip, blip-2, bridgetower, chameleon, chinese_clip, clip, clipseg, conditional_detr, convnext, convnextv2, cvt, data2vec-vision, deformable_detr, deit, depth_anything, depth_pro, deta, detr, dinat, dinov2, donut-swin, dpt, efficientformer, efficientnet, flava, focalnet, fuyu, git, glpn, got_ocr2, grounding-dino, groupvit, hiera, idefics, idefics2, idefics3, ijepa, imagegpt, instructblip, instructblipvideo, kosmos-2, layoutlmv2, layoutlmv3, levit, llava, llava_next, llava_next_video, llava_onevision, mask2former, maskformer, mgp-str, mllama, mobilenet_v1, mobilenet_v2, mobilevit, mobilevitv2, nat, nougat, oneformer, owlv2, owlvit, paligemma, perceiver, pix2struct, pixtral, poolformer, pvt, pvt_v2, qwen2_5_vl, qwen2_vl, regnet, resnet, rt_detr, sam, segformer, seggpt, siglip, superglue, swiftformer, swin, swin2sr, swinv2, table-transformer, timesformer, timm_wrapper, tvlt, tvp, udop, upernet, van, videomae, vilt, vipllava, vit, vit_hybrid, vit_mae, vit_msn, vitmatte, xclip, yolos, zoedepth

hiyouga · 2025-02-14T10:27:21Z

~~The image_processor_type key should be updated to avoid errors: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct/blob/main/preprocessor_config.json#L17~~

~~cc @simonJJJ @ShuaiBai623~~

It has been fixed at the hf repo: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct/blob/main/preprocessor_config.json

Isotr0py · 2025-02-14T10:45:00Z

@hiyouga Thanks for feedback! Just opened PR in model's repo for update: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct/discussions/24

Isotr0py added 6 commits February 13, 2025 15:06

add qwen2 fast image processor to modular file

ce16648

Signed-off-by: isotr0py <[email protected]>

fix modular

9d3063a

Signed-off-by: isotr0py <[email protected]>

fix circle import

762be48

Signed-off-by: isotr0py <[email protected]>

add docs

099cb99

Signed-off-by: isotr0py <[email protected]>

Merge branch 'main' into qwen2_5-fastprocessor

bd06d88

fix typo

1b96d6e

Signed-off-by: isotr0py <[email protected]>

Isotr0py marked this pull request as ready for review February 13, 2025 10:26

Isotr0py added 3 commits February 13, 2025 18:26

Merge branch 'main' into qwen2_5-fastprocessor

b8262bb

add modular generated files

0b57b91

Signed-off-by: isotr0py <[email protected]>

Merge branch 'main' into qwen2_5-fastprocessor

1b8c746

ArthurZucker approved these changes Feb 13, 2025

View reviewed changes

Isotr0py added 12 commits February 13, 2025 19:45

revert qwen2vl fast image processor

5a933d0

Signed-off-by: isotr0py <[email protected]>

remove qwen2.5-vl image processor from modular

228f2ee

Signed-off-by: isotr0py <[email protected]>

re-generate qwen2.5-vl files

e68dd27

Signed-off-by: isotr0py <[email protected]>

remove unnecessary test

b21f773

Signed-off-by: isotr0py <[email protected]>

fix auto map

0827393

Signed-off-by: isotr0py <[email protected]>

cleanup

fac86d5

Signed-off-by: isotr0py <[email protected]>

fix model_input_names

cf1ed99

Signed-off-by: isotr0py <[email protected]>

Merge branch 'main' into qwen2_5-fastprocessor

b92c728

remove import

f45bbac

Signed-off-by: isotr0py <[email protected]>

Merge branch 'main' into qwen2_5-fastprocessor

97469e7

make fix-copies

7e35cd2

Signed-off-by: isotr0py <[email protected]>

Merge branch 'main' into qwen2_5-fastprocessor

cc350d0

ArthurZucker approved these changes Feb 13, 2025

View reviewed changes

Isotr0py merged commit 33d1d71 into huggingface:main Feb 14, 2025
25 checks passed

Isotr0py deleted the qwen2_5-fastprocessor branch February 14, 2025 09:35

Isotr0py mentioned this pull request Feb 14, 2025

[Bugfix] Fix qwen2.5-vl image processor vllm-project/vllm#13286

Merged

hiyouga mentioned this pull request Feb 14, 2025

transformers for qwen-2_5_vl has updated hiyouga/LLaMA-Factory#6941

Closed

1 task

Jintao-Huang mentioned this pull request Feb 15, 2025

微调Qwen2_5_VL模型时报错：ImportError: cannot import name 'Qwen2_5_VLForConditionalGeneration' from 'transformers' modelscope/ms-swift#3109

Closed

Daheer mentioned this pull request Feb 16, 2025

smart_resize import issue QwenLM/Qwen2.5-VL#787

Merged

hiyouga mentioned this pull request Feb 17, 2025

Qwen2.5-VL *B模型 SFT训练&使用vllm推理报错：AttributeError: 'NoneType' object has no attribute 'image_processor' hiyouga/LLaMA-Factory#6965

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ImageProcessorFast to Qwen2.5-VL processor #36164

Add ImageProcessorFast to Qwen2.5-VL processor #36164

Isotr0py commented Feb 13, 2025 •

edited

Loading

ArthurZucker left a comment

ArthurZucker Feb 13, 2025

HuggingFaceDocBuilderDev commented Feb 13, 2025

ArthurZucker left a comment

ArthurZucker Feb 13, 2025

hiyouga commented Feb 14, 2025 •

edited

Loading

hiyouga commented Feb 14, 2025 •

edited

Loading

Isotr0py commented Feb 14, 2025

Add ImageProcessorFast to Qwen2.5-VL processor #36164

Add ImageProcessorFast to Qwen2.5-VL processor #36164

Conversation

Isotr0py commented Feb 13, 2025 • edited Loading

What does this PR do?

Before submitting

Who can review?

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Feb 13, 2025

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Feb 13, 2025

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Feb 13, 2025

Choose a reason for hiding this comment

hiyouga commented Feb 14, 2025 • edited Loading

hiyouga commented Feb 14, 2025 • edited Loading

Isotr0py commented Feb 14, 2025

Isotr0py commented Feb 13, 2025 •

edited

Loading

hiyouga commented Feb 14, 2025 •

edited

Loading

hiyouga commented Feb 14, 2025 •

edited

Loading