[OpenVINO] Add support for GLM-4.1V-9B-Thinking #1387
Conversation
Thanks for the PR @openvino-dev-samples
```python
    return causal_mask


def _glm4v_update_causal_mask(
```
Why is this needed? I was thinking that we can remove `_glm4v_update_causal_mask` and `_glm4v_prepare_4d_causal_attention_mask_with_cache_position`, as we should be compatible with `create_causal_mask`: https://github.com/huggingface/transformers/blob/v4.53.0/src/transformers/models/glm4v/modular_glm4v.py#L982 (#1377 will soon be merged)
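For reference, a minimal sketch of what relying on the shared helper could look like; the keyword arguments mirror the pattern used by transformers v4.53 decoder models and are assumptions here, not code from this PR:

```python
# Sketch only: replacing the custom mask helpers with transformers'
# shared create_causal_mask. Argument names follow the pattern used by
# transformers v4.53 decoder models; check against the pinned version.
from transformers.masking_utils import create_causal_mask

causal_mask = create_causal_mask(
    config=self.config,
    input_embeds=inputs_embeds,
    attention_mask=attention_mask,
    cache_position=cache_position,
    past_key_values=past_key_values,
)
```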
Currently there is an accuracy issue when converting `torch.vmap`, which is used inside `create_causal_mask`.
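For context, transformers' masking utilities vectorize a per-position mask predicate with `torch.vmap`, roughly like the illustrative reduction below (not the library's exact code); this is the pattern the conversion currently struggles with:

```python
# Illustrative reduction of the vmap-based masking pattern; not the
# exact transformers code. mask[q, kv] is True where key position kv is
# visible to query position q, i.e. a lower-triangular causal mask.
import torch

def causal_predicate(q_idx, kv_idx):
    return kv_idx <= q_idx

q = torch.arange(5)
kv = torch.arange(5)
# vmap over key positions (inner), then over query positions (outer)
mask = torch.vmap(torch.vmap(causal_predicate, in_dims=(None, 0)), in_dims=(0, None))(q, kv)
print(mask)  # 5x5 lower-triangular boolean mask
```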
```python
def init_model_configs():
    if "open_clip" not in TasksManager._LIBRARY_TO_SUPPORTED_MODEL_TYPES:
        TasksManager._LIBRARY_TO_SUPPORTED_MODEL_TYPES["open_clip"] = {}
    TasksManager._CUSTOM_CLASSES[("pt", "glm4v", "image-text-to-text")] = (
```
Why not use `AutoModelForImageTextToText` directly here to load all image-text-to-text task models? https://github.com/huggingface/transformers/blob/5dba4bc7b2c1ef517ed44bba76bb70b59001c737/src/transformers/models/auto/modeling_auto.py#L941
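That is, something along these lines; the checkpoint id is shown purely for illustration:

```python
# Sketch: the generic auto class resolves any architecture registered
# under the image-text-to-text auto mapping, so no per-model entry in
# _CUSTOM_CLASSES would be needed. Checkpoint id is illustrative.
from transformers import AutoModelForImageTextToText

model = AutoModelForImageTextToText.from_pretrained("zai-org/GLM-4.1V-9B-Thinking")
```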
"phi4_multimodal": _OVPhi4MMForCausalLM, | ||
"llama4": _OVLlama4ForCausalLM, | ||
"glm4v": _OVGlm4vForCausalLM, | ||
} |
Would you mind adding a test with a tiny random model?
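For illustration, such tests are typically wired by registering the new architecture in the test matrix alongside a tiny-random checkpoint; a hedged sketch, where the variable names and the checkpoint id are assumptions rather than the repository's actual entries:

```python
# Hedged sketch of adding a tiny random model to the test suite.
# The checkpoint id below is hypothetical.
MODEL_NAMES = {
    # ... existing entries ...
    "glm4v": "hf-internal-testing/tiny-random-glm4v",  # hypothetical id
}

SUPPORTED_ARCHITECTURES = [
    # ... existing entries ...
    "glm4v",
]
```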
```python
        )

        if input_name == "seqlens":
            return torch.tensor([grid_t * grid_h * grid_w], dtype=torch.int64)
```
Question: do we need to generate `seqlens` in the input generator, or should we infer it directly in the patch instead (given `hidden_states`, for example)?
@echarlaix Sorry, I don't understand what "infer it directly in the patch instead" means. Could you share a link to some example code? Thanks!
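A minimal sketch of that alternative, deriving `seqlens` inside the patched vision forward from tensors that are already available rather than exporting it as a dummy input; the function name and tensor layout below are assumptions, not the actual patch in this PR:

```python
import torch

# Illustrative only: derive `seqlens` from grid_thw inside the patch
# instead of generating it as an export input.
def infer_seqlens(grid_thw: torch.Tensor) -> torch.Tensor:
    # grid_thw has shape [num_images, 3] = (t, h, w); each image yields
    # one sequence of t * h * w patch tokens
    return (grid_thw[:, 0] * grid_thw[:, 1] * grid_thw[:, 2]).to(torch.int64)

grid_thw = torch.tensor([[1, 16, 16]])
print(infer_seqlens(grid_thw))  # tensor([256])
```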
```python
                [grid_t * grid_h * grid_w, 2], max_value=grid_h, framework=framework, dtype=int_dtype
            )

        if input_name == "grid_thw":
```
Same question: shouldn't it be inferred in the patch?
It comes directly from the forward function.
```python
            dim = self.embed_dim // self.num_heads // 2
            return self.random_float_tensor([grid_h * grid_t * grid_w, dim], framework=framework, dtype=float_dtype)

        if input_name == "image_type_ids":
```
It looks like `DummyGlm4vVisionEmbedInputGenerator` could inherit from `DummyQwen2VLVisionEmbedInputGenerator`, as the two are very close (you would just need to override `generate` and add `image_type_ids`). What do you think?
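A hedged sketch of the proposed inheritance; the base-class attributes and the `random_int_tensor` helper follow optimum's dummy input generator conventions, but the exact names in the repo may differ:

```python
# Sketch only: reuse the Qwen2-VL generator and add image_type_ids.
class DummyGlm4vVisionEmbedInputGenerator(DummyQwen2VLVisionEmbedInputGenerator):
    SUPPORTED_INPUT_NAMES = DummyQwen2VLVisionEmbedInputGenerator.SUPPORTED_INPUT_NAMES + (
        "image_type_ids",
    )

    def generate(self, input_name, framework="pt", int_dtype="int64", float_dtype="fp32"):
        if input_name == "image_type_ids":
            num_patches = self.grid_t * self.grid_h * self.grid_w  # assumed attributes
            return self.random_int_tensor([num_patches], max_value=1, framework=framework, dtype=int_dtype)
        return super().generate(input_name, framework, int_dtype, float_dtype)
```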
f"Initialization model for {self.config.model_type} required at least transformers >= 4.45" | ||
) | ||
|
||
def get_rope_index( |
Would you mind adding a link to the original code, https://github.com/huggingface/transformers/blob/v4.53.3/src/transformers/models/glm4v/modular_glm4v.py#L1014, and highlighting any modifications, if any?
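For instance, a short provenance comment above the copied function; the signature shown here is illustrative, not necessarily the one in this PR:

```python
# Copied from
# https://github.com/huggingface/transformers/blob/v4.53.3/src/transformers/models/glm4v/modular_glm4v.py#L1014
# Modifications from the original, if any, are noted below.
def get_rope_index(self, input_ids, image_grid_thw=None, video_grid_thw=None, attention_mask=None):
    """See the linked transformers source for the reference implementation."""
```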
No description provided.