[Model] Rename MiniCPMVQwen2 to MiniCPMV2.6 #7273

Merged: 5 commits, Aug 8, 2024

Conversation

@jeejeelee (Contributor) commented Aug 7, 2024

I have completed the following modification:

ping @ywang96 @DarkLight1337 @HwwwwwwwH

github-actions bot commented Aug 7, 2024

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small, essential subset of CI tests to quickly catch errors. You can run other CI tests on top of the default ones by unblocking the steps in your fastcheck build in the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

  • Comment /ready on the PR
  • Add ready label to the PR
  • Enable auto-merge.

🚀

@DarkLight1337 (Member)

Could you also update the docs to include this in the list of supported models?

@HwwwwwwwH (Contributor) commented Aug 8, 2024

Sorry for the late reply, and thank you for your work! A few things still need attention. The MiniCPM-V example in examples/offline_inference_vision_language.py should be extended to cover all three versions, and the three versions need different stop_token_ids. Otherwise, if a user runs examples/offline_inference_vision_language.py with, say, V2.5, the output may end with many <|eot|> tokens.
So the logic of offline_inference_vision_language.py may need a slight modification.

@jeejeelee (Contributor, Author)

> Sorry for the late reply, and thank you for your work! A few things still need attention. The MiniCPM-V example in examples/offline_inference_vision_language.py should be extended to cover all three versions, and the three versions need different stop_token_ids. Otherwise, if a user runs examples/offline_inference_vision_language.py with, say, V2.5, the output may end with many <|eot|> tokens. So the logic of offline_inference_vision_language.py may need a slight modification.

Ok, is that right?

  # MiniCPM-V 2.0
  stop_token_ids = [tokenizer.eos_id]
  # MiniCPM-V 2.5
  # stop_token_ids = [tokenizer.eos_id, tokenizer.eot_id]
  # MiniCPM-V 2.6
  # stop_tokens = ['<|im_end|>', '<|endoftext|>']
  # stop_token_ids = [tokenizer.convert_tokens_to_ids(i) for i in stop_tokens]
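
For reference, a minimal sketch of how these per-version stop_token_ids could be wired into SamplingParams in the example script; the repo id and the version switch below are illustrative assumptions, not the code merged in this PR:

  # Illustrative sketch, not the merged example code: pick stop_token_ids per
  # MiniCPM-V version and pass them to SamplingParams.
  from transformers import AutoTokenizer
  from vllm import LLM, SamplingParams

  version = "2.6"                    # assumed switch: "2.0", "2.5", or "2.6"
  model = "openbmb/MiniCPM-V-2_6"    # assumed HF repo id for the chosen version

  tokenizer = AutoTokenizer.from_pretrained(model, trust_remote_code=True)

  if version == "2.0":
      stop_token_ids = [tokenizer.eos_id]
  elif version == "2.5":
      stop_token_ids = [tokenizer.eos_id, tokenizer.eot_id]
  else:  # 2.6 uses a Qwen2 tokenizer, so resolve the stop strings to ids
      stop_tokens = ['<|im_end|>', '<|endoftext|>']
      stop_token_ids = [tokenizer.convert_tokens_to_ids(t) for t in stop_tokens]

  llm = LLM(model=model, trust_remote_code=True)
  sampling_params = SamplingParams(temperature=0.0, max_tokens=64,
                                   stop_token_ids=stop_token_ids)

With the version-specific stop_token_ids set, V2.5 output should stop at <|eot|> instead of repeating it.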

@HwwwwwwwH (Contributor)

> Ok, is that right?

Yes

@jeejeelee (Contributor, Author)

> Ok, is that right?
>
> Yes

@DarkLight1337 I am not sure if I can add these to the VL example.

@DarkLight1337 (Member)

Feel free to update the example!

@DarkLight1337 (Member)

@HwwwwwwwH can you try running the example on your end? I'm outside right now.

@HwwwwwwwH (Contributor) commented Aug 8, 2024

> @HwwwwwwwH can you try running the example on your end? I'm outside right now.

I'm running it now.

@HwwwwwwwH (Contributor)

> @HwwwwwwwH I'd like to know why you chose to use LlamaModel rather than LlamaForCausalLM as the language model for MiniCPM-V 2.5. This approach changes the model hierarchy, which is not conducive to LoRA support.

Emmmm, at first I used *ForCausalLM, but then I found that all of the *ForCausalLM classes drop the inputs_embeds parameter, while the *Model classes keep it. Using *Model is also what the other VLMs do, so I followed that approach.

@HwwwwwwwH (Contributor)

I got this error when running the example; it comes from gguf, and I don't see that dependency in the requirements files. Maybe it needs to be added?

Traceback (most recent call last):
  File "/data1/a1/vllm_e/examples/offline_inference_vision_language.py", line 10, in <module>
    from vllm import LLM, SamplingParams
  File "/data1/a1/vllm_e/vllm/__init__.py", line 3, in <module>
    from vllm.engine.arg_utils import AsyncEngineArgs, EngineArgs
  File "/data1/a1/vllm_e/vllm/engine/arg_utils.py", line 7, in <module>
    from vllm.config import (CacheConfig, DecodingConfig, DeviceConfig,
  File "/data1/a1/vllm_e/vllm/config.py", line 11, in <module>
    from vllm.model_executor.layers.quantization import QUANTIZATION_METHODS
  File "/data1/a1/vllm_e/vllm/model_executor/layers/quantization/__init__.py", line 16, in <module>
    from vllm.model_executor.layers.quantization.gguf import GGUFConfig
  File "/data1/a1/vllm_e/vllm/model_executor/layers/quantization/gguf.py", line 3, in <module>
    import gguf
ModuleNotFoundError: No module named 'gguf'
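
As an aside, a quick optional check (not part of this PR) to confirm that the missing gguf package is the cause before running the example:

  # Optional diagnostic, not part of this PR: verify that the gguf package,
  # which vLLM's GGUF quantization support imports, is installed.
  import importlib.util

  if importlib.util.find_spec("gguf") is None:
      raise SystemExit("The 'gguf' package is missing; it is in vLLM's "
                       "requirements on main, so sync the branch or install it.")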

@DarkLight1337 (Member)

> I got this error when running the example; it comes from gguf, and I don't see that dependency in the requirements files. Maybe it needs to be added?

It should be in the latest requirements.txt. Maybe you need to sync this branch with main.

@DarkLight1337 (Member) commented Aug 8, 2024

> @HwwwwwwwH I'd like to know why you chose to use LlamaModel rather than LlamaForCausalLM as the language model for MiniCPM-V 2.5. This approach changes the model hierarchy, which is not conducive to LoRA support.
>
> Emmmm, at first I used *ForCausalLM, but then I found that all of the *ForCausalLM classes drop the inputs_embeds parameter, while the *Model classes keep it. Using *Model is also what the other VLMs do, so I followed that approach.

Yeah, the existing VLMs use the *Model class rather than *ForCausalLM. There have been efforts to change this, though, by using @Isotr0py's recipe for loading inner models (see #7153).
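
To illustrate the point (a conceptual sketch only, with made-up names, not vLLM's actual MiniCPM-V implementation): a VLM wrapper builds the prompt embeddings itself, merges image features into them, and hands them to the inner *Model via inputs_embeds, which a *ForCausalLM entry point that only accepts input_ids would not allow.

  # Conceptual sketch only; illustrative names, not vLLM's real classes.
  import torch
  import torch.nn as nn

  class ToyVLM(nn.Module):
      def __init__(self, vision_encoder: nn.Module, language_model: nn.Module,
                   embed_tokens: nn.Embedding):
          super().__init__()
          self.vision_encoder = vision_encoder  # pixels -> image feature embeddings
          self.language_model = language_model  # a *Model-style backbone (e.g. LlamaModel)
          self.embed_tokens = embed_tokens      # text token embedding table

      @torch.no_grad()
      def forward(self, input_ids: torch.Tensor, pixel_values: torch.Tensor,
                  image_token_mask: torch.Tensor) -> torch.Tensor:
          # Embed the text tokens ourselves.
          inputs_embeds = self.embed_tokens(input_ids)
          # Overwrite the image-placeholder positions with vision features.
          inputs_embeds[image_token_mask] = self.vision_encoder(pixel_values)
          # Hand precomputed embeddings to the backbone; this only works because
          # the inner *Model still accepts inputs_embeds.
          return self.language_model(inputs_embeds=inputs_embeds)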

@HwwwwwwwH (Contributor)

> @HwwwwwwwH can you try running the example on your end? I'm outside right now.

The examples run fine.

@jeejeelee (Contributor, Author)

> Yeah, the existing VLMs use the *Model class rather than *ForCausalLM. There have been efforts to change this, though, by using @Isotr0py's recipe for loading inner models (see #7153).

Sorry for deleting my previous comments by mistake. I will read through that code, thank you.

DarkLight1337 enabled auto-merge (squash) on August 8, 2024 at 12:48
github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Aug 8, 2024
DarkLight1337 merged commit 757ac70 into vllm-project:main on Aug 8, 2024
60 checks passed
sfc-gh-mkeralapura pushed a commit to sfc-gh-mkeralapura/vllm that referenced this pull request Aug 12, 2024
kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Aug 17, 2024
jeejeelee deleted the optimize-minicpmv-code branch on August 19, 2024 at 08:09
fialhocoelho pushed a commit to opendatahub-io/vllm that referenced this pull request Aug 22, 2024
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024
Labels: ready (ONLY add when PR is ready to merge/full CI is needed)

Successfully merging this pull request may close these issues:
[New Model]: Is MiniCPM-V-2_6 supported?

3 participants