Qwen2.5VL

Note: Due to performance limitations, the RDK X5 4GB version can only run the small parameter version.

Qwen2.5-VL is the new flagship vision-language model of Qwen, and a significant leap compared to the previous Qwen2-VL.

1. Model Scale

ModelSize
qwen2.5vl:3b3.2GB
qwen2.5vl:7b6.0GB

2. Performance

image.png

image.png

3. Using Qwen2.5VL

3.1 Running Qwen2.5VL

Use the run command to start the model. If you have not downloaded this model before, it will be automatically pulled from the Ollama model library:

image-20250701190658392

3.2 Having a Conversation

The response time depends on the hardware configuration, please be patient!

image-20250701190728726

3.3 Vision Capabilities

test_pic

image-20250701195329370

3.4 Ending the Conversation

Use the Ctrl+d shortcut or /bye to end the conversation!

3.5 Chinese Conversation

If you don't have a Chinese input method, you can refer to the tutorial on switching Chinese input methods.

Chinese conversation:

image-20250628164219894

References

Ollama

Official Website: https://ollama.com/

GitHub: https://github.com/ollama/ollama

Qwen2.5VL

GitHub: https://github.com/QwenLM/Qwen2.5-VL

Ollama Corresponding Model: https://ollama.com/library/qwen2.5vl