Hello!
Any image sent at a native resolution within the default token limit does great with bbox and point tasks. If you upload a 4K image (3840 x 2160), the coordinates are off. To fix it, I tried changing [...].
Am I doing something wrong? It works 100% of the time when the image is within the default token limit, but it fails on larger images. Thanks for your time!
Replies: 1 comment
Got it figured out. You basically need to use the smart resizing from qwen-vl-utils: https://github.com/QwenLM/Qwen3-VL/blob/main/qwen-vl-utils/src/qwen_vl_utils/vision_process.py
The smart resizing fixed it all!
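For anyone landing here later, this is a rough sketch of the idea rather than the library's actual code. It assumes the resize snaps each side to a multiple of a patch factor and caps the total pixel count, and that the model returns pixel coordinates relative to the resized image. The names `approx_smart_resize` and `bbox_to_original` and the default values for `factor`, `min_pixels`, and `max_pixels` are illustrative assumptions; check `vision_process.py` for the real function and defaults for your model.

```python
import math


def approx_smart_resize(height, width, factor=28,
                        min_pixels=56 * 56, max_pixels=1280 * 28 * 28):
    """Hypothetical approximation of a 'smart resize' step.

    Returns (new_height, new_width), both multiples of `factor`, with the
    total pixel count kept roughly between min_pixels and max_pixels.
    All default values here are assumptions, not the library's settings.
    """
    # Snap each side to the nearest multiple of the patch factor.
    h_bar = max(factor, round(height / factor) * factor)
    w_bar = max(factor, round(width / factor) * factor)

    if h_bar * w_bar > max_pixels:
        # Too large: scale both sides down by the same ratio, then floor.
        beta = math.sqrt((height * width) / max_pixels)
        h_bar = max(factor, math.floor(height / beta / factor) * factor)
        w_bar = max(factor, math.floor(width / beta / factor) * factor)
    elif h_bar * w_bar < min_pixels:
        # Too small: scale both sides up by the same ratio, then ceil.
        beta = math.sqrt(min_pixels / (height * width))
        h_bar = math.ceil(height * beta / factor) * factor
        w_bar = math.ceil(width * beta / factor) * factor
    return h_bar, w_bar


def bbox_to_original(bbox, resized_hw, original_hw):
    """Map (x1, y1, x2, y2) from resized-image pixels to original pixels."""
    x1, y1, x2, y2 = bbox
    resized_h, resized_w = resized_hw
    original_h, original_w = original_hw
    scale_x = original_w / resized_w
    scale_y = original_h / resized_h
    return (x1 * scale_x, y1 * scale_y, x2 * scale_x, y2 * scale_y)


# A 4K frame as in the question: height 2160, width 3840.
orig_h, orig_w = 2160, 3840
new_h, new_w = approx_smart_resize(orig_h, orig_w)

# A box covering the whole resized image should map back to the full frame.
full_box = bbox_to_original((0, 0, new_w, new_h), (new_h, new_w), (orig_h, orig_w))
```

The point is that a 4K image gets shrunk before the model sees it, so any box it predicts lives in the smaller image's coordinate space. Resizing the image yourself with the same rule (or scaling the boxes back afterwards) keeps the coordinates lined up.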