Qwen-VL-Chat is a high-performing LVLM by Alibaba Cloud for text-image dialogue tasks, excelling in zero-shot captioning, VQA, and referring expression comprehension while supporting multilingual dialogue.
The Qwen-VL-Chat Model is a state-of-the-art Large Vision Language Model (LVLM) developed by Alibaba Cloud. It is designed to understand and generate human-like text based on both visual and textual inputs. It builds upon the capabilities of the Qwen-VL series, integrating advanced vision and language processing for various applications.
Qwen-VL-Chat LVLM
Qwen-VL-Chat is a variant within the Qwen-VL series, which also includes the base Qwen-VL model. These models are designed to process and integrate information from both images and text. Qwen-VL-Chat specifically enhances chat-based applications, enabling sophisticated dialogue systems that can interpret visual content alongside textual data.
Run Qwen-VL-Chat with an API
Running the API with Clarifai's Python SDK
You can run the Qwen-VL-Chat Model API using Clarifai’s Python SDK.
Export your PAT as an environment variable. Then, import and initialize the API Client.
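For example, the token can also be set from within Python before creating the client. This is a minimal sketch: the Clarifai Python SDK reads the CLARIFAI_PAT environment variable, and YOUR_PAT below is a placeholder for your own Personal Access Token.

import os

# The Clarifai Python SDK reads the Personal Access Token from CLARIFAI_PAT.
# Setting it here is equivalent to running `export CLARIFAI_PAT="YOUR_PAT"`
# in the shell before starting Python.
os.environ["CLARIFAI_PAT"] = "YOUR_PAT"  # placeholder; substitute your own PAT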
from clarifai.client.model import Model
from clarifai.client.input import Inputs

prompt = "What time of day is it?"
image_url = "https://samples.clarifai.com/metro-north.jpg"
inference_params = dict(temperature=0.2, max_tokens=100)

# Predict on a combined image + text input and print the generated text.
model_prediction = Model("https://clarifai.com/qwen/qwen-VL/models/qwen-VL-Chat").predict(
    inputs=[Inputs.get_multimodal_input(input_id="", image_url=image_url, raw_text=prompt)],
    inference_params=inference_params,
)
print(model_prediction.outputs[0].data.text.raw)
You can also run the Qwen-VL-Chat API using other Clarifai client libraries, such as Java, cURL, NodeJS, and PHP.
Use Cases
Qwen-VL-Chat is ideal for a variety of applications:
Customer Support: Assists customers by answering queries that involve understanding product images alongside textual descriptions (see the sketch after this list).
Educational Tools: Supports interactive learning by answering questions from educational content that includes diagrams and text.
Content Creation: Aids in generating descriptive content for images, suitable for journalism and social media.
Accessible Technology: Enhances tools for the visually impaired by describing images and interpreting visual data in text form.
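As an illustration of the customer-support use case, the same predict call shown above can pair a product photo with a customer's question. This is a minimal sketch: the product image URL and the question below are hypothetical placeholders, not real product data.

from clarifai.client.model import Model
from clarifai.client.input import Inputs

# Hypothetical customer-support query; the image URL and question are placeholders.
product_image_url = "https://example.com/images/blue-backpack.jpg"
question = "Does this backpack have a padded laptop compartment?"

model_prediction = Model("https://clarifai.com/qwen/qwen-VL/models/qwen-VL-Chat").predict(
    inputs=[Inputs.get_multimodal_input(input_id="", image_url=product_image_url, raw_text=question)],
    inference_params=dict(temperature=0.2, max_tokens=150),
)
print(model_prediction.outputs[0].data.text.raw)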
Evaluation and Benchmark Results
Qwen-VL-Chat was evaluated across multiple benchmarks:
Standard Benchmarks:
Zero-shot Captioning: Demonstrates strong capability in generating image descriptions without prior specific training on the dataset.
General VQA: Effective in answering general questions about images regarding judgment, colors, numbers, and categories.
Text-based VQA: Specializes in recognizing and answering questions about text within images.
Referring Expression Comprehension: Accurately localizes and describes objects within images based on textual descriptions.
TouchStone Benchmark:
Evaluates text-image dialogue capabilities and alignment with human-like conversational responses.
Covers over 300 images, 800+ questions across 27 diverse categories.
Results show Qwen-VL-Chat's superior performance in creating coherent and contextually appropriate responses in both English and Chinese.
| Model | Zero-shot Captioning | General VQA | Text-oriented VQA | Referring Expression Comprehension | TouchStone Score (English) | TouchStone Score (Chinese) |
| --- | --- | --- | --- | --- | --- | --- |
| Qwen-VL-Chat | 120.2 | 81.0 | 78.2 | 56.6 | 645.2 | 401.2 |
| Flamingo-9B | - | 61.5 | 51.8 | 44.7 | - | - |
| Flamingo-80B | - | 67.2 | 56.3 | 50.6 | - | - |
| Kosmos-1 | - | 67.1 | 51.0 | - | - | - |
| Kosmos-2 | - | 66.7 | 45.6 | - | - | - |
| BLIP-2 (Vicuna-13B) | 103.9 | 71.6 | 65.0 | 45.9 | - | - |
| InstructBLIP | 121.9 | 82.8 | - | - | - | - |
| Shikra | - | 73.9 | 77.36 | 47.16 | - | - |
| Previous SOTA | - | 127.0 | 84.5 | 66.1 | - | - |
The evaluation table highlights Qwen-VL-Chat's competitive performance across different tasks compared to other LVLMs. In zero-shot captioning, it scores 120.2, outperforming BLIP-2 (Vicuna-13B) at 103.9 and closely matching InstructBLIP at 121.9.
In general VQA and text-oriented VQA, Qwen-VL-Chat performs competitively, demonstrating its robustness in understanding both textual and visual content. In referring expression comprehension, it also shows promising results, although its score remains below the previous SOTA.
Advantages
Multimodal Integration: Seamlessly integrates text and image data, providing a holistic understanding that is crucial for various AI-driven applications.
Multi-language Capability: Supports dialogues in multiple languages, making it versatile and globally applicable.
High-Resolution Capability: Its ability to process high-resolution images allows for finer-grained visual understanding and higher accuracy on tasks requiring detailed visual comprehension.
Limitations
Data Dependency: Performance heavily relies on the diversity and quality of the training data.
Language Limitations: While it supports multiple languages, the quality of non-English language processing may vary depending on the specific language and the available training data.
ID: qwen-VL-Chat
Model Type ID: Multimodal To Text
Input Type: image
Output Type: text
Description: Qwen-VL-Chat is a high-performing LVLM by Alibaba Cloud for text-image dialogue tasks, excelling in zero-shot captioning, VQA, and referring expression comprehension while supporting multilingual dialogue.