MiniCPM-o-2d6-language belongs to the latest series of end-side multimodal LLMs (MLLMs), upgraded from MiniCPM-V. The models accept images, video, text, and audio as inputs and produce high-quality text output end to end.
The maximum number of tokens to generate. Lower limits produce faster responses.
A decimal number that controls the degree of randomness in the response. Higher values produce more varied output.
An alternative to sampling with temperature, in which the model considers only the tokens that make up the top_p probability mass.
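The three settings above can be illustrated with a minimal sketch of a sampling loop. This is not MiniCPM-o's actual decoder: the function names (`softmax_with_temperature`, `top_p_filter`, `sample_token`, `generate`) and the `next_logits` callback are hypothetical, chosen only to show how a token limit, temperature, and top_p typically interact.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities; lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_filter(probs, top_p):
    """Keep the most likely tokens until their cumulative probability reaches top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    # Renormalize so the kept probabilities sum to 1.
    return {i: probs[i] / mass for i in kept}


def sample_token(logits, temperature=1.0, top_p=1.0):
    """Sample one token id after temperature scaling and top_p filtering."""
    if temperature <= 0:
        # Assumption: treat non-positive temperature as greedy decoding.
        return max(range(len(logits)), key=lambda i: logits[i])
    nucleus = top_p_filter(softmax_with_temperature(logits, temperature), top_p)
    ids = list(nucleus)
    weights = [nucleus[i] for i in ids]
    return random.choices(ids, weights=weights, k=1)[0]


def generate(next_logits, max_tokens, temperature=1.0, top_p=1.0, eos_id=None):
    """Generate at most max_tokens token ids; next_logits is a hypothetical model callback."""
    tokens = []
    for _ in range(max_tokens):
        token = sample_token(next_logits(tokens), temperature, top_p)
        if token == eos_id:
            break
        tokens.append(token)
    return tokens
```

In this sketch, a lower token limit shortens the loop, which is why smaller limits respond faster, and a small top_p restricts sampling to a handful of high-probability tokens regardless of temperature.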
Notes
ID
Model Type ID: Multimodal To Text
Input Type: image
Output Type: text