Florence-2-large is a lightweight, versatile vision-language model by Microsoft, excelling in multiple tasks using a unified representation and the extensive FLD-5B dataset
0d58637d4d0c4f25b7e60ad8dd318dba
0d58637d4d0c4f25b7e60ad8dd318dba
OverviewVersions (1)Deployments
Input
Prompt:
Press Ctrl + Enter to submit
The maximum number of tokens to generate. Shorter token lengths will provide faster performance.
A decimal number that determines the degree of randomness in the response
The top-k parameter limits the model's predictions to the top k most probable tokens at each step of generation.
An alternative to sampling with temperature, where samples from the top p percentage of most likely tokens.
.
ResetModel loading...
Output
Notes
ID
Model Type ID
Multimodal To Text
Input Type
image
Output Type
text
Description
Florence-2-large is a lightweight, versatile vision-language model by Microsoft, excelling in multiple tasks using a unified representation and the extensive FLD-5B dataset