microsoft/florence/florence-2-large
8
Florence-2-large is a lightweight, versatile vision-language model by Microsoft, excelling in multiple tasks using a unified representation and the extensive FLD-5B dataset
  • ID
    florence-2-large
  • Type
    multimodal-to-text
  • Updated
    Oct 17, 2024
  • Input
  • Output
  • Config
  • Privacy
    Public
  • License
    MIT
  • Toolkit
    HuggingFace
  • Use Case
    instance-segmentationllmobject-detectionobject-trackingocrsemantic-segmentationvisual-question-answering
  • Share
    • Badge
      florence-2-large