general-image-embedding-vit

AI visual recognition model for returning 768-dimensional numerical vectors that represent the items in images and video.

Notes

This model produces embeddings: numerical vectors that represent the input images in a 768-dimensional space, computed with Clarifai’s ‘General’ model. The vectors of visually similar images lie close to each other in that space. The ‘General Embedding’ model can therefore be used for filtering, indexing, ranking, and organizing images by visual similarity, and as a feature extractor for transfer learning tasks.
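
As an illustration of ranking by visual similarity, the following minimal sketch orders candidate images by how close their embeddings are to a query embedding. It makes several assumptions: cosine similarity is used as the closeness measure (the notes above do not name a metric), the helper names `cosine_similarity` and `rank_by_similarity` are hypothetical, and the vectors are random placeholders standing in for real model outputs. No Clarifai API call is shown.

```python
import math
import random

DIM = 768  # embedding size stated for this model


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, candidates):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Placeholder data (assumption): random vectors instead of real embeddings.
rng = random.Random(0)
query = [rng.gauss(0, 1) for _ in range(DIM)]
near = [x + rng.gauss(0, 0.1) for x in query]  # small perturbation of the query
far = [rng.gauss(0, 1) for _ in range(DIM)]    # unrelated vector

ranking = rank_by_similarity(query, {"far.jpg": far, "near.jpg": near})
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

With real embeddings, the candidates dictionary would map image identifiers to the 768-dimensional vectors returned by the model; for large collections, an approximate nearest-neighbor index would usually replace the exhaustive loop.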

  • Version "dba3b4530ff4466a80812892823d51cc" was trained on a combination of ImageNet21k, OpenImages, and an internal Clarifai dataset.
  • Version "a78386d5142c4025ac42272b86f06134" was trained on ImageNet21k only.
  • ID
  • Name
    general-vision-transformer
  • Model Type ID
    Visual Embedder
  • Description
    AI visual recognition model for returning 768-dimensional numerical vectors that represent the items in images and video.
  • Last Updated
    Oct 16, 2024
  • Privacy
    PUBLIC
  • Toolkit
  • License