• Community
  • Model
  • general-image-recognition-vit

general-image-recognition-vit

Image recognition model for identifying different concepts in images and video including objects, themes, moods, and more.

Notes

General Information

  • Purpose: Classifier for a variety of concepts, common objects, etc. This model is a great all-purpose solution for most visual recognition needs with industry-leading performance.

  • Architecture: Vision Transformer

  • Intended Use: image indexing by tags, filtering, cascade routing

  • Limitations: works well when content is prevalent in the image

Training/Test Data

The model was trained and tested on an internal dataset with approximately 10,000 concepts and 20M images, with multiple concepts per image. The class distributions on train and validation sets are long-tailed.

  • ID
  • Name
    general-vision-transformer
  • Model Type ID
    Visual Classifier
  • Description
    Image recognition model for identifying different concepts in images and video including objects, themes, moods, and more.
  • Last Updated
    Oct 29, 2024
  • Privacy
    PUBLIC
  • License
  • Share
    • Badge
      general-image-recognition-vit