• Community
  • Model
  • general-image-detector-detic_C2_SwinB_896_lvis

general-image-detector-detic_C2_SwinB_896_lvis

--

Notes

Detecting Twenty-thousand Classes using Image-level Supervision

Detic: A Detector with image classes that can use image-level labels to easily train detectors.

Detecting Twenty-thousand Classes using Image-level Supervision,
Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra,
ECCV 2022 (arXiv 2201.02605)

Features

  • Detects any class given class names (using CLIP).

  • We train the detector on ImageNet-21K dataset with 21K classes.

  • Cross-dataset generalization to OpenImages and Objects365 without finetuning.

  • State-of-the-art results on Open-vocabulary LVIS and Open-vocabulary COCO.

Detic_C2_SwinB_896_4x Performance

Standard LVIS

NameTraining timemask mAPmask mAP_rare
Box-Supervised_C2_R50_640_4x17h31.525.6
Detic_C2_R50_640_4x22h33.229.7
Box-Supervised_C2_SwinB_896_4x43h40.735.9
Detic_C2_SwinB_896_4x47h41.741.7

Note

  • All Detic models use the overlap classes between ImageNet-21K and LVIS as image-labeled data;

  • The models with C2 are trained using our improved LVIS baseline in the paper, including CenterNet2 detector, Federated loss, large-scale jittering, etc.

  • ID
  • Name
    general-image-detector-detic_C2_SwinB_896_lvis
  • Model Type ID
    Visual Detector
  • Description
    --
  • Last Updated
    Aug 29, 2022
  • Privacy
    PUBLIC
  • License
  • Share
    • Badge
      general-image-detector-detic_C2_SwinB_896_lvis