- Community
- Model
- general-image-detector-detic_C2_SwinB_896_lvis
Notes
Detecting Twenty-thousand Classes using Image-level Supervision
Detic: A Detector with image classes that can use image-level labels to easily train detectors.
Detecting Twenty-thousand Classes using Image-level Supervision,
Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra,
ECCV 2022 (arXiv 2201.02605)
Features
Detects any class given class names (using CLIP).
We train the detector on ImageNet-21K dataset with 21K classes.
Cross-dataset generalization to OpenImages and Objects365 without finetuning.
State-of-the-art results on Open-vocabulary LVIS and Open-vocabulary COCO.
Detic_C2_SwinB_896_4x Performance
Standard LVIS
Name | Training time | mask mAP | mask mAP_rare |
---|---|---|---|
Box-Supervised_C2_R50_640_4x | 17h | 31.5 | 25.6 |
Detic_C2_R50_640_4x | 22h | 33.2 | 29.7 |
Box-Supervised_C2_SwinB_896_4x | 43h | 40.7 | 35.9 |
Detic_C2_SwinB_896_4x | 47h | 41.7 | 41.7 |
Note
All Detic models use the overlap classes between ImageNet-21K and LVIS as image-labeled data;
The models with C2 are trained using our improved LVIS baseline in the paper, including CenterNet2 detector, Federated loss, large-scale jittering, etc.
- ID
- Namegeneral-image-detector-detic_C2_SwinB_896_lvis
- Model Type IDVisual Detector
- Description--
- Last UpdatedAug 29, 2022
- PrivacyPUBLIC
- License
- Share
- Badge
Concept | Date |
---|