AI model for detecting the location of items including themes, people, moods, etc. in images and video.
Edge optimized AI model for identifying and detecting the location of people in images and video using YoloV5s
Image recognition model for identifying different concepts in images and video including objects, themes, moods, and more.
Text translation model from a romance language to English using sentence piece-based segmentation
Image moderation model for recognizing inappropriate content containing gore, drugs, explicit, suggestive or safe content within images and video.
AI models for locating clothing and fashion-related concepts such as jewelry, hats, etc. in images and videos.
AI model for anlyzing english text and transforming it from human-readable text to a computer-readable vector.
AI model for detecting and recognizing multi-language text in images with OCR.
AI visual recognition model for returning 1024-dimensional numerical vectors that represent the items in images and video.
AI model for detecting the location of human faces in images and video.
AI model for detecting whether an image or video has a celebrity face in it using face detection.
AI model for recognizing celebrity faces in images or video.
A general image workflow that combines detection, classification, and embedding to identify general concepts including objects, themes, moods, etc.
A workflow that combines detection, recognition, and embedding to generate face landmarks and enable visual search using detected faces's embeddings.
A workflow that combines detection, classification, and embedding functions to visually classify food items and enable visual search using embeddings.
A general image detection workflow that detects a variety of common objects, and enable visual search using general embeddings on detected regions.
A single-model workflow of text embedding model for general english text.
An image moderation workflow that combines detection, classification, and embedding to classify harmful content and enable visual search using embeddings.
A workflow that combines detection, classification, and embedding to classify travel-related properties and items, and enable visual search using embeddings.
A multi-model demographics workflow that detects, crops and recognize demographic characteristics of those faces.
A scene-text based ocr workflow that detects where the text is located in an image and returns the text including document and in-the-wild images.
A single-model text moderation workflow that combines classification and embedding functions to classify harmful text content.