This model is an instance segmentation network for one class - person. It is a PointRend based architecture with EfficientNet-B1 backbone, light-weight FPN, RPN, detection and segmentation heads.
Metric | Value |
---|---|
COCO val2017 box AP (person) | 35.7% |
COCO val2017 mask AP (person) | 30.9% |
Max objects to detect | 10 |
GFlops | 4.8492 |
MParams | 7.2996 |
Source framework | PyTorch* |
Average Precision (AP) is defined and measured according to standard COCO evaluation procedure.
Image, name: image
, shape: 1, 3, 320, 544
in the format 1, C, H, W
, where:
C
- number of channelsH
- image heightW
- image width
The expected channel order is BGR
Model has outputs with dynamic shapes.
- Name:
labels
, shape:-1
- Contiguous integer class ID for every detected object. - Name:
boxes
, shape:-1, 5
- Bounding boxes around every detected objects in (top_left_x, top_left_y, bottom_right_x, bottom_right_y) format and its confidence score in range [0, 1]. - Name:
masks
, shape:-1, 224, 224
- Segmentation heatmaps for every output bounding box.
The OpenVINO Training Extensions provide a training pipeline, allowing to fine-tune the model on custom dataset.
The model can be used in the following demos provided by the Open Model Zoo to show its capabilities:
[*] Other names and brands may be claimed as the property of others.