CVAT is free, online, interactive video and image annotation tool for computer vision. It is being used by our team to annotate million of objects with different properties. Many UI and UX decisions are based on feedbacks from professional data annotation team. Try it online cvat.org.
Format selection is possible after clicking on the Upload annotation and Dump annotation buttons. Datumaro dataset framework allows additional dataset transformations via its command line tool and Python library.
For more information about supported formats look at the documentation.
Annotation format | Import | Export |
---|---|---|
CVAT for images | X | X |
CVAT for a video | X | X |
Datumaro | X | |
PASCAL VOC | X | X |
Segmentation masks from PASCAL VOC | X | X |
YOLO | X | X |
MS COCO Object Detection | X | X |
TFrecord | X | X |
MOT | X | X |
LabelMe 3.0 | X | X |
ImageNet | X | X |
CamVid | X | X |
WIDER Face | X | X |
VGGFace2 | X | X |
Market-1501 | X | X |
ICDAR13/15 | X | X |
Open Images V6 | X | X |
Cityscapes | X | X |
KITTI | X | X |
LFW | X | X |
Name | Type | Framework | CPU | GPU |
---|---|---|---|---|
Deep Extreme Cut | interactor | OpenVINO | X | |
Faster RCNN | detector | OpenVINO | X | |
Mask RCNN | detector | OpenVINO | X | |
YOLO v3 | detector | OpenVINO | X | |
Object reidentification | reid | OpenVINO | X | |
Semantic segmentation for ADAS | detector | OpenVINO | X | |
Text detection v4 | detector | OpenVINO | X | |
YOLO v5 | detector | PyTorch | X | |
SiamMask | tracker | PyTorch | X | X |
f-BRS | interactor | PyTorch | X | |
HRNet | interactor | PyTorch | X | |
Inside-Outside Guidance | interactor | PyTorch | X | |
Faster RCNN | detector | TensorFlow | X | X |
Mask RCNN | detector | TensorFlow | X | X |
RetinaNet | detector | PyTorch | X | X |
Face Detection | detector | OpenVINO | X |
This is an online demo with the latest version of the annotation tool. Try it online without local installation. Only own or assigned tasks are visible to users.
Disabled features:
Limitations:
Prebuilt docker images for CVAT releases are available on Docker Hub:
The current REST API version is 2.0-alpha
. We focus on its improvement and therefore
REST API may be changed in the next release.
Code released under the MIT License.
This software uses LGPL licensed libraries from the FFmpeg project. The exact steps on how FFmpeg was configured and compiled can be found in the Dockerfile.
FFmpeg is an open source framework licensed under LGPL and GPL. See https://www.ffmpeg.org/legal.html. You are solely responsible for determining if your use of FFmpeg requires any additional licenses. Intel is not responsible for obtaining any such licenses, nor liable for any licensing fees due in connection with your use of FFmpeg.
CVAT usage related questions or unclear concepts can be posted in our Gitter chat for quick replies from contributors and other users.
However, if you have a feature request or a bug report that can reproduced, feel free to open an issue (with steps to reproduce the bug if it's a bug report) on GitHub* issues.
If you are not sure or just want to browse other users common questions, Gitter chat is the way to go.
Other ways to ask questions and get our support:
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。