加入 Gitee

与超过 1200万开发者一起发现、参与优秀开源项目，私有仓库也完全免费：）

免费加入

克隆/下载

textocr.md 906 Bytes

一键复制编辑原始数据按行查看历史

提交于 2023-07-06 17:50 . Fix docs links (#477)

TextOCR 数据集

数据集下载

TextOCR数据集官网 | 下载链接

请从上述网站下载数据并解压缩文件。解压文件后，数据结构应该是这样的：

TextOCR
  |--- train_val_images
  |    |--- <image_name>.jpg
  |    |--- <image_name>.jpg
  |    |--- ...
  |--- TextOCR_0.1_train.json
  |--- TextOCR_0.1_val.json

数据准备

检测任务

要准备用于文本检测的数据，您可以运行以下命令：

python tools/dataset_converters/convert.py \
    --dataset_name textocr --task det \
    --image_dir path/to/TextOCR/train_val_images/ \
    --label_dir path/to/TextOCR/TextOCR_0.1_train.json \
    --output_path path/to/TextOCR/det_gt.txt

运行后，在文件夹TextOCR/下会生成注释文件det_gt.txt。

返回dataset converters

https://gitee.com/mindspore-lab/mindocr.git

git@gitee.com:mindspore-lab/mindocr.git

mindspore-lab

mindocr

main

MindSpore Lab / mindocr

TextOCR 数据集

数据集下载

数据准备

检测任务

简介

发行版

贡献者

近期动态

MindSpore Lab / mindocr .gitee-modal { width: 500px !important; }

TextOCR 数据集

数据集下载

数据准备

检测任务

简介

发行版

开源评估指数源自 OSS-Compass 评估体系，评估体系围绕以下三个维度对项目展开评估：

贡献者

近期动态

搜索帮助

MindSpore Lab / mindocr