
gtyelv / CV1

_aux_get_sample_list.py 1.33 KB
gtyelv committed on 2021-09-04 18:24 . first commit
#----------------------------------------#
# Generate the sample list files for the cats-vs-dogs dataset
#----------------------------------------#
import os
import random
'''
1. Set the sample directory images_dir
2. Run this file
'''
#------------ Control Panel -------------#
random.seed(0)
images_dir = 'E:/【AI】/Datasets/CV_ds/dogs_vs_cats/train/'
train_list_filepath = './model_data/train.txt'
valid_list_filepath = './model_data/valid.txt'
test_list_filepath = './model_data/test.txt'
train_percent = 8/10
valid_percent = 1/10
test_percent = 1/10
#----------------------------------------#
# 1. Read the sample file names
samples = os.listdir(images_dir)
random.shuffle(samples)
# 2. Split the dataset into train / valid / test
num_samples = len(samples)
s1 = int(train_percent * num_samples)
s2 = int(valid_percent * num_samples)
train_samples = samples[ :s1 ]
valid_samples = samples[s1 :s1+s2]
test_samples = samples[s1+s2: ]
# 3. Write the sample list files (one file name per line)
with open(train_list_filepath, 'w') as f:
    f.writelines([c+'\n' for c in train_samples])
with open(valid_list_filepath, 'w') as f:
    f.writelines([c+'\n' for c in valid_samples])
with open(test_list_filepath, 'w') as f:
    f.writelines([c+'\n' for c in test_samples])
# 4. Print the dataset sizes
print('Size of total, train, valid, test: {}, {}, {}, {}'.format(num_samples, s1, s2, num_samples-s2-s1))
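
The shuffle-and-slice split used by the script can also be sketched as a reusable function. This is a minimal sketch and not part of the original file: the name `split_samples`, its parameters, and the `img_N.jpg` names in the usage lines are hypothetical. It differs from the script in two ways, both assumptions on my part: it sorts the names before shuffling, because `os.listdir` returns entries in arbitrary order and a fixed seed alone may not reproduce the same split on another machine, and it uses a private `random.Random` instance instead of the global seed.

```python
import random


def split_samples(samples, train_percent=0.8, valid_percent=0.1, seed=0):
    """Shuffle a copy of `samples` and slice it into train/valid/test lists.

    Hypothetical helper: the test share is whatever remains after the
    train and valid slices, as in the script's samples[s1+s2:] slice.
    """
    if train_percent < 0 or valid_percent < 0 or train_percent + valid_percent > 1:
        raise ValueError('train_percent and valid_percent must be >= 0 and sum to at most 1')
    # Sort first so the result depends only on the names and the seed,
    # not on the order in which the directory listing was returned.
    shuffled = sorted(samples)
    random.Random(seed).shuffle(shuffled)
    s1 = int(train_percent * len(shuffled))
    s2 = int(valid_percent * len(shuffled))
    return shuffled[:s1], shuffled[s1:s1 + s2], shuffled[s1 + s2:]


# Usage with made-up file names standing in for os.listdir(images_dir)
names = ['img_{}.jpg'.format(i) for i in range(100)]
train, valid, test = split_samples(names)
print('Size of train, valid, test: {}, {}, {}'.format(len(train), len(valid), len(test)))
```

Because the three lists are slices of one shuffled list, they never overlap and together cover every input name.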