COCO数据集使用——COCOAPI配置--688IT编程网

COCO数据集使⽤——COCOAPI配置

1.下数据集，包括训练集，验证集，测试集，annotation等。

2. 下载新版API，地址。

3. 进⼊PythonAPI/路径⾥，进⾏配置，下⾯的配置过程分为两种情况。⼀是ubuntu系统，⼀是windows系统。

【⽤ubuntu配置】 ——⽐较推荐，坑⽐较少！

激活tensorflow环境，进⼊~/cocostuffapi/PythonAPI/路径下，输⼊ python install，如果不报错即配置成功，在python环境中import pycocotools试试看，如果不报错，说明安装成功。

报错可能有：cython版本过低，pycocotools要求cython版本⼤于0.27.3

解决⽅案：pip install Cython（注意C是⼤写），安装完后输⼊cython测试是否安装成功。

2018.10.10更新：

看到了⼏个关于coco数据集的处理教程，分享下链接～

1.pycocotools中包含了⼀个coco.py⽂件，是⼀个对coco数据json⽂件的解析⼯具（这也是我们前⾯要千⾟万苦安装它的原因），在程序开头这样调⽤： import COCO

coco.py中包含以下⼏个接⼝（）：

# decodeMask - Decode binary mask M encoded via run-length encoding.

# encodeMask - Encode binary mask M using run-length encoding.

迅搜百科

# getAnnIds - Get ann ids that satisfy given filter conditions.

# getCatIds - Get cat ids that satisfy given filter conditions.

# getImgIds - Get img ids that satisfy given filter conditions.

# loadAnns - Load anns with the specified ids.

# loadCats - Load cats with the specified ids.

# loadImgs - Load imgs with the specified ids.

# annToMask - Convert segmentation in an annotation to binary mask.

# showAnns - Display the specified annotations.

# loadRes - Load algorithm results and create API for accessing them.

# download - Download COCO images server.

# Throughout the API "ann"=annotation,"cat"=category, and "img"=image.

2018.10.11更新：

coco数据集的使⽤：

图⽚名称file_name和图⽚id的对应关系：id号就是图⽚名称从⾮零位开始的部分，所以想要通过图⽚名称读取id，再读取信息可以这样做：

def filename_imgid(filename_list):

imgIds =[]

for i in range(len(filename_list)):

for j in range(12):

if(filename_list[i][j]!='0'):

imgIds.append(int(filename_list[i][j:12])) #将字符串转换成数字存储

break

return imgIds

【注意】：

break只跳出最内层循环

filename_list[i][j]是字符’0’，不是数字0

vs2017和vs2019获取了imgIds之后，需要通过loadImgs操作来提取信息，我们先来查看⼀下coco数据存储格式：

coco=COCO(annFile)

imgs =[(img_id, coco.imgs[img_id])for img_id in coco.imgs] #获取全部图⽚信息

print(imgs[0]) #输出的是第⼀个图⽚的信息

print(type(imgs[0])) #<class'tuple'>说明是以tuple存储的。

coco数据集的存储格式是这样的（也就是上⾯的print(imgs[0])输出）：

(532481, {'width': 640, 'file_name': '000000532481.jpg', 'coco_url': '/val2017/000000532481.jpg', 'height': 426, 'id': 532481, 'li cense': 3, 'date_captured': '2013-11-20 16:28:24', 'flickr_url': 'farm7.staticflickr/6048/5915494136_da3cfa7c5a_z.jpg'})

也就是说，必须通过img的id才能获取后⾯这个tuple的信息。现在已知了imgIds，要load的时候，需要使⽤命令：

# img = coco.loadImgs(imgIds)[0]

img = coco.loadImgs(imgIds) #因为我是⾃⼰构建的imgIds，本来就已经是⼀个数字构成的list了，所以就不需要[0]了，关于[0]的说明见⽂章最下⾯。

主程序部分：

c语言定义一个结构dataDir='/mask/data/coco'

dataType='val2017'

annFile='{}/annotations/instances_{}.json'.format(dataDir,dataType)

coco=COCO(annFile)

filename_list =_get_img_filename(dataDir,dataType)

imgIds =filename_imgid(filename_list)

print(imgIds)

img = coco.loadImgs(imgIds)

print(type(img[1]['file_name']))

for i in range(len(imgIds)):

I =io.imread('%s/%s/%s'%(dataDir, dataType, img[i]['file_name']))

plt.imshow(I)

annIds = AnnIds(imgIds=img[i]['id'])

anns = coco.loadAnns(annIds)

coco.showAnns(anns)

plt.show()

对[0]的说明：

imgIds =[1296,1490,1000,1353,872,1425,885,1503] #⾃⼰的图⽚id

img = coco.loadImgs(imgIds[np.random.randint(0,len(imgIds))])

print(img)

img2 = coco.loadImgs(imgIds[np.random.randint(0,len(imgIds))])[0]

print(img2)

#img:

[{'license':4,'file_name':'000000001000.jpg','coco_url':'/val2017/000000001000.jpg','height':480,'width':640,'id':1000,'d ate_captured':'2013-11-21 05:13:59','flickr_url':'farm5.staticflickr/4115/4906536419_6113bd7de4_z.jpg'}]

ostrich读音浊化吗#img2:

{'license':2,'file_name':'000000001503.jpg','coco_url':'/val2017/000000001503.jpg','height':240,'width':320,'id':1503,'d

ate_captured':'2013-11-22 17:22:02','flickr_url':'farm1.staticflickr/4/4589204_0d42f46fe6_z.jpg'}

由此可以看出，img读到的是⼀个list，要对img[0]才是dict，也就是img2 = img[0]~~

scaler是什么意思下⼀部分是⽤提取的图⽚来构建tfrecords⽤来训练，之后在更。

更新：

在mask rcnn的download_and_convert_coco.py基础上，加⼊了我⾃⼰构造的两个函数，通过图⽚名字来获取图⽚id，然后load，在函数_add_to_tfrecord()中加了以下⼏句话：

filename_list =_get_img_filename(image_dir, split_name)

imgIds =filename_imgid(filename_list)

imgs =[(img_id, coco.imgs[img_id])for img_id in coco.imgs if img_id in imgIds] #最重要的是这句

这两个⾃⼰定义的函数如下：

def _get_img_filename(image_dir,split_name):python解析json文件

filename_list = os.listdir(os.path.join(image_dir, split_name))

return filename_list

def filename_imgid(filename_list):

imgIds =[]

for i in range(len(filename_list)):

for j in range(12):

if(filename_list[i][j]!='0'):

imgIds.append(int(filename_list[i][j:12])) # 将字符串转换成数字存储

break

return imgIds

688IT编程网

COCO数据集使用——COCOAPI配置

发表评论

推荐文章

随机森林算法介绍及R语言实现

基于随机森林优化的神经网络算法在冬小麦产量预测中的应用研究_百度文 ...

基于正则化贪心森林算法的情感分析方法研究

随机森林算法和grandientboosting算法

基于随机森林的图像分类算法研究

热门文章

随机森林算法的改进方法

基于随机森林算法的风险预警模型研究

Python中的随机森林算法详解

随机森林发展历史

如何使用随机森林进行时间序列数据模式识别(八)

随机森林回归模型原理

如何使用随机森林进行时间序列数据模式识别(六)

如何使用随机森林进行时间序列数据预测(四)

如何使用随机森林进行异常检测(六)

随机森林算法和grandientboosting算法 -回复

随机森林方法总结全面

随机森林算法原理和步骤

随机森林的原理

随机森林重要性

随机森林算法

机器学习中随机森林的原理

随机森林算法原理

使用计算机视觉技术进行动物识别的技巧

基于crf命名实体识别实验总结

transformer预测模型训练方法

最新文章

随机森林算法介绍及R语言实现

基于随机森林优化的神经网络算法在冬小麦产量预测中的应用研究_百度文 ...

基于正则化贪心森林算法的情感分析方法研究

随机森林算法和grandientboosting算法

基于随机森林的图像分类算法研究

随机森林结合直接正交信号校正的模型传递方法

标签列表

688IT编程网

COCO数据集使用——COCOAPI配置

发表评论

推荐文章

随机森林算法介绍及R语言实现

基于随机森林优化的神经网络算法在冬小麦产量预测中的应用研究_百度文 ...

基于正则化贪心森林算法的情感分析方法研究

随机森林算法和grandientboosting算法

基于随机森林的图像分类算法研究

热门文章

随机森林算法的改进方法

基于随机森林算法的风险预警模型研究

Python中的随机森林算法详解

随机森林发展历史

如何使用随机森林进行时间序列数据模式识别(八)

随机森林回归模型原理

如何使用随机森林进行时间序列数据模式识别(六)

如何使用随机森林进行时间序列数据预测(四)

如何使用随机森林进行异常检测(六)

随机森林算法和grandientboosting算法 -回复

随机森林方法总结全面

随机森林算法原理和步骤

随机森林的原理

随机森林 重要性

随机森林算法

机器学习中随机森林的原理

随机森林算法原理

使用计算机视觉技术进行动物识别的技巧

基于crf命名实体识别实验总结

transformer预测模型训练方法

最新文章

随机森林算法介绍及R语言实现

基于随机森林优化的神经网络算法在冬小麦产量预测中的应用研究_百度文 ...

基于正则化贪心森林算法的情感分析方法研究

随机森林算法和grandientboosting算法

基于随机森林的图像分类算法研究

随机森林结合直接正交信号校正的模型传递方法

标签列表

随机森林重要性