Python图像读写方法对比

作者：农大军乐团_697 | 2021-10-20 23:12

这篇文章主要介绍了Python图像读写方法对比的相关资料，帮助大家更好的理解和使用python，感兴趣的朋友可以了解下

1 实验标准

　　因为训练使用的框架是Pytorch，因此读取的实验标准如下：

　　1、读取分辨率都为1920x1080的5张图片（png格式一张，jpg格式四张）并保存到数组。

　　2、将读取的数组转换为维度顺序为CxHxW的Pytorch张量，并保存到显存中（我使用GPU训练），其中三个通道的顺序为RGB。

　　3、记录各个方法在以上操作中所耗费的时间。因为png格式的图片大小差不多是质量有微小差异的jpg格式的10倍，所以数据集通常不会用png来保存，就不比较这两种格式的读取时间差异了。

　　写入的实验标准如下：

　　1、将5张1920x1080的5张图像对应的Pytorch张量转换为对应方法可使用的数据类型数组。

　　2、以jpg格式保存五张图片。

　　3、记录各个方法保存图片所耗费的时间。

2 实验情况

2.1 cv2

　　因为有GPU，所以cv2读取图片有两种方式：

　　1、先把图片都读取为一个numpy数组，再转换成保存在GPU中的pytorch张量。

　　2、初始化一个保存在GPU中的pytorch张量，然后将每张图直接复制进这个张量中。

　　第一种方式实验代码如下：

import os, torch
import cv2 as cv 
import numpy as np 
from time import time 
 
read_path = 'D:test'
write_path = 'D:test\\write\\'
 
# cv2读取 1
start_t = time()
imgs = np.zeros([5, 1080, 1920, 3])
for img, i in zip(os.listdir(read_path), range(5)): 
 img = cv.imread(filename=os.path.join(read_path, img))
 imgs[i] = img 
imgs = torch.tensor(imgs).to('cuda')[...,[2,1,0]].permute([0,3,1,2])/255 
print('cv2 读取时间1：', time() - start_t) 
# cv2保存
start_t = time()
imgs = (imgs.permute([0,2,3,1])[...,[2,1,0]]*255).cpu().numpy()
for i in range(imgs.shape[0]): 
 cv.imwrite(write_path + str(i) + '.jpg', imgs[i])
print('cv2 保存时间：', time() - start_t)

　实验结果：

cv2 读取时间1： 0.39693760871887207
cv2 保存时间： 0.3560612201690674

第二种方式实验代码如下：

import os, torch
import cv2 as cv 
import numpy as np 
from time import time 
 
read_path = 'D:test'
write_path = 'D:test\\write\\'
 
 
# cv2读取 2
start_t = time()
imgs = torch.zeros([5, 1080, 1920, 3], device='cuda')
for img, i in zip(os.listdir(read_path), range(5)): 
 img = torch.tensor(cv.imread(filename=os.path.join(read_path, img)), device='cuda')
 imgs[i] = img  
imgs = imgs[...,[2,1,0]].permute([0,3,1,2])/255 
print('cv2 读取时间2：', time() - start_t) 
# cv2保存
start_t = time()
imgs = (imgs.permute([0,2,3,1])[...,[2,1,0]]*255).cpu().numpy()
for i in range(imgs.shape[0]): 
 cv.imwrite(write_path + str(i) + '.jpg', imgs[i])
print('cv2 保存时间：', time() - start_t)

　　实验结果：

cv2 读取时间2： 0.23636841773986816
cv2 保存时间： 0.3066873550415039

2.2 matplotlib

　　同样两种读取方式，第一种代码如下：

import os, torch 
import numpy as np
import matplotlib.pyplot as plt 
from time import time 
 
read_path = 'D:test'
write_path = 'D:test\\write\\'
 
# matplotlib 读取 1
start_t = time()
imgs = np.zeros([5, 1080, 1920, 3])
for img, i in zip(os.listdir(read_path), range(5)): 
 img = plt.imread(os.path.join(read_path, img)) 
 imgs[i] = img  
imgs = torch.tensor(imgs).to('cuda').permute([0,3,1,2])/255 
print('matplotlib 读取时间1：', time() - start_t) 
# matplotlib 保存
start_t = time()
imgs = (imgs.permute([0,2,3,1])).cpu().numpy()
for i in range(imgs.shape[0]): 
 plt.imsave(write_path + str(i) + '.jpg', imgs[i])
print('matplotlib 保存时间：', time() - start_t)

　　实验结果：

matplotlib 读取时间1： 0.45380306243896484
matplotlib 保存时间： 0.768944263458252

　　第二种方式实验代码：

import os, torch 
import numpy as np
import matplotlib.pyplot as plt 
from time import time 
 
read_path = 'D:test'
write_path = 'D:test\\write\\'
 
# matplotlib 读取 2
start_t = time()
imgs = torch.zeros([5, 1080, 1920, 3], device='cuda')
for img, i in zip(os.listdir(read_path), range(5)): 
 img = torch.tensor(plt.imread(os.path.join(read_path, img)), device='cuda')
 imgs[i] = img  
imgs = imgs.permute([0,3,1,2])/255 
print('matplotlib 读取时间2：', time() - start_t) 
# matplotlib 保存
start_t = time()
imgs = (imgs.permute([0,2,3,1])).cpu().numpy()
for i in range(imgs.shape[0]): 
 plt.imsave(write_path + str(i) + '.jpg', imgs[i])
print('matplotlib 保存时间：', time() - start_t)

　　实验结果：

matplotlib 读取时间2： 0.2044532299041748
matplotlib 保存时间： 0.4737534523010254

　　需要注意的是，matplotlib读取png格式图片获取的数组的数值是在[0,1][0,1]范围内的浮点数，而jpg格式图片却是在[0,255][0,255]范围内的整数。所以如果数据集内图片格式不一致，要注意先转换为一致再读取，否则数据集的预处理就麻烦了。

2.3 PIL

　　PIL的读取与写入并不能直接使用pytorch张量或numpy数组，要先转换为Image类型，所以很麻烦，时间复杂度上肯定也是占下风的，就不实验了。

2.4 torchvision

　　torchvision提供了直接从pytorch张量保存图片的功能，和上面读取最快的matplotlib的方法结合，代码如下：

import os, torch 
import matplotlib.pyplot as plt 
from time import time 
from torchvision import utils 

read_path = 'D:test'
write_path = 'D:test\\write\\'
 
# matplotlib 读取 2
start_t = time()
imgs = torch.zeros([5, 1080, 1920, 3], device='cuda')
for img, i in zip(os.listdir(read_path), range(5)): 
 img = torch.tensor(plt.imread(os.path.join(read_path, img)), device='cuda')
 imgs[i] = img  
imgs = imgs.permute([0,3,1,2])/255 
print('matplotlib 读取时间2：', time() - start_t) 
# torchvision 保存
start_t = time() 
for i in range(imgs.shape[0]):  
 utils.save_image(imgs[i], write_path + str(i) + '.jpg')
print('torchvision 保存时间：', time() - start_t)

　　实验结果：

matplotlib 读取时间2： 0.15358829498291016
torchvision 保存时间： 0.14760661125183105

　　可以看出这两个是最快的读写方法。另外，要让图片的读写尽量不影响训练进程，我们还可以让这两个过程与训练并行。另外，utils.save_image可以将多张图片拼接成一张来保存，具体使用方法如下：

utils.save_image(tensor = imgs,   # 要保存的多张图片张量 shape = [n, C, H, W]
         fp = 'test.jpg',  # 保存路径
         nrow = 5,     # 多图拼接时，每行所占的图片数
         padding = 1,    # 多图拼接时，每张图之间的间距
         normalize = True, # 是否进行规范化，通常输出图像用tanh，所以要用规范化 
         range = (-1,1))  # 规范化的范围

以上就是Python图像读写方法对比的详细内容，更多关于python 图像读写的资料请关注其它相关文章！

推荐阅读

程序员
在MongoDB中查找具有字符串ID数组的文档

如何解决《在MongoDB中查找具有字符串ID数组的文档》经验，为你挑选了1个好方法。 ... [详细]
程序员
使用Jackson序列化UUID集

如何解决《使用Jackson序列化UUID集》经验，为你挑选了1个好方法。 ... [详细]
程序员
错误:[$ compile:nonassign]与指令'uibTab'一起使用的表达式是不可赋值的

如何解决《错误:[$compile:nonassign]与指令'uibTab'一起使用的表达式是不可赋值的》经验，为你挑选了1个好方法。 ... [详细]
程序员
GCC没有工作,但G ++确实如此

如何解决《GCC没有工作,但G++确实如此》经验，为你挑选了0个好方法。 ... [详细]
程序员
在TensorFlow中使用矩阵乘法函数

如何解决《在TensorFlow中使用矩阵乘法函数》经验，为你挑选了1个好方法。 ... [详细]
程序员
Pycharm Community Edition:"无法显示帧变量"

如何解决《PycharmCommunityEdition:"无法显示帧变量"》经验，为你挑选了0个好方法。 ... [详细]
程序员
搜索2d阵列中最大的空间

如何解决《搜索2d阵列中最大的空间》经验，为你挑选了1个好方法。 ... [详细]
程序员
css3列和溢出隐藏

如何解决《css3列和溢出隐藏》经验，为你挑选了0个好方法。 ... [详细]
程序员
水平UICollectionView单行布局

如何解决《水平UICollectionView单行布局》经验，为你挑选了0个好方法。 ... [详细]
程序员
SQL:选择具有相同单词的字符串

如何解决《SQL:选择具有相同单词的字符串》经验，为你挑选了1个好方法。 ... [详细]
程序员
使用Action Listener获取JButton的文本

如何解决《使用ActionListener获取JButton的文本》经验，为你挑选了1个好方法。 ... [详细]
程序员
将变量传递给嵌套的Handlebars模板/部分

如何解决《将变量传递给嵌套的Handlebars模板/部分》经验，为你挑选了1个好方法。 ... [详细]
程序员
SBT插件在非托管jar文件中

如何解决《SBT插件在非托管jar文件中》经验，为你挑选了1个好方法。 ... [详细]
程序员
设置onSeekBarChangeListener会导致null对象异常

如何解决《设置onSeekBarChangeListener会导致null对象异常》经验，为你挑选了1个好方法。 ... [详细]
程序员
如何在Lisp中格式化REPL输出的数字精度？

如何解决《如何在Lisp中格式化REPL输出的数字精度？》经验，为你挑选了1个好方法。 ... [详细]
程序员
在python中读取/写出字典到csv文件

如何解决《在python中读取/写出字典到csv文件》经验，为你挑选了2个好方法。 ... [详细]
程序员
Ngnix - FastCGI在stderr中发送:"PHP消息:PHP注意:未定义的变量

如何解决《Ngnix-FastCGI在stderr中发送:"PHP消息:PHP注意:未定义的变量》经验，为你挑选了1个好方法。 ... [详细]
程序员
用枚举编写JSON键

如何解决《用枚举编写JSON键》经验，为你挑选了1个好方法。 ... [详细]
程序员
有没有办法如何在"prestart"npm脚本中自动运行"nvm use"？

如何解决《有没有办法如何在"prestart"npm脚本中自动运行"nvmuse"？》经验，为你挑选了1个好方法。 ... [详细]
程序员
格式化perl正则表达式捕获组

如何解决《格式化perl正则表达式捕获组》经验，为你挑选了1个好方法。 ... [详细]

农大军乐团_697

这个屌丝很懒，什么也没留下！

关注作者

Tags | 热门标签

RankList | 热门文章