【Python】根据文件名中的数字对列表进行排序

最新推荐文章于 2024-07-10 19:27:42 发布

果壳小旋子

最新推荐文章于 2024-07-10 19:27:42 发布

阅读量189

点赞数 3

文章标签： python 计算机视觉深度学习人工智能机器学习图像处理

本文链接：https://blog.csdn.net/m0_47867419/article/details/139529910

版权

我想从一个路径读取数据集：

from os import listdir
import re
dir = listdir('F:/dataset/TNO_test/ir')
print(dir)

结果是：

['1.bmp', '10.bmp', '11.bmp', '12.bmp', '13.bmp', '14.bmp', '15.bmp', '16.bmp', '17.bmp', '18.bmp', '19.bmp', '2.bmp', '20.bmp', '3.bmp', '4.bmp', '5.bmp', '6.bmp', '7.bmp', '8.bmp', '9.bmp']

我希望图片能够以“1.bmp、2.bmp、3.bmp…”的顺序排列，所以调用了列表的sort方法：

dir.sort()
print(dir)

结果却还是：

['1.bmp', '10.bmp', '11.bmp', '12.bmp', '13.bmp', '14.bmp', '15.bmp', '16.bmp', '17.bmp', '18.bmp', '19.bmp', '2.bmp', '20.bmp', '3.bmp', '4.bmp', '5.bmp', '6.bmp', '7.bmp', '8.bmp', '9.bmp']

原因在于Python的字符串排序是逐字符比较ASCII值，比如为什么‘10.bmp’会排在‘2.bmp’前面？因为从第一个字符来看，‘1’是小于‘2’的。解决的办法就是将数字提取出来，按照数值大小排序：

dir.sort(key=lambda x: int(re.findall(r'\d+', x)[0]))
print(dir)

找到文件名中所有连续的数字串，并返回第一个数字片段，转为整数类型

['1.bmp', '2.bmp', '3.bmp', '4.bmp', '5.bmp', '6.bmp', '7.bmp', '8.bmp', '9.bmp', '10.bmp', '11.bmp', '12.bmp', '13.bmp', '14.bmp', '15.bmp', '16.bmp', '17.bmp', '18.bmp', '19.bmp', '20.bmp']