Python 对于文件上的一些操作

~拾捌~

已于 2023-10-20 09:34:50 修改

阅读量48

点赞数

分类专栏： python 文章标签： python 开发语言

于 2023-06-19 09:19:21 首次发布

本文链接：https://blog.csdn.net/lsfeitianzhuzhuxia/article/details/131249824

版权

python 专栏收录该内容

3 篇文章 0 订阅

订阅专栏

文章目录

Python 获取文件夹内所有文件名
Python 将制定文件复制到新的文件夹中
Python 文件重命名
Python 将数据保存成txt/csv格式
Python 随机挑选一定数量数据保存
Python 逐行读取txt文件内容并保存
Python 在txt文件中逐行写入列表数据
Python 切分数据集

Python 获取文件夹内所有文件名

RDT_list = os.listdir(RDT_path)

path = './VOCdevkit/VOC2007/JPEGImages/*'
xml_paths = glob.glob(path)

Python 将制定文件复制到新的文件夹中

import os
import shutil

RDT_path = 'D:\\test\\RDT_problem'
RDT_list = os.listdir(RDT_path)
RD_path = 'D:\\test\\evaluate_RD_test'
for i in RDT_list:
        found_name = os.path.join(RD_path, i)
        new_folder =r'D:\test\RD_problem'
        shutil.copy(found_name, os.path.join(new_folder,i))
        print ("copy {} -> {}".format(found_name, os.path.join(new_folder, i)))

Python 文件重命名

old_dir = RDT_path + '\\' + i
new_dir = RDT_path + '\\' + file_name + '.jpg'
os.rename(old_dir, new_dir)

Python 将数据保存成txt/csv格式

def text_save(filename, data):  # filename为写入文件的路径，data为要写入数据列表.
    file = open(filename, 'a')
    for i in range(len(data)):
        s = str(data[i]).replace('[', '').replace(']', '')  # 去除[],这两行按数据不同，可以选择
        s = s.replace("'", '').replace(',', '') + '\n'  # 去除单引号，逗号，每行末尾追加换行符
        file.write(s)
    file.close()
 print("保存文件成功")

Python 随机挑选一定数量数据保存

path = "/data/lshib/MM-DistillNet/dataset/train_all.txt"
ids = get_id_list(path)
sample_num = 20000 # 随机挑选数据的数量
text_save('plt_test_all.txt', random.sample(ids, sample_num))

Python 逐行读取txt文件内容并保存

with open("test.txt", "r", encoding='utf-8') as f:
    for line in f.readlines():
        line = line.strip('\n')  # 去掉列表中每一个元素的换行符
        name = line.split('-')[0]
        List.append(name)

Python 在txt文件中逐行写入列表数据

with open("cc.txt", "w", encoding='utf-8') as f:
    for file in List:
        f.write(file +'\n')
    f.close()

Python 切分数据集

def qieshujuji(xml_paths):
    '''切分数据集'''
    total_num = len(xml_paths)
    train_num = int(0.7 * total_num)
    val_num = total_num - train_num
    index = list(range(0, total_num))
    random.shuffle(index)

    train_txt = "train.txt"
    with open(train_txt, 'w', encoding="utf-8-sig") as f1:
        f1.write("")
    with open(train_txt, 'a', encoding="utf-8-sig") as f1:
        f1.write("text_a\tlabel")
        f1.write("\n")
        for i in range(0, train_num):
            f1.write(xml_paths[index[i]])
            f1.write("\n")

    test_txt = "test.txt"
    # 先清空文件内容
    with open(test_txt, 'w', encoding="utf-8-sig") as f3:
        f3.write("")
    with open(test_txt, 'a', encoding="utf-8-sig") as f3:

        f3.write("text_a\tlabel")
        f3.write("\n")
        for i in range(train_num, total_num):
            f3.write(xml_paths[index[i]])
            f3.write("\n")