备份一个文本处理python代码

最新推荐文章于 2024-04-23 14:28:34 发布

pangdawa

最新推荐文章于 2024-04-23 14:28:34 发布

阅读量306

点赞数

分类专栏：个人 Python 文章标签： python 开发语言

本文链接：https://blog.csdn.net/haithink/article/details/127100385

版权

个人同时被 2 个专栏收录

12 篇文章 0 订阅

订阅专栏

Python

6 篇文章 0 订阅

订阅专栏

昨天一个朋友提了个需求，处理一个excel文档，固定第一列，第二列是第三列编号，使得同一行第三列和第一列内容相同，且对应调整第2列，还花了一个小时左右，碰到些奇葩问题，比如python重定向报错，windows上不知道怎么解决，懒得搜了，直接写文件。还有就是长数字被excel当成数字，怎么改格式也不行。查了下，输出的时候加了个“\t”就好了。

# -*- coding: utf-8 -*-
"""
Created on Wed Sep 28 17:37:22 2022

@author: Administrator
"""

#思路，先要记录第三列，每列作为一个key，value是一个list，即所有编号，位于第二列
# 输出： 读取第一列每一行，检查是否有对应key的存在，如果有，读取对应list，输出其中一个
# 并且从list删除， list为空，那么 key也删除
# 如果 没有key存在， 直接输出两个逗号


fname = './SJ120220928.csv'
#fname = './test.csv'

allDict = {}

allKey = []

with open(fname,'r+',encoding='utf-8') as f:
    for line in f.readlines(): 
        #print(line[:-1].split(','))
        words = line[:-1].split(',')
        #print('len:', len(words))
        #print(words)
        
        allKey.append(words[0])
        if len(words) > 2 and len(words[1]) > 0:
            allDict.setdefault(words[2],[])
            allDict[words[2]].append(words[1])

        
#print(len(allDict))        

DictIdx = {}

i = 0
with open('./res.csv', 'w+', encoding='utf-8') as f:
    for word in allKey:
        #print('i ', i)
        if allDict.get(word) != None:
            DictIdx.setdefault(word, 0)
            #print('word ', word)
            cnt = len(allDict[word])
            #print("size ", len(allDict[word]))
            if DictIdx[word] < cnt:
                #print('idx ', DictIdx[word])
                #print('now',allDict[word][DictIdx[word]])
                str1 = word+',' + allDict[word][DictIdx[word]] + '\t,' + word
                f.write(str1)
                print(str1)
                DictIdx[word] = DictIdx[word] + 1
            else:
                str1 = word + ",,"
                f.write(str1)
                print(str1)
        else:
            str1 = word +  ",,"
            f.write(str1)
            print(str1)
        i = i+1
        f.write("\n")