Source code:
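import os

big_shuzu = []
# Under Python 2 this literal is a byte string; calling .encode('gbk') on it
# makes Python decode it as ASCII first, which raises the error shown below.
path = r"C:\Users\ME\Desktop\Python project\pachong\scrapy\baichuan4\证券时报网".encode('gbk')
for root, dirs, files in os.walk(path):
    print(root)
    print(dirs)
    print(files)
    # Print the full path of every subdirectory.
    for dir in dirs:
        print(os.path.join(root, dir) + '\n')
    # Print the full path of every file.
    for fp in files:
        print(os.path.join(root, fp))
        if fp is None:
            print('None')
        else:
            print(os.path.join(root, fp))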
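The problem encountered is: 'ascii' codec can't decode byte 0xe8 in position 60: ordinal not in range(128). Position 60 is the first Chinese character in the path (证), whose UTF-8 encoding begins with the byte 0xe8.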
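First, it helps to be clear about the encodings involved:
Python 2 stores plain string literals as bytes (str) and keeps Unicode text in a separate type (unicode). Whenever it has to convert between the two implicitly, it uses ASCII as the default codec, so text containing Chinese characters has to be decoded explicitly, for example as UTF-8.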
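To make the failure concrete, here is a minimal sketch that reproduces the same decode error outside of os.walk. It assumes the script was saved as UTF-8 (which the 0xe8 byte in the error message suggests), and it simulates Python 2's implicit ASCII step with an explicit call so that it also runs on Python 3. The names path_text, raw, error and decoded are illustrative only.

```python
# Sketch only: reproduces the decode failure in a way that also runs on Python 3.
# Assumption: the script file was saved as UTF-8, so the path literal holds UTF-8 bytes.
path_text = u"C:\\Users\\ME\\Desktop\\Python project\\pachong\\scrapy\\baichuan4\\证券时报网"
raw = path_text.encode("utf-8")  # the bytes Python 2 would hold for the literal

error = None
try:
    # Python 2 performs this ASCII decode implicitly before str.encode('gbk').
    raw.decode("ascii")
except UnicodeDecodeError as exc:
    error = exc

print(error)

# Decoding with the codec the bytes were actually written in succeeds.
decoded = raw.decode("utf-8")
print(decoded == path_text)
```

The caught exception reports byte 0xe8 at offset 60, the same byte and position as in the error message above, while decoding the same bytes as UTF-8 round-trips cleanly.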
Solution:
Add the following snippet at the top of the file: