python3 open csv encoding error

python3.4 & Django1.9

 

with open(r'C:\test\test.csv', newline='', encoding="utf-8") as f:

    data_reader = csv.reader(f)

    for row in data_reader:

        print(row)

 

test.csv 是encoding UTF-8 without BOM类型(Notepad++查看)

Error info:'utf-8' codec can't decode byte 0xa0 in position 1396: invalid start byte

 

修改如下:

with open(r'C:\test\test.csv', newline='', encoding="utf-8", errors="ignore") as f:

 

参考文档:

Python open CSV file with supposedly mixed encodings

open()

open(filemode='r'buffering=-1encoding=Noneerrors=Nonenewline=Noneclosefd=Trueopener=None)

 

errors is an optional string that specifies how encoding and decoding errors are to be handled–this cannot be used in binary mode. A variety of standard error handlers are available, though any error handling name that has been registered with codecs.register_error() is also valid. The standard names are:

  • 'strict' to raise a ValueError exception if there is an encoding error. The default value of None has the same effect.
  • 'ignore' ignores errors. Note that ignoring encoding errors can lead to data loss.
  • 'replace' causes a replacement marker (such as '?') to be inserted where there is malformed data.
  • 'surrogateescape' will represent any incorrect bytes as code points in the Unicode Private Use Area ranging from U+DC80 to U+DCFF. These private code points will then be turned back into the same bytes when thesurrogateescape error handler is used when writing data. This is useful for processing files in an unknown encoding.
  • 'xmlcharrefreplace' is only supported when writing to a file. Characters not supported by the encoding are replaced with the appropriate XML character reference &#nnn;.
  • 'backslashreplace' (also only supported when writing) replaces unsupported characters with Python’s backslashed escape sequences.

 

 

转载于:https://www.cnblogs.com/cyn413/p/7660241.html

要在Python3中读取CSV文件,你可以使用csv模块。以下是一个示例代码: ```python import csv filename = 'lagou.csv' data = \[\] try: with open(filename, encoding='utf-8') as f: reader = csv.reader(f) header = next(reader) data = \[row for row in reader\] except csv.Error as e: print("Error reading CSV file at line %s:%s" % (reader.line_num, e)) sys.exit(-1) if header: print(header) print("===============") for datarow in data: print(datarow) ``` 这段代码首先打开CSV文件,然后使用csv.reader()函数创建一个reader对象。接下来,我们可以使用for循环遍历reader对象,逐行读取CSV文件中的数据并打印出来。在这个例子中,我们还打印了CSV文件的标题行(header)和数据行(data)。请注意,你需要将文件名替换为你实际使用的CSV文件的名称。\[1\] #### 引用[.reference_title] - *1* [Python3读取CSV数据](https://blog.csdn.net/weixin_45890771/article/details/120382819)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v91^insert_down28v1,239^v3^insert_chatgpt"}} ] [.reference_item] - *2* [python3csv的读写](https://blog.csdn.net/a13568hki/article/details/121977445)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v91^insert_down28v1,239^v3^insert_chatgpt"}} ] [.reference_item] - *3* [python3读写csv文件](https://blog.csdn.net/weixin_30883311/article/details/99869412)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v91^insert_down28v1,239^v3^insert_chatgpt"}} ] [.reference_item] [ .reference_list ]
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值