Handling character encoding in Python: replacing '\u2019' with '

Latest recommended article published 2023-05-10 17:20:22

weixin_39610785

Article tags: handling character encoding in Python

I have tried numerous ways to encode this string so that the end result is "BACK RUSHIN'", where the most important character is the trailing right apostrophe (U+2019 in the source data).

I would like a way to get this result using Python's built-in functions, without having to treat a normal string and a unicode string differently.

This was the code I was using to retrieve the string: str(unicode(etree.tostring(root.xpath('path')[0],method='text', encoding='utf-8'),errors='ignore')).strip()

The result was: 'BACK RUSHIN'. The apostrophe is missing.
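A likely reason the apostrophe disappears: `etree.tostring(..., encoding='utf-8')` returns UTF-8 bytes, and Python 2's `unicode(..., errors='ignore')` with no encoding argument decodes them with the default codec (ASCII unless the interpreter was reconfigured), silently dropping the three bytes that encode U+2019. The sketch below reproduces this in Python 3; the variable names are illustrative and it is an assumption that the default encoding was ASCII.

```python
# Python 3 sketch of what the Python 2 snippet most likely did.
text = "BACK RUSHIN\u2019"

# etree.tostring(..., encoding='utf-8') hands back UTF-8 encoded bytes.
utf8_bytes = text.encode("utf-8")

# Decoding those bytes as ASCII while ignoring errors drops every
# non-ASCII byte, so the apostrophe vanishes without any warning.
ascii_decoded = utf8_bytes.decode("ascii", errors="ignore")

# Decoding with the codec the bytes were written in keeps the character.
utf8_decoded = utf8_bytes.decode("utf-8")

print(repr(utf8_bytes))     # b'BACK RUSHIN\xe2\x80\x99'
print(repr(ascii_decoded))  # 'BACK RUSHIN'
print(repr(utf8_decoded))   # 'BACK RUSHIN’'
```

Decoding with the matching codec first, and only then deciding how to map U+2019 to ASCII, avoids losing the character before it can be handled.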

Another way was: root.xpath('path/text()')

And that result was: u'BACK RUSHIN\u2019' in Python.

Lastly if I try: u'BACK RUSHIN\u2019'.encode('ascii', 'replace')

The result is: 'BACK RUSHIN?'
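The `?` comes from the `'replace'` error handler, which substitutes a fixed placeholder for anything the target codec cannot represent. For comparison, here is a short Python 3 sketch of the other built-in error handlers applied to the same string; none of them yields a plain apostrophe, which is why a transliteration step is needed.

```python
# Python 3 sketch: built-in error handlers for str.encode().
text = "BACK RUSHIN\u2019"

replaced = text.encode("ascii", "replace")          # placeholder '?'
ignored = text.encode("ascii", "ignore")            # character dropped
escaped = text.encode("ascii", "backslashreplace")  # Python escape sequence
xml_ref = text.encode("ascii", "xmlcharrefreplace") # XML character reference

for name, value in [("replace", replaced), ("ignore", ignored),
                    ("backslashreplace", escaped),
                    ("xmlcharrefreplace", xml_ref)]:
    print(name, value)
```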

Please, no replace functions; I would like to make use of Python's codec libraries.

Also, printing the string is not an option, because it is being held in a variable.

Thanks

Solution:

>>> import unidecode
>>> unidecode.unidecode(u'BACK RUSHIN\u2019')
"BACK RUSHIN'"
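`unidecode` is a third-party package, so it has to be installed separately. If that is not an option, one standard-library approach in the spirit of the question is a custom codec error handler registered with `codecs.register_error`. The following is a minimal Python 3 sketch, not the answer's method: the handler name `ascii_fallback` and the `ASCII_FALLBACKS` table are hypothetical and cover only a few punctuation characters.

```python
import codecs

# Hypothetical lookup table: extend it with whatever characters the data needs.
ASCII_FALLBACKS = {
    "\u2018": "'",   # left single quotation mark
    "\u2019": "'",   # right single quotation mark
    "\u201c": '"',   # left double quotation mark
    "\u201d": '"',   # right double quotation mark
}

def ascii_fallback(error):
    """Error handler: swap unencodable characters for ASCII look-alikes."""
    if not isinstance(error, UnicodeEncodeError):
        raise error
    bad = error.object[error.start:error.end]
    # Characters missing from the table fall back to '?', like 'replace'.
    replacement = "".join(ASCII_FALLBACKS.get(ch, "?") for ch in bad)
    # Return the replacement and the position where encoding should resume.
    return replacement, error.end

codecs.register_error("ascii_fallback", ascii_fallback)

encoded = "BACK RUSHIN\u2019".encode("ascii", "ascii_fallback")
result = encoded.decode("ascii")
print(repr(result))  # "BACK RUSHIN'"
```

Because the handler is registered by name, any later `encode('ascii', 'ascii_fallback')` call in the same process picks it up, and the result stays in a variable without needing to be printed.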
