Python基础二

最新推荐文章于 2021-03-26 00:20:59 发布

wh0am1

最新推荐文章于 2021-03-26 00:20:59 发布

阅读量229

点赞数

分类专栏： Python 文章标签： python 函数 unicode 编码 utf-8

本文链接：https://blog.csdn.net/wh0am1/article/details/71082003

版权

Python 专栏收录该内容

6 篇文章 0 订阅

订阅专栏

Python的字符串

字符编码问题请点击

字母与对应的数字转换函数ord()和chr()

>>> ord('A')
65
>>> chr(65)
'A'

字符编码与相关函数

以Unicode表示的字符串用u’…’表示，比如：

>>> print u'开车'
开车
>>> u'开车'
u'\uf00\u8f66'

把Unicode转换成utf-8函数encode(‘utf-8’)以及反转函数decode(‘utf-8’)等相关函数len()：
Unicode转换成utf-8函数用法如下：

>>> u'ABC'.encode('utf-8')
'ABC'
>>> u'中国'.encode('utf-8')
'\xe4\xb8\xad\xe5\x9b\xbd'

Unicode转换成utf-8反转函数用法如下：

>>> u'中国'.encode('utf-8')
'\xe4\xb8\xad\xe5\x9b\xbd'
>>> len(u'中国')
2
>>> len( u'中国'.encode('utf-8'))
6
>>> u'中国'.encode('utf-8').decode('utf-8')
u'\u4e2d\u56fd'
>>> printf u'\u4e2d\u56fd'
  File "<stdin>", line 1
    printf u'\u4e2d\u56fd'
                         ^
SyntaxError: invalid syntax
>>> print u'\u4e2d\u56fd'
中国

计算字符串长度函数：

>>> len(u'中国')
2
>>> len(u'abc')
3
>>> len('abc')
3
>>> a = 'Linux'
>>> len(a)
5
>>> len('\xe4\xb8\xad\xe5\x9b\xbd')
6

以下两行代码最好加在每个py文件头。

#!/usr/bin/env python
# -*- coding: utf-8 -*-

格式化

Python中格式化与C语言函数printf()比较相似。
举个栗子：

>>> 'HelloWorld %s %d' %('Linux', 100)
'HelloWorld Linux 100'
>>> 'your name is %s  your age is %d' %('Wh0am1', 21)
'your name is Wh0am1  your age is 21'
>>> 'your name is %s' %'Wh0am1'
'your name is Wh0am1'
>>>