Python字符串类型

最新推荐文章于 2022-01-19 03:00:10 发布

卢同学.

最新推荐文章于 2022-01-19 03:00:10 发布

阅读量530

点赞数

分类专栏： Python 文章标签：字符串 python 正则表达式

本文链接：https://blog.csdn.net/weixin_44801980/article/details/108541669

版权

Python 专栏收录该内容

120 篇文章 4 订阅

订阅专栏

特点

	1、字符串一旦定义不允许修改，可以替换。字符串容器中的元素都是字符类型的。
	2、字符串是一个不可修改的序列，当我们对字符串进行修改的时候，实际上是生成了一个新的被修改后的对象，而原字符串没有发生修改。
	3、字符串开始和结尾使用的引号形式必须一致。
	4、当表示复杂的字符串时，还可以进行引号的嵌套。

定义

【说明】

1、字符串是一个有序的，不可修改的，元素以引号包围的序列。
2、字符串的定义：
'x'            #使用单引号，字符串内容必须在一行中。
"x"            #使用单引号，字符串内容必须在一行中。
'''x'''        #使用三单引号，字符串内容可以换行。
''''''x''''''  #使用三单引号，字符串内容可以换行。
"""xx'x'xx"x"xx""" #嵌套

【案例】

str1='abc'
str2="abc"
str3="""
白发催人老，
虚名误人深。
"""
str4="hello 'kr',come here!"
str5='hello "kr",come here!'
str6=""" aa'bb'cc"dd"ee """

循环遍历

str7 = "hello world!"
for i in range(0,len(str6)):
 	print(str6[i],end='')
	# 结果：hello world!

for i in range(0, len(str6)):
    print(str6[i], end='')
    # 结果：hello world!

转义字符

【说明】

1、python中字符串还支持转义字符，详见[转义字符]。
2、在字符串定界符引号前面加上r或R，那么该字符将原样输出，其中转义字符将不进行转义。
		
【案例】
# 转义，\n 换行符 ;\t 制表符,相当于tab键
print("学海无涯\x0a苦作舟") #\x0a换行符
# 学海无涯
# 苦作舟

print("学海无涯\n苦作舟") #\n 换行符
# 学海无涯
# 苦作舟

print("学海无涯\t苦作舟")  #\t 制表符,相当于tab键
# 学海无涯	苦作舟

print("学海无涯\\n苦作舟") #在\前面再加一个\,表示后面的\没有特殊含义
# 学海无涯\n苦作舟

print("学海无涯\\t苦作舟") #在\前面再加一个\,表示后面的\没有特殊含义
# 学海无涯\t苦作舟

print(r"学海无涯\n苦作舟") #在字符串的外面加一个r,表示后面的\没有特殊含义,不转义
# 学海无涯\n苦作舟

print("C:/Users/LKR>")   #在python中,/也可以表示路径
# C:/Users/LKR>

字符串的索引

【索引】
字符串的索引(index)-在python当中所有有序的序列都是由索引概念的，但是区别在于序列可不可以被修改。索引在我们初学的时候我们可以理解为字符串的下标，字符串里的每一个个体都被称作字符也是该字符串的一个元素。
字符串——“hello”
 h  e  l   l   o
 0  1  2   3   4   正索引
-5 -4 -3  -2  -1   负索引
	
【用法-1】取单个元素

# 字符串[索引值]——>对应索引值的元素
print("hello"[1])  # e

# 2、截取
# 字符串[start:end]——>得到对应范围的元素，该范围包含起始端，不包含结尾端,默认截取的方向是从左往右的。
print("hello"[1:3])	 # 输出：el

# 3、步长截取
# 字符串[start:end:step] 按照步长减一进行隔取
print("whileisok"[0:7:3])	    # 输出：wls

# 4、默认取法
# 字符串[start:end,step]——>这三个参数都有默认值: Start 0;End 结尾;Step 1
print("whileisok"[:7])	 # 输出：whileis
print("whileisok"[:-1])	 # 输出：whileiso
print("whileisok"[0:])	 # 输出：whileisok

# 5、反取
# 字符串[负数]——>从右往左取
print("whileisok"[-1])	     # 输出：k
print("whileisok"[-3])	     # 输出：s
print("whileisok"[::-1])	 # 输出：kosielihw

【用法-2】 通过表名+引索

# 1、打印下标索引为4的字符
str = "helloworld"
print(str[4])   # 结果：o

# 2、打印下标索引为2后面的字符
print(str[2:])  # 结果：llo world

# 3、打印下标索引为2前面的字符
print(str[:2])  # 结果：he

# 4、打印下标索引为2-5之间的字符
print(str[2:5]) # 结果：llo

# 5、打印str所有数据
print(str[:])   # 结果：hello world
print(str)      # 结果：hello world

# 6、输出字符串两次
print(str * 2)  # 结果：hello world hello world

# 7、输出连接的字符串
print(str + "," + "TEST")    # 结果：hello worldTEST

字符串的修饰

1、Center
	# 让字符串在指定的长度居中，如果不能居中左短右长，可以指定填充内容，默认以空格填充。
	str = "while"
	print(str.center(10))       # 让while在10个长度里居中
	print(str.center(10,"*"))   # 让while在10个长度里居中，空白的地方以*填充
	print(len(str))             # 打印"while"本身的长度
	print(len(str.center(10)))  # 打印"while".center(10)的长度
	# 输出:
	#   while
	# **while***
	# 5
	# 10
	
2、ljust
	# 让字符串在指定的长度左齐，如果不能居中左短右长，可以指定填充内 容，默认以空格填充。
	print("while".ljust(10))
	print("while".ljust(10,"*"))
	# 输出:
	# while
	# while*****
	
3、rjust
	# 让字符串在指定的长度右齐，如果不能居中左短右长，可以指定填充内容，默认以空格填充。
	print("while".rjust(10))
	print("while".rjust(10,"*"))
	# 输出:
	#      while
	# *****while
	
4、zfill
	# 将字符串填充到指定的长度，不足地方用0从左开始补充。
	print("while".zfill(10))
	# 输出:
	# 00000while
	
5、format
	# 按照顺序，将后面的参数传递给前面的大括号。
	print("{} is {} years old".format("tom",18))
	# 输出:
	# while is 18 years old
	
6、strip
	# 默认去除两边的空格，去除内容可以指定。
	print("   while hello   ".strip())
	print("***while*hello***".strip())
	print("***while*hello***".strip("*"))
	# 输出:
	# while hello
	# ***while*hello***
	# while*hello
	
7、rstrip
	# 默认去除右边的空格，去除内容可以指定。
	print("   while hello   ".rstrip())
	print("***while*hello***".rstrip())
	print("***while*hello***".rstrip("*"))
	# 输出:
	#    while hello
	# ***while*hello***
	# ***while*hello
	
8、lstrip
	# 默认去除左边的空格，去除内容可以指定。
	print("   while hello   ".lstrip())
	print("***while*hello***".lstrip())
	print("***while*hello***".lstrip("*"))
	# 输出:
	# while hello
	# ***while*hello***
	# while*hello***

字符串的查找

1、count
	计数功能，返回自定字符在字符串当中的个数。
	print("hello world".count("l"))
	输出:
	 3
	 
2、find
	查找，返回从左第一个指定字符的索引，找不到返回-1。
	print("hello world".find("l"))
	print("hello world".find("q"))
	# 输出:
	# 2
	# -1
	
3、rfind
	查找，返回从右第一个指定字符的索引，找不到返回-1。
	print("hello world".rfind("l"))
	print("hello world".rfind("q"))
	# 输出
	# 9
	# -1
	
4、index
	查找，返回从左第一个指定字符的索引，找不到报错。
	print("hello world".index("l"))
	print("hello world".index("q"))
	# 输出:
	# 2
	# ValueError: substring not found
	
5、rindex
	# 查找，返回从右第一个指定字符的索引，找不到报错。
	print("hello world".rindex("l"))
	print("hello world".rindex("q"))
	# 输出:
	# 9
	# ValueError: substring not found

字符串的替换

1、replace
	# 从左到右替换指定的元素，可以指定替换的个数，默认全部替换。
	print("hello world".replace("l","2"))   # 全部替换
	print("hello world".replace("l","2",2)) # 替换2个
	# 输出:
	# he22o wor2d
	# he22o world
	
2、translate
	# 按照对应关系来替换内容。
	from string import maketrans
	trans = maketrans("12345","abcde")
	print("52123123".translate(trans))
	# 输出：
	# ebabcabc

字符串的变形

1、upper
	# 将字符串当中所有的字母转换为大写,如果有大写的，将会保留不变。
	print("while is ok 1992".upper())
	print("while is Ok 1992".upper())
	# 输出:
	# WHILE IS OK 1992
	# WHILE IS OK 1992
	
2、lower
	# 将字符串当中所有的字母转换为小写 ,如果有小写的，将会保留不变。
	print("WHILE iS OK 1992".lower())
	# 输出:
	# while is ok 1992
	
3、swapcase
	# 将字符串当中所有的字母大小写互换。
	print("WHIle iS OK 1992".swapcase())
	# 输出:
	# whiLE Is ok 1992
	
4、title
	# 将字串符当中的单词首字母大写，单词以空格划分。
	print("WHIle iS ok 1992 a bc D".title())
	# 输出:
	# While Is Ok 1992 A Bc D
	
5、capitalize
	# 只有字符串的首字母大写。
	print("WHIle iS ok 1992".capitalize())
	# 输出:
	# While is ok 1992
	
6、expandtabs
	# 修改\t的长度。
	print("while is ok 1992".expandtabs())
	print("while \t is ok 1992".expandtabs())
	print("while \t is ok 1992".expandtabs(16))
	# 输出:
	# while is ok 1992
	# while    is ok 1992
	# while            is ok 1992

字符串的判断

1、isalnum
	# 判断字符串是否完全由字母和数字组成。
	print("hello world".isalnum())
	print("hello 123".isalnum())
	print("hello123".isalnum())
	# 输出:
	# False
	# False
	# True

2、isalpha
	# 判断字符串是否完全由字母组成。
	print("hello world".isalpha())
	print("helloworld".isalpha())
	print("hello 123".isalpha())
	# 输出:
	# False
	# True
	# False

3、isdigit
	# 判断字符串是否完全由数字组成。
	print("123456".isdigit())
	print("12 3".isdigit())
	print("hello123".isdigit())
	# 输出:
	# True
	# False
	# False

4、isupper
	# 判断字符串当中的字母是否完全是大写。
	print("hello".isupper())
	print("HELLO".isupper())
	print("helLO".isupper())
	# 输出:
	# False
	# True
	# False

5、islower
	# 判断字符串当中的字母是否完全是小写。
	print("hello".islower())
	print("HELLO".islower())
	print("helLO".islower())
	# 输出:
	# True
	# False
	# False

6、istitle
	# 判断字符串是否满足title格式 ，title格式是指-只有单词的首字母大写。
	print("hello".istitle())
	print("HELLO".istitle())
	print("helLO".istitle())
	print("Hello".istitle())
	# 输出:
	# False
	# False
	# False
	# True

7、isspace
	# 判断字符串是否完全由空格组成。
	print("   ".isspace())
	print("  a ".isspace())
	print("  \t ".isspace())
	print("  \n ".isspace())
	# 输出:
	# True
	# False
	# True
	# True

8、startswith
	# 判断字符串的开头字符，也可以截取判断。
	print("hello world".startswith("h"))        #判断字符串的开始是不是"h"
	print("hello world".startswith("e"))        #判断字符串的开始是不是"e"
	print("hello world".startswith("e",1))      #判断字符串下标从1开始的是不是"e"
	print("hello world".startswith("e",1,3))    #判断字符串下标从1-3开始的是不是"e"
	# 输出:
	# True
	# False
	# True
	# True

9、endswith
	# 判断字符串的结尾字符，也可以截取判断。
	print("hello world".endswith("d"))        #判断字符串的结尾是不是"d"
	print("hello world".endswith("w",1,6))    #判断字符串从1-6的结尾是不是"w"
	print("hello world".endswith(" ",1,6))    #判断字符串从1-6的结尾是不是" "
	# 输出:
	# True
	# False
	# True

字符串的切分

1、以行切分字符串
	# splitlines() 以行切分字符串，可以指定是否保留行标志（这里0和1代表的是布尔值）。
	print(
	"""
	hello world
	hello kerulu
	hello tom
	""".splitlines()
	)
	print(
	"""
	hello world
	hello kerulu
	hello tom
	""".splitlines(0)
	)
	print(
	"""
	hello world
	hello kerulu
	hello tom
	""".splitlines(1)
	)
	print("hello world".splitlines())
	# 输出:
	# ['', 'hello world', 'hello kerulu', 'hello tom']
	# ['', 'hello world', 'hello kerulu', 'hello tom']
	# ['\n', 'hello world\n', 'hello kerulu\n', 'hello tom\n']
	# ['hello world']
	
2、从左开始切分字符串
	# split() 从左开始切分字符串，可以指定切分次数和对象 ，默认以空格切割。
	print("hello world ke ru lu".split())
	print("hello world ke ru lu".split("l"))
	print("hello world ke ru lu".split("l", 1))
	# 输出:
	# ['hello', 'world', 'ke', 'ru', 'lu']
	# ['he', '', 'o wor', 'd ke ru ', 'u']
	# ['he', 'lo world ke ru lu']
	
3、从右开始切分字符串
	# rsplit() 从右开始切分字符串，可以指定切分次数和对象 ，默认以空格切割
	print("hello world ke ru lu".rsplit())
	print("hello world ke ru lu".rsplit("l"))
	print("hello world ke ru lu".rsplit("l", 1))
	# 输出:
	# ['hello', 'world', 'ke', 'ru', 'lu']
	# ['he', '', 'o wor', 'd ke ru ', 'u']
	# ['hello world ke ru ', 'u']

字符串的拼接

1、join
	# 将指定的字符串插入到后面的序列的每两个元素之间，进行拼接，形成一个新的字符串。
	print("*".join("hello world"))
	print(" ".join("hello world"))
	print("".join("hello world"))
	print(" % ".join("hello world"))
	# 输出:
	# h*e*l*l*o* *w*o*r*l*d
	# h e l l o   w o r l d
	# hello world
	# h % e % l % l % o %   % w % o % r % l % d
	
2、+
	# 将两个字符串拼接起来。
	print("ke"+"ru")
	# 输出:
	# keru
	
3、*
	# 将指定的字符串进行重复。
	# print("ke"*3)
	# 输出:
	# kekeke

字符串的编码

# encode    # 加码
# decode    # 解码
# 首先以gbk进行解码。解码后得到Unicode的中国，unicode是python2的核心编码。
print("中国".decode("gbk"))
# 对解码后的内容进行加码
print("中国".decode("gbk").encode("utf-8"))   #由gbk转码为utf-8的过程。

字符串格式化

# 1、“ %s” 可以接收数字和字符, 字符类型的格式化。
print("Tom is %s years old" % "18")
print("%s is %s years old" % ("Tom", 18))
# 结果:
# Tom is 18 years old
# Tom is 18 years old

# 2、“ %d” 只能接收数字, 整数的格式化。
print("%s is %d years old" % ("Tom", 18))
# print("%s is %d years old" % ("Tom", "18"))
# 结果:
# Tom is 18 years old
# TypeError: %d format: a number is required, not str

# 3、“ %2d” 至少保留两位长度，不足默认以空白从左补充。
print("%s is %2d years old" % ("TOM", 8.958787878))
print("%s is %0.2d years old" % ("TOM", 8.958787878))
# 结果:
# TOM is  8 years old
# TOM is 08 years old

# 4、“ %f”         保留六位小数。float型的格式化。
# 5、“ %0.1f”      保留一位小数。
# 6、“ % 0.2f”     保留两位小数, 浮点的格式化。
print("%s is %f years old" % ("TOM", 0.068787878))
print("%s is %0.1f years old" % ("TOM", 0.068787878))
print("%s is %0.2f years old" % ("TOM", 0.068787878))
# 结果:
# TOM is 0.068788 years old
# TOM is 0.1 years old
# TOM is 0.07 years old

# 7、“ % (key)s”  映射式格式符。
print("%(name)s is %(num)s years old,%(name)s have %(num)s $RMB" % ({"name": "TOM", "num": 18}))
print("{} is {} years old".format("TOM", 18))
# 结果:
# TOM is 18 years old,TOM have 18 $RMB
# TOM is 18 years old

# 8、% c           单个字符。
# 9、% d 或 % i    十进制整数。
# 10、%x           十六进制整数。
# 11、% f、% F     浮点数。
# 12、% r          字符串（采用repr()显示）。
# 13、% o          八进制整数。
# 14、% e          指数（基数写为e）。
# 15、% E          指数（基数写为E）。
# 16、% %          字符 %。
template = '编号：%09d\t公司名称：%s \t官网： http://www.%s.com'  # 定义模板。
x1 = (6, '百度', 'baidu')   # 定义转换的内容
x2 = (8, '华为', 'huawei')  # 定义转换的内容
print(template % x1)  # 格式化输出
print(template % x2)  # 格式化输出
# 输出：
# 编号：000000006	 公司名称：百度 	 官网： http://www.baidu.com
# 编号：000000008	 公司名称：华为 	 官网： http://www.huawei.com

# 17、数字不补空格而是在左侧补0,在%后面写一个0,然后加上位数n。
info1='我叫%6s,你叫%6s,他叫%6s,今年是%010d年'%('高手名字不能太长','徐渭','谢晋',2021)
print(info1)
# 我叫高手名字不能太长,你叫    徐渭,他叫    谢晋,今年是0000002021年

# 18、默认都是右对齐,如果要改成左对齐,那么可以加一个负号
info2='我叫%6s,你叫%6s,他叫%6s,今年是%10d年'%('高手名字不能太长','徐渭','谢晋',2021)
info3='我叫%-6s,你叫%-6s,他叫%-6s,今年是%-10d年'%('高手名字不能太长','徐渭','谢晋',2021)
print(info2)
print(info3)
# 我叫高手名字不能太长,你叫    徐渭,他叫    谢晋,今年是      2021年
# 我叫高手名字不能太长,你叫徐渭    ,他叫谢晋    ,今年是2021      年

# 19、.format()
# str.format(args)
# s      对字符串类型格式化。
# d      十进制数。
# c      将十进制数自动转换为对应的Unicode字符。
# e或E   转换为科学计数法表示在格式化。
# g或G   自动在e和f或E和F中切换。
# b      将十进制数转换成二进制数表示再格式化。
# o      将十进制数转换成八进制数再格式化。
# x或X   将十进制数转换成十六进制数再格式化。
# f或F   转化为浮点数（默认小数点后保留6位）再格式化。
# %     显示百分比（默认显示小数点后6位）。

# template = '编号：{:0>9s}\t 公司名称：{:s} \t官网： http://www.{：s}.com'  # 定义模板。
# x3 = template.format('6', '百度', 'baidu')  # 定义转换的内容
# x4 = template.format('8', '华为', 'huawei')  # 定义转换的内容
# print(x3)  # 格式化输出
# print(x4)  # 格式化输出

# 如果前面的占位符比后面的参数少,不会报错,反之则会报错。
# str1 = 'My name is {},Your name is {},age is {}.'.format('tom','kangkang',21)
# str2 = 'My name is {},Your name is {},age is {}.'.format('tom','kangkang',21,"小明")
# str3 = 'My name is {},Your name is {},age is {}.'.format('tom','kangkang')
# print(str1)
# print(str2)
# print(str3)
# 输出：
# My name is tom,Your name is kangkang,age is 21.
# My name is tom,Your name is kangkang,age is 21.
# IndexError: Replacement index 2 out of range for positional args tuple

# 除了按顺序取值之外,也可以使用下标取值的方式
str4 = 'My name is {1},Your name is {0},age is{2}.'.format('小明','康康',21)
print(str4)
# 输出：
# My name is 康康,Your name is 小明,age is21.

# 补齐到至少n位,在{}里面加上:n
# 字符串默认左对齐,数字默认右对齐,可以根据需要修改对齐方式。
# < 左对齐,> 右对齐,^ 居中对齐
str5 = 'My name is {0:10},Your name is {1:10},age is{2:10}.'.format('zhangwengqiang','Ralf',21)   # 通过下标取值，默认左对齐
str6 = 'My name is {:10},Your name is {:10},age is{:10}.'.format('zhangwengqiang','Ralf',21)      # 默认左对齐
str7 = 'My name is {:<10},Your name is {:<10},age is {:<10}.'.format('zhangwengqiang','Ralf',21)  # 左对齐
str8 = 'My name is {:^10},Your name is {:^10},age is {:^10}.'.format('zhangwengqiang','Ralf',21)  # 居中
str9 = 'My name is {:>10},Your name is {:>10},age is {:>10}.'.format('zhangwengqiang','Ralf',21)  # 右对齐
print(str5)
print(str6)
print(str7)
print(str8)
print(str9)
# 输出：
# My name is zhangwengqiang,Your name is Ralf      ,age is        21.
# My name is zhangwengqiang,Your name is Ralf      ,age is        21.
# My name is zhangwengqiang,Your name is Ralf      ,age is 21        .
# My name is zhangwengqiang,Your name is    Ralf   ,age is     21    .
# My name is zhangwengqiang,Your name is       Ralf,age is         21.

# 数字也可以补0
str10 = 'My name is {1:>010},Your name is {0:>010},age is{2:>010}.'.format('Clark','Ralf',21)
print(str10)
# 输出：
# My name is 000000Ralf,Your name is 00000Clark,age is0000000021.

# 在python3.6以后的版本中,可以用f'字符串'的写法
x = '刘德华'
y = '张学友'
str11 = f'My name is {x},Your name is {y}'
str12 = f'My name is {{{x}}},Your name is {{{y}}}'
str13 = 'My name is %s, Your name is %s' % (x, y)
print(str11)
print(str12)
print(str13)
# 输出：
# My name is 刘德华,Your name is 张学友
# My name is {刘德华},Your name is {张学友}
# My name is 刘德华,Your name is 张学友

卢同学.

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
打赏
0
评论
Python字符串类型

1、特点1.1、字符串一旦定义不允许修改，可以替换。字符串容器中的元素都是字符类型的。1.2、字符串是一个不可修改的序列，当我们对字符串进行修改的时候，实际上是生成了一个新的被修改后的对象，而原字符串没有发生修改。1.3、字符串开始和结尾使用的引号形式必须一致。1.4、当表示复杂的字符串时，还可以进行引号的嵌套。例如："a'bb'c"、""" aa'bb'cc"dd"ee """2、定义str = 'hello world'2.1、字符串是一个有序的，不可修改的，元素以引号包围的序列。
复制链接

扫一扫