day10string-CSDN博客

本文链接：https://blog.csdn.net/FinoXC/article/details/129660409

1 sharCode

计算机存储数据只能存数字（存的是数字的二进制补码）

1.1 字符编码

为了你能够让计算机存储文字符号，给每个符号对应了一个固定的数字，每次在需要存储这个符号的时候，就去存储这个固定的数字。
每个对应的那个数字就是这个符号的编码值。

1.2 编码表 - 保存字符和字符对应编码值的表

ASCII码表
美国信息码（只包含了美国人常用的符号，总共128个）
数字字符
大写字母（A - 65）
小写字母（a - 97）
英文输入法下的特殊符号
Unicode编码表(Python) - 包含了世界上所有的国家所有的民族的所有的语言符号
Unicode编码表包含了ASCII （前128个字符就是ASCII码表中的内容）
中文编码范围：0x4e00 ~ 0x9fa5

1.3 编码值的应用

chr(编码值) - 获取编码值对应的字符

print(chr(97))
print(chr(0x4e00))
print(chr(0x9fa5))

from pypinyin import pinyin
print(pinyin(''))
for i in range(0x4e00, 0x9fa5 + 1):
    print(chr(i), pinyin(chr(i)), end=' ')

ord(字符) - 获取指定字符对应的编码值

print(ord('王'))
print(ord('茂'), ord('M'))
print(ord('俊'))

char1 = 'b'
print(chr(ord(char1) - 32))

char1 = 'B'
print(chr(ord(char1) + 32))

编码字符：\u 4位的16进制编码值

在字符串中提供字符的方式有两种：a.直接提供字符 b.使用编码字符

hex(10进制数) - 获取指定数字对应的16进制表示方式

str1 = 'ab你好'
print(str1)

# 如果知道字符编码值是多少，但是不知道字符是什么的时候，就可以使用编码字符来表示这个字符
str2 = '\u0061\u9fa5'
print(str2)

char = '是'
if '\u4e00' <= char <= '\u9fa5':
    print(char, '是中文字符')
else:
    print(char, '不是中文字符')

2 getCher

字符串获取字符的方法和列表获取元素的方法一样

2.1 获取单个字符

# 注意：字符串中转义字符的长度是1
# 字符串[下标]
str1 = '\thello\nworld\u9fa5'
print(str1[1])
print(str1[-1])

2.2 切片

str2 = 'good good study'
print(str2[0:-2:2])

2.3 遍历字符串

str2 = 'good good study'
for x in str2:
    print(x)

str2 = 'good good study'
for index in range(len(str2)):
    print(index, str2[index])

for index, itme in enumerate(str2):
    print(index, itme)

3 stringOperations

3.1 字符串加法运算

# 字符串1 + 字符串2  -  将两个字符串合并成一个字符串
str1 = 'hello'
str2 = '你好'
print(str1+str2)    # hello你好
print(str1 + ' ' + str2)   # hello 你好

# 案例1:提取字符串中所有数字字符
str1 = '打算dd32a23s，eq42'
new_str1 = ''
for i in str1:
    if '0' <= i <= '9':
        new_str1 += i
print(new_str1)
# 案例2: 在每一个数字字符后面插入一个 %
str1 = '打算dd32a23s，eq42'
new_str1 = ''
for i in str1:
    if '0' <= i <= '9':
        new_str1 += i+'%'
    else:
        new_str1 += i
print(new_str1)

# 练习:将字符串中所有的数字字符都改成'+'
str1 = '打算dd32a23s，eq42'
new_str1 = ''
for i in str1:
    if '0' <= i <= '9':
        new_str1 += '+'
    else:
        new_str1 += i
print(new_str1)

3.2 字符串乘法运算

# 字符串 * N , N * 字符串  -  上字符串中元素重复N次产生一个新的字符串
str1 = '你好' * 5
print(str1)

3.3 字符串比较运算

比较是否相等： ==、!=
```
print('abc' == 'bca')   # False
```
比较大小

两个字符串比较大小，比较的是第一对不相等的字符的编码值的大小

判断字符的性质：
是否是数字字符：‘0’ <= x <= ‘9’
是否是小写字母：‘a’ <= x <= ‘z’
是否是大写字母：‘A’ <= x <= ‘Z’
是否是字母：‘a’ <= x <= ‘z’ or ‘A’ <= x <= ‘Z’
是否是中文：‘\u4e00’ <= x <= ‘\u9fa5’
```
print('abc' > 'bcc')    # False
print('a1c' > 'abc')    # False
print('a你好' > 'adc')  # True
```

3.4 in 和 not in

# 字符串1 in 字符串2 - 字符串1是否是字符串2的子串(字符串2是否包含字符串1)
print('a' in 'abc')         # True
print('ab' in 'abc')        # True
print('ac' in 'abc')        # False

print(10 in [10, 20, 30])       # True
print([10, 20] in [10, 20, 30])   # False

3.5 相关函数

max、min、sorted

# 判断的是十六进制码的大小
print(max('gfdgjkh'))
print(sorted('gfdgjkh'))		
print(sorted('helloworld'))     # ['d', 'e', 'h', 'l', 'l', 'l', 'o', 'o', 'r', 'w']

len(字符串) - 一个转义字符占一个字符

msg = '\thello\nworld!\u4e00'
print(len(msg))

msg = 'sdfjlke43'
print(len(msg))

str(数据) - 任何类型的数据都可以转换成字符串；转换的时候是在数据的打印值外面加引号

str(100)        # '100'
str(2.3)        # '2.3'
str(True)       # 'True'

list1 = [10, 20, 30, 40]
print(str(list1))        # '[10, 20, 30, 40]'

list1 = [10,20,30,40]
print(str(list1))        # '[10, 20, 30, 40]'

list1 = ["sad",23]
print(str(list1))        # "['sad', 23]"

eval(字符串) - 去掉字符串的引号，获取引号中表达式的结果

a = eval('100')         # a = 100
b = eval('1.23')        # b = 1.23

abc = 100
c = eval('abc')         # c = abc

d = eval('"hello"')     # d = "hello"

a1 = eval('[10, 20, 30, 40]')           # a1 = [10, 20, 30, 40]
print(a1)
a1.append(100)
print(a1)       # [10, 20, 30, 40, 100]

b1 = eval('10 + 20')        # b1 = 10 + 20
print(b1)


msg = 'print(100)'
eval(msg)

a = int(input('请输入数字a的值：'))
b = int(input('请输入数字b的值：'))
value = input('请选择运算符（+、-、*、/）：')
result = eval('a' + value + 'b')    # 'a+b'、'a-b'
print(result)

4 stringMethods

字符串.xxx()

4.1 字符串.join() - 将序列中的元素用指定的字符串连接成一个新的字符串(序列中的元素必须全部都是字符串)

list1 = ['小明', '张三', '李四', '王五']
result = '-'.join(list1)
print(result)       # 小明-张三-李四-王五

result = '**'.join('abc')
print(result)      # a**b**c

nums = [90, 32, 43, 12]
result = '+'.join([str(i)for i in nums])
print(result)
print(eval(result))

4.2 字符串1.count(字符串2) - 统计字符串1中字符串2的个数

result = msg.count('a')
print(result)               # 4

print(msg.count('you'))     # 3

4.3 字符串.split()

字符串1.split(字符串2) - 将字符串1中所有的字符串2作为切割点对字符串1进行切割

msg = 'how are you? i am fine! thank you, and you?'
result = msg.split('you')
print(result)      # ['how are ', '? i am fine! thank ', ', and ', '?']

字符串1.split(字符串2, N) - 将字符串1中前N个字符串2作为切割点对字符串1进行切割

msg = 'how are you? i am fine! thank you, and you?'
result = msg.split('you', 2)
print(result)               # ['how are ', '? i am fine! thank ', ', and you?']

4.4 字符串.replace()

字符串1.replace(字符串2, 字符串3) - 将字符串1中所有的字符串2都替换成字符串3

字符串1.replace(字符串2, 字符串3, N) - 将字符串1中前N个字符串2都替换成字符串3

msg = 'how are you? i am fine! thank you, and you?'
result = msg.replace('you', 'me')
print(result)       # how are me? i am fine! thank me, and me?

result = msg.replace('you', '')
print(result)

msg = 'how are you? i am fine! thank you, and you?'
result = msg.replace('you', '+', 2)
print(result)       # how are +? i am fine! thank +, and you?

4.5 字符串.strip() - 去掉字符串前后的空白字符

msg = """

       'how are you? i am fine! thank you, and you?'

"""
print(msg)
result = msg.strip()
print(result)

msg = '    dasda   '
print(msg.strip())

msg = 'dasda///'
print(msg.strip('/'))

4.6

字符串.isupper() - 判断字符串是否是纯大写字母字符串

字符.isupper() - 判断字符是否是大写字母

print('JSKS'.isupper())
print('A'.isupper())

6.7 字符串.islower() - 判断字符或者字符串是否全是小写字母

print('a'.islower())        # True

6.8 字符串.isdigit() - 判断字符或者字符串是否全是数字字符

print('9'.isdigit())
print('720233'.isdigit())

6.9 字符串.upper()

print('hs技术上223jKJ90lo'.upper())
print('m'.upper())

6.10 字符串.lower()

print('hAMs技术上223jKJ90lo'.lower())
print('Q'.lower())


# 案例：判断char是否是字母
char = '2'

if char.islower() or char.isupper():
    print('是字母')
else:
    print('不是字母')

5 formatString 格式化字符串

解决的问题：字符串内容不确定

name = '小明'
age = 25
money = 1200.11
# xxx今年xx岁,月薪：xxxx元！

5.1 字符串拼接

name = '小明'
age = 25
money = 1200.11
msg = name + '今年' + str(age) + '岁,月薪：' + str(money) + '元！'
print(msg)

5.2 格式字符串 - 包含格式占位符的字符串

语法：包含格式占位符的字符串 % (数据1, 数据2, 数据3,…)
注意：()中的数据必须和前面字符串中的占位符一一对应
常见的格式占位符：

%s - 可以给任何类型的数据占位
%d - 可以给任何数字占位(整数占位符，如果给的数字是小数，会自动转换成整数在拼接到字符串中)
%f - 可以给任何数字占位(浮点数占位符)
%.Nf - (控制保留N位小数 - 默认是保留6位小数)

msg = '%s今年%d岁,月薪:%.2f元!' % (name, age, money)
print(msg)

result = 'x: %s' % 3.1455
print(result)

5.3 f-string

在字符串的最前面(引号的前面)加f，就可以在字符串中通过 {表达式} 中表达式的结果来给字符串提供内容

a = 100
b = 20
# 'xxx + xx = xxx'
print(f'{a} + {b} = {a+b}')     # '100 + 20 = 120'


# {表达式:.Nf}  -  控制小数保留N位小数
a = 1.234
b = 2.3
print(f'{a:.2f} + {b:.2f} = {a + b:.0f}')

# {表达式:.N%} -   控制数字显示成百分比，N空值百分比的小数位数
c = 0.98
print(f'及格率:{c:.0%}')

# {表达式:,.Nf}  -  标准的金额拼接
# 357800  -> ￥356,800.00
d = 3578000
print(f'￥{d:,.2f}')