Python-string 库的详细介绍

wanglaqqqq

于 2025-03-11 14:10:06 发布

阅读量1k

点赞数 18

分类专栏： # 文本处理与正则表达式文章标签： python 开发语言

本文链接：https://blog.csdn.net/wanglaqqqq/article/details/146177563

版权

文本处理与正则表达式专栏收录该内容

3 篇文章

订阅专栏

以下是 Python string 模块的详细解析，涵盖其核心功能、常用函数及使用场景：

1. 模块概述

string 是 Python 标准库的基础模块之一，专注于字符串处理，提供以下功能：

字符串格式化（旧式 % 操作符）
字符串常量（如字母、数字、符号集合）
基础字符串操作（大小写转换、查找替换、分割连接等）
编码辅助工具（Python 2 中的编码函数，Python 3 已废弃）

注意：

Python 3 中许多字符串方法已直接集成到 str 类（如 split()、upper()），string 模块更多用于兼容旧代码或提供特定工具。
新手推荐优先学习 str 类的方法，而非直接使用 string 模块。

2. 核心功能与常用函数

(1) 字符串格式化（旧式 `%` 操作符）

函数/方法	描述	示例
`string.Formatter`	类似 `str.format()` 的旧式实现	`formatter = string.Formatter(); print(formatter.format("Name: {}, Age: {}", "Alice", 30))`
`%` 运算符	格式化字符串（需配合格式化说明符）	`print("%.2f" % 3.14159)` → 输出 `3.14`

格式化说明符：

%s：字符串
%d / %i：整数
%f：浮点数
%x / %X：十六进制（小写/大写）
%c：字符

(2) 字符串处理函数

函数名	描述	示例
`string.ascii_lowercase`	ASCII 小写字母集合 (`'abcdefghijklmnopqrstuvwxyz'`)	`import string; print(string.ascii_lowercase[:3])` → `abc`
`string.ascii_uppercase`	ASCII 大写字母集合 (`'ABCDEFGHIJKLMNOPQRSTUVWXYZ'`)	`print(string.ascii_uppercase)` → 全大写字母串
`string.digits`	ASCII 数字集合 (`'0123456789'`)	`print(string.digits)` → `0123456789`
`string.ascii_letters`	ASCII 字母大小写集合（合并小写和大写）	`print(len(string.ascii_letters))` → 输出 `52`
`string.whitespace`	ASCII 空白字符集合（空格、制表符、换行等）	`print(string.whitespace)` → `\t\n\x0b\x0c\r`
`string.punctuation`	ASCII 标点符号集合	`print(string.punctuation)` → `!"#$%&'()*+,-./:;<=>?@[\]^_`{

**(3) 字符串操作工具（兼容旧代码）**

函数名	描述	示例
`string.capwords(s)`	将字符串分割后每个单词首字母大写	`print(string.capwords("hello world"))` → `Hello World`
`string.center(s, width[, fillchar])`	居中对齐字符串，不足部分用 `fillchar` 填充	`print(string.center("test", 10, ''))` → `test***`
`string.count(s, sub)`	统计子串 `sub` 在字符串 `s` 中出现的次数	`print(string.count("apple", "pineapple"))` → `1`
`string.find(s, sub[, start[, end]])`	返回子串首次出现的索引，未找到返回 `-1`	`print(string.find("apple", "pineapple"))` → `0`
`string.join(iterable)`	用字符串连接可迭代对象的元素	`print(string.join(["a", "b"], "-"))` → `a-b`
`string.lower(s)`	转换字符串为小写（等同于 `s.lower()`）	`print(string.lower("HELLO"))` → `hello`
`string.upper(s)`	转换字符串为大写（等同于 `s.upper()`）	`print(string.upper("hello"))` → `HELLO`
`string.split(s[, sep[, maxsplit]])`	按分隔符分割字符串，返回列表	`print(string.split("hello,world", ","))` → `['hello', 'world']`
`string.strip(s[, chars])`	去除字符串两端的空白字符或指定字符	`print(string.strip(" hello "))` → `hello`

(4) 编码与解码（Python 2 向 Python 3 迁移工具）

函数名	描述	示例
`string.encode(s, encoding[, errors])`	字符串编码为字节（Python 3 中已废弃）	`print(string.encode("hello", "utf-8"))` → `b'hello'`
`string.decode(s, encoding[, errors])`	字节解码为字符串（Python 3 中已废弃）	`print(string.decode(b"hello", "utf-8"))` → `hello`

3. 新式字符串格式化（推荐使用）

虽然 string 模块支持旧式 % 格式化，但 Python 3 推荐使用 str.format() 或 f-string：

# 使用 str.format()
name = "Alice"
age = 30
print("Name: {}, Age: {}".format(name, age))  # Name: Alice, Age: 30

# 使用 f-string
print(f"Name: {name}, Age: {age}")  # Name: Alice, Age: 30

4. 示例代码

import string

# 使用常量生成密码规则
all_chars = string.ascii_letters + string.digits + string.punctuation
password = ''.join(random.choices(all_chars, k=12))
print("随机密码:", password)

# 格式化字符串
s = "Today is %s, temperature is %d°C" % (string.ascii_lowercase[0], 25)
print(s)  # Today is a, temperature is 25°C

# 分割并处理文本
text = "Hello,   world! This is a test."
clean_text = string.strip(text).replace("  ", " ").split()
print(clean_text)  # ['Hello,', 'world!', 'This', 'is', 'a', 'test.']

5. 注意事项

Python 3 的改进：
- 大多数字符串方法已直接集成到 str 类（如 s.lower()），string 模块仅保留旧式工具。
- 编码/解码功能推荐使用 bytes 和 str 的 encode()/decode() 方法。
性能考虑：
- string 模块的函数通常返回新字符串，频繁操作大字符串时建议使用列表拼接或生成器表达式。
弃用警告：
- 部分函数（如 string.rjust）在 Python 3 中仍存在，但更推荐使用 str.rjust()。