如何在Python中小写字符串？

最新推荐文章于 2024-09-07 17:44:21 发布

w36680130

最新推荐文章于 2024-09-07 17:44:21 发布

阅读量1.4k

点赞数

文章标签： python string uppercase lowercase

原文链接：https://oldbug.net/q/SWSu/How-do-I-lowercase-a-string-in-Python

版权

本文翻译自：How do I lowercase a string in Python?

Is there a way to convert a string from uppercase, or even part uppercase to lowercase? 有没有一种方法可以将字符串从大写字母甚至部分大写字母转换为小写字母？

For example, "Kilometers" → "kilometers". 例如，“公里”→“公里”。

#1楼

参考：https://stackoom.com/question/SWSu/如何在Python中小写字符串

#2楼

Also, you can overwrite some variables: 另外，您可以覆盖一些变量：

s = input('UPPER CASE')
lower = s.lower()

If you use like this: 如果您这样使用：

s = "Kilometer"
print(s.lower())     - kilometer
print(s)             - Kilometer

It will work just when called. 它会在被调用时起作用。

#3楼

With Python 2, this doesn't work for non-English words in UTF-8. 对于Python 2，这不适用于UTF-8中的非英语单词。 In this case decode('utf-8') can help: 在这种情况下， decode('utf-8')可以帮助您：

>>> s='Километр'
>>> print s.lower()
Километр
>>> print s.decode('utf-8').lower()
километр

#4楼

How to convert string to lowercase in Python? 如何在Python中将字符串转换为小写？

Is there any way to convert an entire user inputted string from uppercase, or even part uppercase to lowercase? 有什么办法可以将整个用户输入的字符串从大写甚至部分大写转换为小写？

Eg Kilometers --> kilometers 例如公里->公里

The canonical Pythonic way of doing this is 规范的Python方式是

>>> 'Kilometers'.lower()
'kilometers'

However, if the purpose is to do case insensitive matching, you should use case-folding: 但是，如果目的是进行不区分大小写的匹配，则应使用大小写折叠：

>>> 'Kilometers'.casefold()
'kilometers'

Here's why: 原因如下：

>>> "Maße".casefold()
'masse'
>>> "Maße".lower()
'maße'
>>> "MASSE" == "Maße"
False
>>> "MASSE".lower() == "Maße".lower()
False
>>> "MASSE".casefold() == "Maße".casefold()
True

This is a str method in Python 3, but in Python 2, you'll want to look at the PyICU or py2casefold - several answers address this here . 这是Python 3中的str方法，但是在Python 2中，您需要查看PyICU或py2casefold- 几个答案在这里解决。

Unicode Python 3 Unicode Python 3

Python 3 handles plain string literals as unicode: Python 3将纯字符串文字处理为unicode：

>>> string = 'Километр'
>>> string
'Километр'
>>> string.lower()
'километр'

Python 2, plain string literals are bytes Python 2，纯字符串文字是字节

In Python 2, the below, pasted into a shell, encodes the literal as a string of bytes, using utf-8 . 在Python 2中，将以下内容粘贴到外壳中，然后使用utf-8将文字编码为字节字符串。

And lower doesn't map any changes that bytes would be aware of, so we get the same string. 而且lower不会映射字节会知道的任何更改，因此我们得到相同的字符串。

>>> string = 'Километр'
>>> string
'\xd0\x9a\xd0\xb8\xd0\xbb\xd0\xbe\xd0\xbc\xd0\xb5\xd1\x82\xd1\x80'
>>> string.lower()
'\xd0\x9a\xd0\xb8\xd0\xbb\xd0\xbe\xd0\xbc\xd0\xb5\xd1\x82\xd1\x80'
>>> print string.lower()
Километр

In scripts, Python will object to non-ascii (as of Python 2.5, and warning in Python 2.4) bytes being in a string with no encoding given, since the intended coding would be ambiguous. 在脚本中，Python将反对非ascii（从Python 2.5开始，在Python 2.4中为警告）字节，该字节位于未给出编码的字符串中，因为预期的编码将是模棱两可的。 For more on that, see the Unicode how-to in the docs and PEP 263 有关更多信息，请参阅文档和PEP 263中的Unicode操作方法。

Use Unicode literals, not `str` literals 使用Unicode文字，而不是`str`文字

So we need a unicode string to handle this conversion, accomplished easily with a unicode string literal, which disambiguates with a u prefix (and note the u prefix also works in Python 3): 因此，我们需要一个unicode字符串来处理此转换，只需使用unicode字符串文字即可轻松完成此操作，该字符串可使用u前缀消除歧义（请注意， u前缀在Python 3中也适用）：

>>> unicode_literal = u'Километр'
>>> print(unicode_literal.lower())
километр

Note that the bytes are completely different from the str bytes - the escape character is '\\u\u0026#39; followed by the 2-byte width, or 16 bit representation of these unicode letters: 请注意，这些字节与str字节完全不同-转义字符为'\\u\u0026#39;后跟2个字节的宽度或这些unicode字母的16位表示形式：

>>> unicode_literal
u'\u041a\u0438\u043b\u043e\u043c\u0435\u0442\u0440'
>>> unicode_literal.lower()
u'\u043a\u0438\u043b\u043e\u043c\u0435\u0442\u0440'

Now if we only have it in the form of a str , we need to convert it to unicode . 现在，如果仅以str形式使用它，则需要将其转换为unicode 。 Python's Unicode type is a universal encoding format that has many advantages relative to most other encodings. Python的Unicode类型是一种通用编码格式，相对于大多数其他编码而言，它具有许多优点。 We can either use the unicode constructor or str.decode method with the codec to convert the str to unicode : 我们可以在编解码器中使用unicode构造函数或str.decode方法，将str转换为unicode ：

>>> unicode_from_string = unicode(string, 'utf-8') # "encoding" unicode from string
>>> print(unicode_from_string.lower())
километр
>>> string_to_unicode = string.decode('utf-8') 
>>> print(string_to_unicode.lower())
километр
>>> unicode_from_string == string_to_unicode == unicode_literal
True

Both methods convert to the unicode type - and same as the unicode_literal. 两种方法都转换为unicode类型-并与unicode_literal相同。

Best Practice, use Unicode 最佳做法，使用Unicode

It is recommended that you always work with text in Unicode . 建议始终使用Unicode文本。

Software should only work with Unicode strings internally, converting to a particular encoding on output. 软件应仅在内部使用Unicode字符串，并在输出时转换为特定的编码。

Can encode back when necessary 必要时可以回编码

However, to get the lowercase back in type str , encode the python string to utf-8 again: 但是，要恢复为str类型的小写字母，请再次将python字符串编码为utf-8 ：

>>> print string
Километр
>>> string
'\xd0\x9a\xd0\xb8\xd0\xbb\xd0\xbe\xd0\xbc\xd0\xb5\xd1\x82\xd1\x80'
>>> string.decode('utf-8')
u'\u041a\u0438\u043b\u043e\u043c\u0435\u0442\u0440'
>>> string.decode('utf-8').lower()
u'\u043a\u0438\u043b\u043e\u043c\u0435\u0442\u0440'
>>> string.decode('utf-8').lower().encode('utf-8')
'\xd0\xba\xd0\xb8\xd0\xbb\xd0\xbe\xd0\xbc\xd0\xb5\xd1\x82\xd1\x80'
>>> print string.decode('utf-8').lower().encode('utf-8')
километр

So in Python 2, Unicode can encode into Python strings, and Python strings can decode into the Unicode type. 因此，在Python 2中，Unicode可以编码为Python字符串，而Python字符串可以解码为Unicode类型。

#5楼

Don't try this, totally un-recommend, don't do this: 请勿尝试，完全不推荐，请勿这样做：

import string
s='ABCD'
print(''.join([string.ascii_lowercase[string.ascii_uppercase.index(i)] for i in s]))

Output: 输出：

abcd

Since no one wrote it yet you can use swapcase (so uppercase letters will become lowercase, and vice versa) (and this one you should use in cases where i just mentioned (convert upper to lower, lower to upper)): 由于没有人写过它，因此您可以使用swapcase （因此大写字母将变为小写，反之亦然）（并且在我刚才提到的情况下，应使用此字母（将大写转换为小写，将小写转换为大写））：

s='ABCD'
print(s.swapcase())

Output: 输出：

abcd

#6楼

Use .lower() - For example: 使用.lower() -例如：

s = "Kilometer"
print(s.lower())

The official 2.x documentation is here: str.lower() 官方2.x文档在这里： str.lower()
The official 3.x documentation is here: str.lower() 正式的3.x文档在这里： str.lower()