The dir() Function and Simple Data Structures in Python

Article link: https://blog.csdn.net/FirstBloodFB/article/details/44398921

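The dir() function

You can use the built-in dir function to list the identifiers that a module defines. These identifiers include functions, classes, and variables.

When you pass a module name to dir(), it returns the list of names defined in that module. When you pass no argument, it returns the list of names defined in the current module.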

$ python

>>> import sys
>>> dir(sys) # get list of attributes for sys module
['__displayhook__', '__doc__', '__excepthook__', '__name__', '__stderr__',
'__stdin__', '__stdout__', '_getframe', 'api_version', 'argv',
'builtin_module_names', 'byteorder', 'call_tracing', 'callstats',
'copyright', 'displayhook', 'exc_clear', 'exc_info', 'exc_type',
'excepthook', 'exec_prefix', 'executable', 'exit', 'getcheckinterval',
'getdefaultencoding', 'getdlopenflags', 'getfilesystemencoding',
'getrecursionlimit', 'getrefcount', 'hexversion', 'maxint', 'maxunicode',
'meta_path', 'modules', 'path', 'path_hooks', 'path_importer_cache',
'platform', 'prefix', 'ps1', 'ps2', 'setcheckinterval', 'setdlopenflags',
'setprofile', 'setrecursionlimit', 'settrace', 'stderr', 'stdin', 'stdout',
'version', 'version_info', 'warnoptions']
>>> dir() # get list of attributes for current module
['__builtins__', '__doc__', '__name__', 'sys']
>>>
>>> a = 5 # create a new variable 'a'
>>> dir()
['__builtins__', '__doc__', '__name__', 'a', 'sys']
>>>
>>> del a # delete/remove a name
>>>
>>> dir()
['__builtins__', '__doc__', '__name__', 'sys']
>>>

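How it works

First, we use dir on the imported sys module. We can see that it contains a huge list of attributes.

Next, we call dir without passing any argument. By default, it returns the list of attributes of the current module. Note that imported modules are also part of this list.

To see dir in action, we define a new variable a and assign it a value, then check dir again; the list now contains an additional entry with that variable's name. We then remove the variable/attribute from the current module with the del statement, and the change is once again reflected in the output of dir.

A note on del: this statement deletes a variable/name. After it has run, here del a, you can no longer access the variable a; it is as if it had never existed.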
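The behaviour described above can also be checked from a script rather than at the prompt. The following is a minimal sketch, not part of the original listing: show_local_names is a hypothetical helper, and it relies on the fact that dir() without arguments lists the names of the current local scope (which is the module itself at the prompt, and the function's locals inside a function).

```python
import sys

# dir(module) returns an alphabetically sorted list of attribute names.
sys_names = dir(sys)
print('path' in sys_names)


def show_local_names():
    """Hypothetical helper: call dir() before and after deleting a name."""
    a = 5
    names_with_a = dir()      # 'a' is listed here
    del a
    names_without_a = dir()   # 'a' is gone again
    return names_with_a, names_without_a


with_a, without_a = show_local_names()
print('a' in with_a, 'a' in without_a)

# After del, using the name raises NameError.
x = 1
del x
try:
    x
    x_still_defined = True
except NameError:
    x_still_defined = False
print(x_still_defined)
```

Wrapping the dir() calls in a function keeps the result independent of whatever else happens to be defined in the surrounding module.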

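Data structures in Python

This article covers three of Python's built-in data structures: the list, the tuple, and the dictionary. We will see how to use each of them and how they make programming easier.

Lists

A list is a data structure that holds an ordered collection of items, i.e. you can store a sequence of items in a list. This is easy to picture if you think of a shopping list of things to buy, except that on a shopping list each item probably sits on its own line, whereas in Python you separate the items with commas.

The items of a list are enclosed in square brackets so that Python knows you are specifying a list. Once you have created a list, you can add, remove, or search for items in it. Because items can be added and removed, we say that a list is a mutable data type, i.e. a type whose values can be changed.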
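As a small illustration of adding, removing, and searching, here is a sketch that reuses the shopping-list idea. The item names are only examples, and the parenthesized print calls assume Python 3.

```python
# A list literal: items in square brackets, separated by commas.
shoplist = ['apple', 'mango', 'carrot', 'banana']

shoplist.append('rice')      # add an item at the end
shoplist.remove('mango')     # remove the first item equal to 'mango'

has_carrot = 'carrot' in shoplist              # search by value
carrot_position = shoplist.index('carrot')     # position of the first match

print(shoplist)
print(has_carrot, carrot_position)
```

The list object is changed in place by append and remove, which is what "mutable" means here.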

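Tuples

Tuples are very similar to lists, except that they are immutable like strings, i.e. you cannot modify a tuple. A tuple is defined by specifying items separated by commas within a pair of parentheses. Tuples are usually used where a statement or a user-defined function can safely assume that the collection of values, i.e. the tuple being used, will not change.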
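The immutability of tuples can be seen in a short sketch like the following. The tuple contents and variable names are made up for illustration.

```python
# A tuple literal: items separated by commas inside parentheses.
zoo = ('python', 'elephant', 'penguin')

first_animal = zoo[0]        # reading an item works as with a list
animal_count = len(zoo)

# Assigning to an item is rejected because tuples are immutable.
try:
    zoo[0] = 'wolf'
    was_modified = True
except TypeError:
    was_modified = False

print(first_animal, animal_count, was_modified)
```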

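Dictionaries

A dictionary is like an address book in which you find a person's address and contact details by knowing only his or her name, i.e. we associate keys (names) with values (details). Note that the keys must be unique, just as you could not find the correct information if two people had exactly the same name.

Note that you can use only immutable objects (such as strings) as dictionary keys, but you can use either immutable or mutable objects as dictionary values. In practice this means you should use only simple objects as keys.

Key/value pairs are specified in a dictionary with the notation d = {key1 : value1, key2 : value2 }. Note that a key and its value are separated by a colon, the pairs are separated from one another by commas, and all of this is enclosed in a pair of curly braces.

Remember that in the Python 2 used throughout this article, the key/value pairs of a dictionary are not ordered in any way (dictionaries keep insertion order only from Python 3.7 onward). If you want a particular order, you have to sort the pairs yourself before using them.

Dictionaries are instances/objects of the dict class.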
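The address-book analogy can be sketched as follows. The names and e-mail addresses are invented for the example, and the keys are sorted explicitly so that the output order does not depend on the Python version.

```python
# Hypothetical address book: keys are names, values are contact details.
ab = {
    'Swaroop': 'swaroop@example.com',
    'Larry': 'larry@example.com',
}

ab['Guido'] = 'guido@example.com'    # add a key/value pair
del ab['Larry']                      # delete a key/value pair
swaroop_address = ab['Swaroop']      # look up a value by its key
has_larry = 'Larry' in ab            # test whether a key exists

# Sort the keys explicitly when a particular order is wanted.
for name in sorted(ab):
    print('%s: %s' % (name, ab[name]))

# A mutable object such as a list is rejected as a key.
try:
    bad = {['a', 'list']: 'value'}
    list_key_accepted = True
except TypeError:
    list_key_accepted = False
print(list_key_accepted)
```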

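Sequences

Lists, tuples, and strings are all sequences, but what are sequences and what is so special about them? The two main features of a sequence are the indexing operation and the slicing operation. Indexing lets us fetch a particular item from the sequence directly. Slicing lets us retrieve a slice of the sequence, i.e. a part of it.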

#!/usr/bin/python
# Filename: seq.py

shoplist = ['apple', 'mango', 'carrot', 'banana']

# Indexing or 'Subscription' operation
print 'Item 0 is', shoplist[0]
print 'Item 1 is', shoplist[1]
print 'Item 2 is', shoplist[2]
print 'Item 3 is', shoplist[3]
print 'Item -1 is', shoplist[-1]
print 'Item -2 is', shoplist[-2]

# Slicing on a list
print 'Item 1 to 3 is', shoplist[1:3]
print 'Item 2 to end is', shoplist[2:]
print 'Item 1 to -1 is', shoplist[1:-1]
print 'Item start to end is', shoplist[:]

# Slicing on a string
name = 'swaroop'
print 'characters 1 to 3 is', name[1:3]
print 'characters 2 to end is', name[2:]
print 'characters 1 to -1 is', name[1:-1]
print 'characters start to end is', name[:]

(Source file: code/seq.py)

Output

$ python seq.py
Item 0 is apple
Item 1 is mango
Item 2 is carrot
Item 3 is banana
Item -1 is banana
Item -2 is carrot
Item 1 to 3 is ['mango', 'carrot']
Item 2 to end is ['carrot', 'banana']
Item 1 to -1 is ['mango', 'carrot']
Item start to end is ['apple', 'mango', 'carrot', 'banana']
characters 1 to 3 is wa
characters 2 to end is aroop
characters 1 to -1 is waroo
characters start to end is swaroop

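How it works

First, we see how to use indexes to get individual items of a sequence. This is also referred to as the subscription operation. Whenever you follow a sequence with a number in square brackets, Python fetches the item at that position in the sequence. Remember that Python starts counting from 0. Hence, shoplist[0] fetches the first item and shoplist[3] fetches the fourth item of the shoplist sequence.

The index can also be a negative number, in which case the position is counted from the end of the sequence. Therefore, shoplist[-1] refers to the last item of the sequence and shoplist[-2] fetches the second-to-last item.

The slicing operation is written as the name of the sequence followed by a pair of square brackets containing an optional pair of numbers separated by a colon. Note that this is very similar to the indexing operation you have been using. Remember that the numbers are optional but the colon is not.

The first number (before the colon) in the slicing operation is the position where the slice starts, and the second number (after the colon) is where the slice stops. If the first number is not specified, Python starts at the beginning of the sequence. If the second number is left out, Python stops at the end of the sequence. Note that the returned slice starts at the start position and ends just before the end position, i.e. the start position is included in the slice but the end position is excluded.

Thus, shoplist[1:3] returns a slice of the sequence that starts at position 1, includes position 2, and stops at position 3, so a slice of two items is returned. Similarly, shoplist[:] returns a copy of the whole sequence.

You can also slice with negative positions. Negative numbers denote positions counted from the end of the sequence. For example, shoplist[:-1] returns a slice of the sequence that contains all items except the last one.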
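The slicing rules above can be checked with a few assignments. This is a minimal sketch using the same shoplist as seq.py; the variable names on the left-hand side are chosen only for this illustration.

```python
shoplist = ['apple', 'mango', 'carrot', 'banana']

middle = shoplist[1:3]        # positions 1 and 2; position 3 is excluded
from_start = shoplist[:2]     # omitted start: begin at the first item
to_end = shoplist[2:]         # omitted end: run through the last item
all_but_last = shoplist[:-1]  # negative position counts from the end

# A full slice is a new list, so changing it leaves the original alone.
full_copy = shoplist[:]
full_copy.append('rice')

print(middle)
print(all_but_last)
print(len(shoplist), len(full_copy))
```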

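Try various combinations of slice specifications interactively in the Python interpreter, where you can see the results immediately at the prompt. The great thing about sequences is that you can access tuples, lists, and strings all in the same way.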
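One way to see that all three sequence types are accessed alike is to write a function that only indexes and slices its argument. The helper first_and_last_two below is hypothetical and exists only for this sketch.

```python
def first_and_last_two(seq):
    """Hypothetical helper: return the first item and the last two items."""
    return seq[0], seq[-2:]


from_list = first_and_last_two(['apple', 'mango', 'carrot', 'banana'])
from_tuple = first_and_last_two(('apple', 'mango', 'carrot', 'banana'))
from_string = first_and_last_two('swaroop')

print(from_list)
print(from_tuple)
print(from_string)
```

Note that a slice has the same type as the sequence it was taken from: a list slice is a list, a tuple slice is a tuple, and a string slice is a string.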

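References

When you create an object and assign it to a variable, the variable only refers to the object and does not represent the object itself! That is, the variable name points to the part of your computer's memory where the object is stored. This is called binding the name to the object.

Generally you do not need to worry about this, but references have some subtle effects that you should be aware of. For example, if you assign a list to a second variable, both names refer to the same list, so a change made through one name is visible through the other.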
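A minimal sketch of such an effect, with variable names chosen for illustration: plain assignment binds a second name to the same list, whereas a full slice creates an independent copy.

```python
shoplist = ['apple', 'mango', 'carrot', 'banana']

# Plain assignment: mylist is just another name for the same object.
mylist = shoplist
del shoplist[0]
same_object = mylist is shoplist
print(mylist)            # the deletion is visible through both names

# Full slice: sliced_copy is a new list with the same items.
sliced_copy = shoplist[:]
del sliced_copy[0]
independent = sliced_copy is not shoplist
print(shoplist)
print(sliced_copy)
```

If you want to change a list without affecting the original, make a copy (for example with a full slice) rather than assigning it to a new name.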

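More about strings

We have already discussed strings in detail earlier. What more is there to know? Well, did you know that strings are objects too and therefore have methods? These methods do everything from checking for part of a string to stripping whitespace.

The strings you use in a program are all objects of the str class. Some useful methods of this class are demonstrated in the following example. For a complete list of these methods, see help(str).

String methods

Example 9.7 String methods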

#!/usr/bin/python
# Filename: str_methods.py

name = 'Swaroop' # This is a string object

if name.startswith('Swa'):
    print 'Yes, the string starts with "Swa"'

if 'a' in name:
    print 'Yes, it contains the string "a"'

if name.find('war') != -1:
    print 'Yes, it contains the string "war"'

delimiter = '_*_'
mylist = ['Brazil', 'Russia', 'India', 'China']
print delimiter.join(mylist)

(Source file: code/str_methods.py)

Output

$ python str_methods.py
Yes, the string starts with "Swa"
Yes, it contains the string "a"
Yes, it contains the string "war"
Brazil_*_Russia_*_India_*_China

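How it works

Here, we see several string methods in action. The startswith method tests whether the string starts with the given string. The in operator checks whether a given string is part of another string.

The find method returns the position of the given string within another string, or -1 if the substring cannot be found. The str class also has a neat join method, which joins the items of a sequence using the string as the delimiter between them and returns the resulting larger string.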
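A few more of these methods can be tried in the same way. The sketch below is not part of str_methods.py; it reuses its name, delimiter, and mylist values and adds strip and split, which are standard str methods, with other variable names invented for the example.

```python
name = 'Swaroop'

found_at = name.find('war')      # position of the first match
not_found = name.find('xyz')     # -1 signals "no such substring"

# strip() removes leading and trailing whitespace.
stripped = '  padded text \n'.strip()

# join() and split() are inverses when the delimiter is not in the items.
delimiter = '_*_'
mylist = ['Brazil', 'Russia', 'India', 'China']
joined = delimiter.join(mylist)
parts = joined.split(delimiter)

print(found_at, not_found)
print(repr(stripped))
print(joined)
print(parts == mylist)
```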

For reference, the output of help(str) under Python 2 is reproduced below.

class str(basestring)
| str(object='') -> string
|
| Return a nice string representation of the object.
| If the argument is a string, the return value is the same object.
|
| Method resolution order:
|      str
|      basestring
|      object
|
| Methods defined here:
|
| __add__(...)
|      x.__add__(y) <==> x+y
|
| __contains__(...)
|      x.__contains__(y) <==> y in x
|
| __eq__(...)
|      x.__eq__(y) <==> x==y
|
| __format__(...)
|      S.__format__(format_spec) -> string
|
|      Return a formatted version of S as described by format_spec.
|
| __ge__(...)
|      x.__ge__(y) <==> x>=y
|
| __getattribute__(...)
|      x.__getattribute__('name') <==> x.name
|
| __getitem__(...)
|      x.__getitem__(y) <==> x[y]
|
| __getnewargs__(...)
|
| __getslice__(...)
|      x.__getslice__(i, j) <==> x[i:j]
|
|      Use of negative indices is not supported.
|
| __gt__(...)
|      x.__gt__(y) <==> x>y
|
| __hash__(...)
|      x.__hash__() <==> hash(x)
|
| __le__(...)
|      x.__le__(y) <==> x<=y
|
| __len__(...)
|      x.__len__() <==> len(x)
|
| __lt__(...)
|      x.__lt__(y) <==> x<y
|
| __mod__(...)
|      x.__mod__(y) <==> x%y
|
| __mul__(...)
|      x.__mul__(n) <==> x*n
|
| __ne__(...)
|      x.__ne__(y) <==> x!=y
|
| __repr__(...)
|      x.__repr__() <==> repr(x)
|
| __rmod__(...)
|      x.__rmod__(y) <==> y%x
|
| __rmul__(...)
|      x.__rmul__(n) <==> n*x
|
| __sizeof__(...)
|      S.__sizeof__() -> size of S in memory, in bytes
|
| __str__(...)
|      x.__str__() <==> str(x)
|
| capitalize(...)
|      S.capitalize() -> string
|
|      Return a copy of the string S with only its first character
|      capitalized.
|
| center(...)
|      S.center(width[, fillchar]) -> string
|
|      Return S centered in a string of length width. Padding is
|      done using the specified fill character (default is a space)
|
| count(...)
|      S.count(sub[, start[, end]]) -> int
|
|      Return the number of non-overlapping occurrences of substring sub in
|      string S[start:end]. Optional arguments start and end are interpreted
|      as in slice notation.
|
| decode(...)
|      S.decode([encoding[,errors]]) -> object
|
|      Decodes S using the codec registered for encoding. encoding defaults
|      to the default encoding. errors may be given to set a different error
|      handling scheme. Default is 'strict' meaning that encoding errors raise
|      a UnicodeDecodeError. Other possible values are 'ignore' and 'replace'
|      as well as any other name registered with codecs.register_error that is
|      able to handle UnicodeDecodeErrors.
|
| encode(...)
|      S.encode([encoding[,errors]]) -> object
|
|      Encodes S using the codec registered for encoding. encoding defaults
|      to the default encoding. errors may be given to set a different error
|      handling scheme. Default is 'strict' meaning that encoding errors raise
|      a UnicodeEncodeError. Other possible values are 'ignore', 'replace' and
|      'xmlcharrefreplace' as well as any other name registered with
|      codecs.register_error that is able to handle UnicodeEncodeErrors.
|
| endswith(...)
|      S.endswith(suffix[, start[, end]]) -> bool
|
|      Return True if S ends with the specified suffix, False otherwise.
|      With optional start, test S beginning at that position.
|      With optional end, stop comparing S at that position.
|      suffix can also be a tuple of strings to try.
|
| expandtabs(...)
|      S.expandtabs([tabsize]) -> string
|
|      Return a copy of S where all tab characters are expanded using spaces.
|      If tabsize is not given, a tab size of 8 characters is assumed.
|
| find(...)
|      S.find(sub [,start [,end]]) -> int
|
|      Return the lowest index in S where substring sub is found,
|      such that sub is contained within S[start:end]. Optional
|      arguments start and end are interpreted as in slice notation.
|
|      Return -1 on failure.
|
| format(...)
|      S.format(*args, **kwargs) -> string
|
|      Return a formatted version of S, using substitutions from args and kwargs.
|      The substitutions are identified by braces ('{' and '}').
|
| index(...)
|      S.index(sub [,start [,end]]) -> int
|
|      Like S.find() but raise ValueError when the substring is not found.
|
| isalnum(...)
|      S.isalnum() -> bool
|
|      Return True if all characters in S are alphanumeric
|      and there is at least one character in S, False otherwise.
|
| isalpha(...)
|      S.isalpha() -> bool
|
|      Return True if all characters in S are alphabetic
|      and there is at least one character in S, False otherwise.
|
| isdigit(...)
|      S.isdigit() -> bool
|
|      Return True if all characters in S are digits
|      and there is at least one character in S, False otherwise.
|
| islower(...)
|      S.islower() -> bool
|
|      Return True if all cased characters in S are lowercase and there is
|      at least one cased character in S, False otherwise.
|
| isspace(...)
|      S.isspace() -> bool
|
|      Return True if all characters in S are whitespace
|      and there is at least one character in S, False otherwise.
|
| istitle(...)
|      S.istitle() -> bool
|
|      Return True if S is a titlecased string and there is at least one
|      character in S, i.e. uppercase characters may only follow uncased
|      characters and lowercase characters only cased ones. Return False
|      otherwise.
|
| isupper(...)
|      S.isupper() -> bool
|
|      Return True if all cased characters in S are uppercase and there is
|      at least one cased character in S, False otherwise.
|
| join(...)
|      S.join(iterable) -> string
|
|      Return a string which is the concatenation of the strings in the
|      iterable. The separator between elements is S.
|
| ljust(...)
|      S.ljust(width[, fillchar]) -> string
|
|      Return S left-justified in a string of length width. Padding is
|      done using the specified fill character (default is a space).
|
| lower(...)
|      S.lower() -> string
|
|      Return a copy of the string S converted to lowercase.
|
| lstrip(...)
|      S.lstrip([chars]) -> string or unicode
|
|      Return a copy of the string S with leading whitespace removed.
|      If chars is given and not None, remove characters in chars instead.
|      If chars is unicode, S will be converted to unicode before stripping
|
| partition(...)
|      S.partition(sep) -> (head, sep, tail)
|
|      Search for the separator sep in S, and return the part before it,
|      the separator itself, and the part after it. If the separator is not
|      found, return S and two empty strings.
|
| replace(...)
|      S.replace(old, new[, count]) -> string
|
|      Return a copy of string S with all occurrences of substring
|      old replaced by new. If the optional argument count is
|      given, only the first count occurrences are replaced.
|
| rfind(...)
|      S.rfind(sub [,start [,end]]) -> int
|
|      Return the highest index in S where substring sub is found,
|      such that sub is contained within S[start:end]. Optional
|      arguments start and end are interpreted as in slice notation.
|
|      Return -1 on failure.
|
| rindex(...)
|      S.rindex(sub [,start [,end]]) -> int
|
|      Like S.rfind() but raise ValueError when the substring is not found.
|
| rjust(...)
|      S.rjust(width[, fillchar]) -> string
|
|      Return S right-justified in a string of length width. Padding is
|      done using the specified fill character (default is a space)
|
| rpartition(...)
|      S.rpartition(sep) -> (head, sep, tail)
|
|      Search for the separator sep in S, starting at the end of S, and return
|      the part before it, the separator itself, and the part after it. If the
|      separator is not found, return two empty strings and S.
|
| rsplit(...)
|      S.rsplit([sep [,maxsplit]]) -> list of strings
|
|      Return a list of the words in the string S, using sep as the
|      delimiter string, starting at the end of the string and working
|      to the front. If maxsplit is given, at most maxsplit splits are
|      done. If sep is not specified or is None, any whitespace string
|      is a separator.
|
| rstrip(...)
|      S.rstrip([chars]) -> string or unicode
|
|      Return a copy of the string S with trailing whitespace removed.
|      If chars is given and not None, remove characters in chars instead.
|      If chars is unicode, S will be converted to unicode before stripping
|
| split(...)
|      S.split([sep [,maxsplit]]) -> list of strings
|
|      Return a list of the words in the string S, using sep as the
|      delimiter string. If maxsplit is given, at most maxsplit
|      splits are done. If sep is not specified or is None, any
|      whitespace string is a separator and empty strings are removed
|      from the result.
|
| splitlines(...)
|      S.splitlines(keepends=False) -> list of strings
|
|      Return a list of the lines in S, breaking at line boundaries.
|      Line breaks are not included in the resulting list unless keepends
|      is given and true.
|
| startswith(...)
|      S.startswith(prefix[, start[, end]]) -> bool
|
|      Return True if S starts with the specified prefix, False otherwise.
|      With optional start, test S beginning at that position.
|      With optional end, stop comparing S at that position.
|      prefix can also be a tuple of strings to try.
|
| strip(...)
|      S.strip([chars]) -> string or unicode
|
|      Return a copy of the string S with leading and trailing
|      whitespace removed.
|      If chars is given and not None, remove characters in chars instead.
|      If chars is unicode, S will be converted to unicode before stripping
|
| swapcase(...)
|      S.swapcase() -> string
|
|      Return a copy of the string S with uppercase characters
|      converted to lowercase and vice versa.
|
| title(...)
|      S.title() -> string
|
|      Return a titlecased version of S, i.e. words start with uppercase
|      characters, all remaining cased characters have lowercase.
|
| translate(...)
|      S.translate(table [,deletechars]) -> string
|
|      Return a copy of the string S, where all characters occurring
|      in the optional argument deletechars are removed, and the
|      remaining characters have been mapped through the given
|      translation table, which must be a string of length 256 or None.
|      If the table argument is None, no translation is applied and
|      the operation simply removes the characters in deletechars.
|
| upper(...)
|      S.upper() -> string
|
|      Return a copy of the string S converted to uppercase.
|
| zfill(...)
|      S.zfill(width) -> string
|
|      Pad a numeric string S with zeros on the left, to fill a field
|      of the specified width. The string S is never truncated.
|
| ----------------------------------------------------------------------
| Data and other attributes defined here:
|
| __new__ = <built-in method __new__ of type object>
|      T.__new__(S, ...) -> a new object with type S, a subtype of T