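urlparse and urlsplit
urlparse and urlsplit are generally used to analyze the structure of a web page URL, so that its components (scheme, domain, path, query string, and so on) can be extracted quickly.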
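Difference
The difference between urlparse and urlsplit is that urlsplit does not split out the params attribute: urlparse returns a 6-item ParseResult that includes params, while urlsplit returns a 5-item SplitResult and leaves any ";parameters" inside path.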
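The difference is easiest to see with a URL that actually carries path parameters. The following is a minimal sketch; the URL (www.example.com with a ";type=a" suffix) is a hypothetical one chosen for illustration and does not come from the examples in this post.

```python
from urllib.parse import urlparse, urlsplit

# Hypothetical URL: the last path segment carries ";type=a" parameters.
url = 'http://www.example.com/path/page;type=a?wd=python#1'

parsed = urlparse(url)  # ParseResult, 6 fields
split = urlsplit(url)   # SplitResult, 5 fields

# urlparse moves the text after ";" into params.
print('urlparse path:  ', parsed.path)
print('urlparse params:', parsed.params)

# urlsplit leaves the ";type=a" part inside path.
print('urlsplit path:  ', split.path)

# The two results differ in length and in whether params exists.
print(len(parsed), len(split))
print(hasattr(split, 'params'))
```

For URLs without a ";" in the last path segment, both functions return the same scheme, netloc, path, query, and fragment values.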
urlparse
from urllib import parse
url = 'http://www.baidu.com/s=wd=python&username=abc#1'
result = parse.urlparse(url)
print(result)
# Print each component separately
print('scheme:',result.scheme)
print('netloc:',result.netloc)
print('path:',result.path)
print('params:',result.params)
print('query:',result.query)
print('fragment:',result.fragment)
result:
ParseResult(scheme='http', netloc='www.baidu.com', path='/s=wd=python&username=abc', params='', query='', fragment='1')
scheme: http
netloc: www.baidu.com
path: /s=wd=python&username=abc
params:
query:
fragment: 1
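Note that query is empty in the output above because the URL contains no "?", so everything after the host is treated as path. As a sketch of what a populated query looks like, the variant below assumes the intended URL form was "/s?wd=..." (this is an assumption, not something the original states) and uses parse_qs from the same module to turn the query string into a dictionary.

```python
from urllib.parse import urlparse, parse_qs

# Assumed variant of the URL above, with "?" introducing the query string.
url = 'http://www.baidu.com/s?wd=python&username=abc#1'
result = urlparse(url)

print('path:', result.path)    # the part before "?"
print('query:', result.query)  # the part between "?" and "#"

# parse_qs maps each key to a list of values, since keys may repeat.
query_dict = parse_qs(result.query)
print(query_dict)
```

Each value in the dictionary is a list, so a single value is read as query_dict['wd'][0].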
urlsplit
from urllib import parse
url = 'http://www.baidu.com/s=wd=python&username=abc#1'
result = parse.urlsplit(url)
print(result)
# Print each component separately
print('scheme:',result.scheme)
print('netloc:',result.netloc)
print('path:',result.path)
# SplitResult has no params attribute
#print('params:',result.params)
print('query:',result.query)
print('fragment:',result.fragment)
result:
SplitResult(scheme='http', netloc='www.baidu.com', path='/s=wd=python&username=abc', query='', fragment='1')
scheme: http
netloc: www.baidu.com
path: /s=wd=python&username=abc
query:
fragment: 1
Shortcut:
Run the Python file: Ctrl+F5