Python urlparse总结

res = urlparse.urlparse(url,scheme,allow_fragments)
返回一个6-tuple,类型是ParseResult(scheme, netloc, path, params, query, fragment)
ParseResult类还有几个常用方法:
res.username
res.password
res.hostname
res.port
res.geturl()


urlparse.urlunparse(data)
返回一个string
data必须是six-item iterable


res = urlparse.urlsplit(url,scheme,allow_fragments)
返回一个5-tuple,类型是.SplitResult(scheme, netloc, path, query, fragment)
这里的path相当于urlparse的path+params,具体见例子


urlparse.urlunsplit(data)
返回一个string
data必须是five-item iterable


urlparse.urljoin(base, url, allow_fragments)

这个函数比较复杂,不同的数据得出的结果大不一样,而且容易出现错误,不建议用这个函数,详见下面几个例子

'''
Created on 2013-4-15

@author: xkey
'''
import urlparse

url = "https://www.google.com.hk:8080/home/search;12432?newwi.1.9.serpuc#1234"

r = urlparse.urlparse(url)
print r
print r.port,r.hostname
print r.geturl()
r = urlparse.urlsplit(url)
print r
parts = ["http","www.facebook.com","/home/email","132","parts","md5=?"]
print urlparse.urlunparse(parts)

print urlparse.urlunsplit(parts[0:5])
base = "http://baidu.com/home"
url = "index.html"
print urlparse.urljoin(base, url)
base = "http://baidu.com/home/action.jsp"
url = "index.html"
print urlparse.urljoin(base, url)
base = "http://baidu.com/home/action.jsp"
url = "/index.html"
print urlparse.urljoin(base, url)
base = "http://baidu.com/home/action.jsp"
url = "../../index.html"
print urlparse.urljoin(base, url)

输出结果:

ParseResult(scheme='https', netloc='www.google.com.hk:8080', path='/home/search', params='12432', query='newwi.1.9.serpuc', fragment='1234')
8080 www.google.com.hk
https://www.google.com.hk:8080/home/search;12432?newwi.1.9.serpuc#1234
SplitResult(scheme='https', netloc='www.google.com.hk:8080', path='/home/search;12432', query='newwi.1.9.serpuc', fragment='1234')
http://www.facebook.com/home/email;132?parts#md5=?
http://www.facebook.com/home/email?132#parts
http://baidu.com/index.html
http://baidu.com/home/index.html
http://baidu.com/index.html
http://baidu.com/../index.html

参照官方文档2.7.4: http://docs.python.org/2/library/urlparse.html

  • 13
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值