辉火_-CSDN博客

原创 python学习笔记（五）---替换函数

python学习笔记（五）—替换函数replace()函数替换内容用法：replace(‘需要替换的内容’,‘替换后的内容’,替换次数)备：如果需要替换单引号需要加’\’例子：str = "aaaaaa";print str.replace("a", "b");print str.replace("a", "b", 3);输出结果：bbbbbbbbbaaastrip()函数去除头尾指定内容 strip()去除头尾空格例子：str = ' a 'str = st

2022-01-17 20:39:37 1357

原创 python学习笔记（四）---python获取json内容

python学习笔记（四）—python获取json内容使用json模块import jsonjson.load()用于读取文件json.loads()用于读取字符串content = urllib.request.urlopen(url).read() result = json.loads(content)#读取网页json内容获取key为data的值data = result['data']或者使用values()获取值values = result.values()

2022-01-17 20:20:26 439

原创 python学习笔记（三）---python爬取网页指定内容

python学习笔记（三）—python爬取网页指定内容1、利用正则匹配爬取指定内容，例如标题正则表达式：<title>(.*?)</title>req = urllib.request.Request(url=url,headers=headers)content = urllib.request.urlopen(req).read() content = content.decode('utf-8') title = re.findall(r'<t

2022-01-17 20:10:33 6574

原创 python学习笔记（二）---python爬取网页源代码

python学习笔记（二）—python爬取网页源代码使用模块urllib#coding:utf-8import urllib.request请求url，获取网页源代码def getHtml(url): h = urllib.request.urlopen(url).read() return h保存文档def saveHtml(file_name,file_content): with open (file_name,"wb") as f:

2022-01-17 19:06:01 2106

原创 python学习笔记（一）---python爬虫起步

python学习笔记（一）—python爬虫起步python爬虫起步urllib模块import urllib.request 获取urlcontent = urllib.request.urlopen(url).read()#获取网页content = content.decode('utf-8')print(content)#设置用户代理（爬取一些需要登陆的网站时）headers = { 'Accept-Language':'zh-Hans-CN, zh-Hans; q=0

2022-01-17 18:57:09 342

原创信息收集

信息收集1.whois 信息什么是whois?whois 指的是域名注册时留下的信息，比如留下管理员的名字、电话号码、邮箱为什么要收集whois?域名注册人可能就是网站管理员，可以尝试社工、套路，查询是不是注册了其他域名扩大攻击范围在线查询whois链接 http://whois.chinaz.com/2.子域名什么是子域名?顶级域名下的二级域名或者三级甚至更多级的域名都属于子域...

2019-10-15 15:21:53 156

qq_41838340的博客

原创 python学习笔记（五）---替换函数

原创 python学习笔记（四）---python获取json内容

原创 python学习笔记（三）---python爬取网页指定内容

原创 python学习笔记（二）---python爬取网页源代码

原创 python学习笔记（一）---python爬虫起步

原创信息收集

空空如也

空空如也

原创 python学习笔记（五）---替换函数

原创 python学习笔记（四）---python获取json内容

原创 python学习笔记（三）---python爬取网页指定内容

原创 python学习笔记（二）---python爬取网页源代码

原创 python学习笔记（一）---python爬虫起步

原创 信息收集

空空如也

空空如也

原创信息收集