bs4
Arthur54271
Life is short, I use Python
python3-bs4~Beautifulsoup
from urllib import request
from bs4 import BeautifulSoup

# (1) Fetch the page content
base_url = "http://langlang2017.com/route.html"
response = request.urlopen(base_url)
html = response.read()  # raw bytes, i.e. unformatted content
# html =...

Original · 2018-05-11 14:15:03 · 754 views · 0 comments
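The excerpt above stops right after reading the raw bytes. As a minimal sketch of what a next step could look like, the snippet below decodes bytes and hands them to BeautifulSoup. It works on an inline sample so it runs offline; `SAMPLE_HTML`, `extract_links`, `page_title`, and the UTF-8 encoding are assumptions made for illustration, not the original post's code or the real page's markup.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the bytes returned by response.read();
# the real page's structure is not shown in the excerpt.
SAMPLE_HTML = b"""<html><head><title>Routes</title></head>
<body><ul>
<li><a href="/a.html">Route A</a></li>
<li><a href="/b.html">Route B</a></li>
</ul></body></html>"""


def extract_links(html_bytes, encoding="utf-8"):
    """Decode raw bytes and return (link text, href) pairs."""
    text = html_bytes.decode(encoding)  # assumed encoding
    soup = BeautifulSoup(text, "html.parser")
    return [
        (a.get_text(strip=True), a["href"])
        for a in soup.find_all("a", href=True)
    ]


def page_title(html_bytes, encoding="utf-8"):
    """Return the text of the <title> tag, or None if it is missing."""
    soup = BeautifulSoup(html_bytes.decode(encoding), "html.parser")
    return soup.title.string if soup.title else None


if __name__ == "__main__":
    print(page_title(SAMPLE_HTML))
    for label, href in extract_links(SAMPLE_HTML):
        print(label, href)
```

With a live page, the same functions could be called on the result of `response.read()`, provided the page really is UTF-8 encoded.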
Python3~Scraping words and definitions from a translation web page
from urllib import request
from bs4 import BeautifulSoup
import ssl
ssl._create_default_https_context = ssl._create_unverified_context

# 1. Request the page over the network
base_url = "https://www.shanbay.com/wordlist/110521/232...

Original · 2018-05-11 16:53:50 · 1287 views · 0 comments
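The excerpt is cut off before any parsing, so the page's real markup is unknown. The sketch below only illustrates the general idea of pairing each word with its definition using bs4. The table layout, the class names `row`, `word`, and `meaning`, and the helper `extract_words` are all hypothetical.

```python
from bs4 import BeautifulSoup

# Hypothetical markup; the real word-list page may be structured differently.
SAMPLE_HTML = """
<table>
  <tr class="row"><td class="word">apple</td><td class="meaning">n. a round fruit</td></tr>
  <tr class="row"><td class="word">run</td><td class="meaning">v. to move fast</td></tr>
  <tr class="row"><td class="word">orphan</td></tr>
</table>
"""


def extract_words(html):
    """Return (word, meaning) pairs, skipping rows that lack either cell."""
    soup = BeautifulSoup(html, "html.parser")
    pairs = []
    for row in soup.find_all("tr", class_="row"):
        word = row.find("td", class_="word")
        meaning = row.find("td", class_="meaning")
        if word is None or meaning is None:
            continue  # incomplete row, nothing to pair
        pairs.append((word.get_text(strip=True), meaning.get_text(strip=True)))
    return pairs


if __name__ == "__main__":
    for word, meaning in extract_words(SAMPLE_HTML):
        print(word, "->", meaning)
```

Skipping incomplete rows keeps one malformed entry from aborting the whole scrape. Whether that is the right policy depends on how the results are used.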
Python3-selenium\phantomjs\bs4: scraping Douyu pages
from selenium import webdriver
import time
from bs4 import BeautifulSoup

class douyuSelenium():
    # Initialize: start the browser used for Douyu
    def setup(self):
        self.driver = webdriver.PhantomJS()

    # Get Douyu room information
    def ...

Original · 2018-05-19 12:59:23 · 277 views · 0 comments