![](https://img-blog.csdnimg.cn/20201014180756724.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
python爬虫
Jacklucicc
这个作者很懒,什么都没留下…
展开
-
python爬虫2_xpath
lxml xpath from lxml import etree from lxml import etree text = """ <!--hello.html--> <div> <ul> <li class="item-0"><a href="link1.html">first item</a></li> <li class="item-1"><a hre原创 2021-03-02 11:46:36 · 73 阅读 · 0 评论 -
python爬虫1
bf4.BeautifulSoup 实践 #test import requests from bs4 import BeautifulSoup import bs4 import numpy as np def getHtmlText(url): headers = { "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/88原创 2021-03-01 14:46:17 · 123 阅读 · 2 评论