A detail to watch for in Python web scraping
While learning Python web scraping with BeautifulSoup, I ran into a small problem of my own:
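from bs4 import BeautifulSoup

html = "<p><span class='bjh-p'><span class='bjh-strong'>"
"this is a good man</span></span></p>"  # a separate statement, not part of html
soup = BeautifulSoup(html, "lxml")
print(soup.p.prettify())
print(soup.p.span.string)  # prints None

Written this way, soup.p.span.string prints None. The second string literal sits on its own line with nothing joining it to the first, so Python evaluates it as a separate expression statement and discards it; html holds only the opening tags and contains no text at all. After I changed the code as follows, the string inside the tags is printed:

html = "<p><span class='bjh-p'><span class='bjh-strong'>" \
"this is a good man</span></span></p>"  # the backslash joins both lines into one statement
soup = BeautifulSoup(html, "lxml")
print(soup.p.prettify())
print(soup.p.span.string)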
Or like this, with both literals on the same line:
html = "<p><span class='bjh-p'><span class='bjh-strong'>this is a good " "man</span></span></p>"
soup = BeautifulSoup(html, "lxml")
print(soup.p.prettify())
print(soup.p.span.string)
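Both of the latter two versions print the string, because in each of them Python joins the adjacent string literals into a single string before it is assigned to html.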
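
The difference can be checked without BeautifulSoup at all by looking at what each variable actually contains. The following is only a minimal sketch: the variable names html_broken, html_backslash and html_parens are made up for illustration, and the parenthesized form is an extra alternative that does not appear in my code above (PEP 8 generally prefers it over a trailing backslash).

```python
# Case 1: no continuation. The second literal is its own expression
# statement; Python evaluates it and throws the result away.
html_broken = "<p><span class='bjh-p'><span class='bjh-strong'>"
"this is a good man</span></span></p>"

# Case 2: a trailing backslash joins the two physical lines, so the
# adjacent literals are concatenated into one string.
html_backslash = "<p><span class='bjh-p'><span class='bjh-strong'>" \
    "this is a good man</span></span></p>"

# Case 3 (assumed alternative): parentheses allow the statement to span
# several lines, and the adjacent literals are concatenated the same way.
html_parens = (
    "<p><span class='bjh-p'><span class='bjh-strong'>"
    "this is a good man</span></span></p>"
)

print("this is a good man" in html_broken)     # False: the text never reached html_broken
print("this is a good man" in html_backslash)  # True
print(html_backslash == html_parens)           # True: both forms build the same string
```

Since html_broken has no text node, there is nothing for soup.p.span.string to return, which matches the None seen in the first version.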