这是来自request.get(url).text的响应
Jumia Central AuthenticationThis is your fallback content in case JavaScript fails to load.我想解析此变量(window.csrfToken)
解决方案
soup = BeautifulSoup(html, 'html.parser')
script = soup.findAll("script")[2].text
print(script.split("'")[1])
输出:
g9dlVS6D-8KFzc_wWQrlJIXe4NfSKPMzKEQc
要么
script = [item.text for item in soup.findAll(
"script") if "window.csrfToken" in item.text]
print(script[0].split("'")[1])
输出:
g9dlVS6D-8KFzc_wWQrlJIXe4NfSKPMzKEQc
或使用re,假设是,HTML则string可以使用以下命令将其转换为字符串str(html):
import re
target = re.search(r"window.csrfToken = '(.+)'", html).group(1)
print(target)
输出:
g9dlVS6D-8KFzc_wWQrlJIXe4NfSKPMzKEQc