Python3 爬虫--- 问卷星内容爬取

最新推荐文章于 2022-09-07 21:20:06 发布

wozaiyizhideng

最新推荐文章于 2022-09-07 21:20:06 发布

阅读量6.1k

点赞数 3

分类专栏： python3 爬虫

本文链接：https://blog.csdn.net/wozaiyizhideng/article/details/106485259

版权

本文介绍如何使用Python3爬取问卷星上的内容，以解决无法直接复制笔试题目的问题。首先通过pip3安装requests_html库，然后提供目标链接：https://ks.wjx.top/jq/55123312.aspx，演示爬取过程。

摘要由CSDN通过智能技术生成

今天面试有个问卷星的笔试题，但是无法复制题目内容。

所以爬取一下。

pip3 install requests_html

目标链接：https://ks.wjx.top/jq/55123312.aspx

import time
from requests_html import HTMLSession

wenjuanxing_ID = 55123312
wenjuanxing_URL = "https://ks.wjx.top/jq/{}.aspx".format(wenjuanxing_ID)


def parse_post_data(resp):
    '''
    解析出问题和选项
    '''
    questions = resp.html.find('fieldset', first=True).find('.div_question')

    for i, q in enumerate(questions):
        title = q.find('.div_title_question_all', first=True).text
        choices = [t.text for t in q.find('label')]
        print(title)
        for choice in choices:
            print(choice)
        print(

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

wozaiyizhideng

关注关注

3
点赞
踩
33

收藏

觉得还不错? 一键收藏
0
评论
Python3 爬虫--- 问卷星内容爬取

今天面试有个问卷星的笔试题，但是无法复制题目内容。所以爬取一下。import timefrom requests_html import HTMLSessionwenjuanxing_ID = idwenjuanxing_URL = "https://ks.wjx.top/jq/{}.aspx".format(wenjuanxing_ID)def parse_post_data(resp): ''' 解析出问题和选项 ''' questions .
复制链接

扫一扫