Scraping Web Table Data with Python and Writing It to Excel (No. 7)

小雨喳

Last modified: 2022-09-02 09:30:58

Category: Python Study Series. Tags: python, programming languages

First published: 2019-02-21 12:36:21

Article link: https://blog.csdn.net/m0_38004619/article/details/87858738

Scraping web table data with Python and writing it to Excel

import requests
from bs4 import BeautifulSoup  # the 'lxml' parser used below also needs the lxml package installed
import xlwt
# Request headers that imitate a Chrome browser
headers = {
    'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.100 Safari/537.36'
}
def get_data():
    response = requests.get('http://www.hs-bianma.com/hs_chapter_01.htm', headers=headers, timeout=10)
    response.raise_for_status()  # stop early if the server returns an HTTP error status
    bs = BeautifulSoup(response.text,'lxml')

    # Header cells
    title = bs.find_all('th')
    data_list_title = []  # text of every <th> cell
    for data in title:
        data_list_title.append(data.text.strip())  # take the tag text and strip surrounding whitespace

    # Body cells
    content = bs.find_all('td')
    data_list_content = []  # text of every <td> cell, as one flat list
    for data in content:
        data_list_content.append(data.text.strip())  # take the tag text and strip surrounding whitespace
    # Split the flat cell list into rows with a list comprehension.
    # The step of 16 assumes every row of the target table has exactly 16 cells.
    new_list = [data_list_content[i:i + 16] for i in range(0, len(data_list_content), 16)]

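    # Write to an Excel workbook
    book = xlwt.Workbook()
    sheet1 = book.add_sheet('sheet1', cell_overwrite_ok=True)

    # Write the header row
    heads = data_list_title[:]  # shallow copy of the whole header list
    ii = 0
    for head in heads:
        sheet1.write(0, ii, head)
        ii += 1

    # Write the data rows, starting at row 1 (row 0 holds the headers)
    i = 1
    for row in new_list:  # renamed from `list` so the builtin is not shadowed
        j = 0
        for data in row:
            sheet1.write(i, j, data)
            j += 1
        i += 1
    # Save the file
    book.save('./data.xls')

# Run the scraper first, then report completion
get_data()
print("All done")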
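
The script above hard-codes 16 cells per row and tracks row and column positions with manual counters. As a minimal sketch of an alternative, the row width could be taken from the number of header cells and the positions generated with `enumerate`. The helper names `chunk_rows` and `iter_cells` and the sample data are hypothetical and do not come from the original script. Taking the width from the header also assumes the page has a single table with one `th` per column and no merged cells.

```python
def chunk_rows(cells, width):
    """Split a flat list of cell values into rows of `width` cells each."""
    if width <= 0:
        raise ValueError("width must be a positive integer")
    return [cells[i:i + width] for i in range(0, len(cells), width)]


def iter_cells(header, rows):
    """Yield (row_index, col_index, value) triples, with the header at row 0."""
    for r, row in enumerate([header] + rows):
        for c, value in enumerate(row):
            yield r, c, value


# Made-up sample data standing in for the scraped <th> and <td> texts.
header = ["code", "name", "unit"]
cells = ["0101", "horses", "head", "0102", "cattle", "head"]

rows = chunk_rows(cells, len(header))
triples = list(iter_cells(header, rows))
print(rows)
```

Each triple could then be passed straight to `sheet1.write(r, c, value)`, which would replace the `ii`, `i`, and `j` counters in the script.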

If you have questions, follow the WeChat official account 【运维开发实战】 and the author will reply promptly.
