python爬取电影

最新推荐文章于 2025-05-27 20:07:06 发布

孺子牛 for world

最新推荐文章于 2025-05-27 20:07:06 发布

阅读量1.9k

点赞数 4

文章标签： python 开发语言

本文链接：https://blog.csdn.net/weixin_45570158/article/details/138148082

版权

这是一个简单的Python代码示例，使用requests和BeautifulSoup库来爬取电影信息。这个示例将从一个电影网站（比如IMDb）上抓取电影的标题。请注意，这个代码只是一个示例，并且网站的结构可能会发生变化，导致代码不再有效。此外，频繁的请求可能会对网站造成负担，甚至可能违反其服务条款。在实际使用中，请确保遵守网站的robots.txt文件和使用条款。

import requests  
from bs4 import BeautifulSoup  
  
def fetch_movie_info(url):  
    # 发送GET请求  
    response = requests.get(url)  
  
    # 检查请求是否成功  
    if response.status_code != 200:  
        print(f"Failed to retrieve the webpage. Status code: {response.status_code}")  
        return None  
  
    # 使用BeautifulSoup解析HTML  
    soup = BeautifulSoup(response.text, 'html.parser')  
  
    # 查找电影标题。这取决于网站的具体结构。这里只是一个示例。  
    movie_titles = soup.find_all('h2', class_='title')  # 假设电影标题在class为'title'的h2标签中  
  
    # 存储电影标题  
    movies = []  
    for title in movie_titles:  
        movies.append(title.text)