Below is a simple code example that shows how to write a tool in Python for collecting business phone numbers from a map listing page:
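    import requests
    from bs4 import BeautifulSoup


    def get_text(parent, tag, class_name=None):
        """Return the stripped text of the first matching child, or '' if it is missing."""
        node = parent.find(tag, class_=class_name) if class_name else parent.find(tag)
        return node.get_text(strip=True) if node else ''


    def get_business_info():
        # Placeholder URL; replace it with the real listing page.
        url = 'https://www.example.com/business'
        headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.88 Safari/537.36'}
        response = requests.get(url, headers=headers, timeout=10)
        response.raise_for_status()  # stop early on HTTP errors such as 404 or 500
        soup = BeautifulSoup(response.text, 'html.parser')
        # Each business is expected inside a <div class="business-info"> element.
        business_list = soup.find_all('div', class_='business-info')
        results = []
        for business in business_list:
            name = get_text(business, 'h2')
            phone = get_text(business, 'span', 'phone')
            address = get_text(business, 'span', 'address')
            results.append({'name': name, 'phone': phone, 'address': address})
        return results


    if __name__ == '__main__':
        business_info = get_business_info()
        for info in business_info:
            print(info)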
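This example uses Python's requests library to send the HTTP request and the BeautifulSoup library to parse the HTML page. First, set the URL of the page to scrape and the request headers. Next, send an HTTP GET request with requests and read the response body. Then parse the HTML with BeautifulSoup and locate the elements that hold the business information. Finally, extract each business's name, phone number, and address, and store them in a list of dictionaries. The tag and class names used here (`business-info`, `phone`, `address`) are specific to the example page, so adjust them to match the markup of the site you are working with, and modify or extend the code to suit your needs.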
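As one possible extension, the collected list can be saved to a CSV file rather than only printed. The following is a minimal sketch using only the standard library. The names `FIELDS` and `write_csv` are hypothetical and not part of the example above; the sketch assumes each record is a dictionary with `name`, `phone`, and `address` keys, as produced by `get_business_info()`.

```python
import csv
import io

# Column order for the output file (assumed to match the keys used above).
FIELDS = ['name', 'phone', 'address']


def write_csv(records, stream):
    """Write a list of business dicts to an open text stream as CSV."""
    writer = csv.DictWriter(stream, fieldnames=FIELDS, extrasaction='ignore')
    writer.writeheader()
    for record in records:
        # Fill any missing field with an empty string so every row has all columns.
        writer.writerow({field: record.get(field, '') for field in FIELDS})


if __name__ == '__main__':
    # Made-up sample records, standing in for the scraper's return value.
    sample = [
        {'name': 'Cafe A', 'phone': '555-0100', 'address': '1 Main St'},
        {'name': 'Shop B', 'phone': '', 'address': '2 Side Rd'},
    ]
    buffer = io.StringIO()
    write_csv(sample, buffer)
    print(buffer.getvalue())
```

To write to disk instead of an in-memory buffer, open the target with `open('businesses.csv', 'w', newline='', encoding='utf-8')` and pass the file object as `stream`; the file name here is only an illustration.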