之前学习swing时写个一个简单的汇率转换器,之前又学习了python,于是想用python写个爬虫,把数据给java展示出来,媒介采用jython,将python解析的结果保存到txt文档中,java读取文档 ,如果不用jython就得手动先运行一遍python,就没意思了,等于只用运行java程序,每一次运行时通过jython自动调用java程序.
python程序:(爬取数据,正则匹配,结果保存到e:1.txt)
import urllib
import urllib2
import re
import sys
reload(sys)
sys.setdefaultencoding( "utf-8" )
url="http://www.boc.cn/sourcedb/whpj/";
spath="E://1.txt"
f=open(spath,"w")
user_agent = "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/46.0.2490.80 Safari/537.36"
headers = { 'User-Agent' : user_agent}
request=urllib2.Request(url,None,headers);
response =urllib2.urlopen(request)
content=response.read().decode("utf-8")
#f.write(content)
result = re.findall('(?<=<td>).+?(?=</td>)',content)
f.write(result[178]