网页中抓取数据

下面写个例子,实现从网页中抓取数据。

这个例子中,只是从网页中获取了数据,但是没有进行任何处理,只是将数据保存到一个txt文件中。

该例子是在android工程中写的。

package com.example.creepertest;


import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.net.HttpURLConnection;
import java.net.MalformedURLException;
import java.net.URL;


public class Controller
{
public static final String SD_LOTTERY_URL = 
"http://www.sdticai.com/zjhfx/cpmain.asp?cptype=115";

public static final String SD_LOTTERY_FILE_PATH = "/data/data/com.example.creepertest/test.txt";

private BufferedWriter mBufferedWriter = null;
private BufferedReader mBufferedReader = null;

public Controller(){
Runnable runnable = new Runnable(){
@Override
public void run() {
captureHtml();
}
};
Thread thread = new Thread(runnable);
thread.start();

}

private void captureHtml(){
try{
URL sdLotterUrl = new URL(SD_LOTTERY_URL);
HttpURLConnection httpConn = 
(HttpURLConnection) sdLotterUrl.openConnection();


InputStreamReader inputStreamReader = 
new InputStreamReader(httpConn.getInputStream(), "utf-8");
mBufferedReader = new BufferedReader(inputStreamReader);  
 
OutputStream outputStream = 
new FileOutputStream(SD_LOTTERY_FILE_PATH,true);
OutputStreamWriter outputStreamWriter = new OutputStreamWriter(outputStream);
mBufferedWriter = new BufferedWriter(outputStreamWriter); 
 
String lineStr = null;
while(true){
lineStr = mBufferedReader.readLine();
if(lineStr != null){
mBufferedWriter.write(lineStr);
mBufferedWriter.newLine();
mBufferedWriter.flush();
}
else
break;
}
 
}
catch (MalformedURLException e){
e.printStackTrace();
}
catch (IOException e){
e.printStackTrace();
}

finally{
try {
if(mBufferedWriter != null)
mBufferedWriter.close();

if(mBufferedReader != null)
mBufferedReader.close();
} catch (Exception exception){
exception.printStackTrace();
}
   }
}
}

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值