//这个例子.给我的感觉还是有一点不好..希望同行看到.能给一些更高,,更有效率的搜索引擎
//以下是我自己写的.
package com.newer.seacha;
import java.io.BufferedReader;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.net.URL;
import java.net.URLConnection;
import java.util.regex.*;
public class Test {
URL url = new URL(http://www.iteye.com);
URLConnection urlConnection = url.openConnection();
InputStream inn = urlConnection.getInputStream();
BufferedReader in = new BufferedReader(
new InputStreamReader(inn));
StringBuffer str = new StringBuffer();
//给一个暖冲区2048
char[] arrayBytes = new char[2048];
int i = 0;
while((i = in.read(arrayBytes,0,2048)) != -1) {
String str1 = new String(arrayBytes,0,i);
str.append(str1);
}
// 正则表达式载取
Pattern pattern = Pattern.compile("<\\s*a\\s+href\\s*=\".*\"\\s*>.*<\\s*/\\s*a\\s*>");
Matcher m = pattern.matcher(str);
int i1 = 0;
while(m.find()) {
//输出控制台
System.out.println(" " + (i1++) + " " + m.group());
}
System.out.println(i1);
}
}