1、获取内容中的第一张img
//匹配img正则表达式
private static Pattern p_img = Pattern.compile("(<img[^>]+src\\s*=\\s*['\"]([^'\"]+)['\"][^>]*>)");
Matcher m_img = p_img.matcher(“内容”);
while (m_img.find()) {
//为获得整个img标签
//m_img.group(1)
//为获得src的值
String img = m_img.group(2);
}
2、获取内容中所有的img
//传入需要查找img的内容 将取得的src放入集合中
public List<String> getImgSrc(String htmlStr) {
String img = "";
Pattern p_image;
Matcher m_image;
Matcher m;
List<String> pics = new ArrayList<String>();
String regEx_img = "<img.*src\\s*=\\s*(.*?)[^>]*?>";
p_image = Pattern.compile(regEx_img, Pattern.CASE_INSENSITIVE);
m_image = p_image.matcher(htmlStr);
while (m_image.find()) {
img = img + "," + m_image.group();
m = Pattern.compile("src\\s*=\\s*\"?(.*?)(\"|>|\\s+)").matcher(img);
while (m.find()) {
pics.add(m.group(2));
}
}
return pics;
}
px:项目中用到了,怕自己忘记,所以在此记录一下,如果也能对大家有帮助的话,那最好不过了