java解析html中的img标签，并且取得所有图片地址

最新推荐文章于 2022-03-24 17:01:05 发布

iteye_5109

最新推荐文章于 2022-03-24 17:01:05 发布

阅读量1.4k

点赞数 1

分类专栏： JAVA 文章标签： JAVA IMG 正则表达式

本文链接：https://blog.csdn.net/iteye_5109/article/details/82611279

版权

JAVA 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

	private String[] getImgs(String content) {
		String img = "";
		Pattern p_image;
		Matcher m_image;
		String str = "";
		String[] images = null;
		String regEx_img = "(<img.*src\\s*=\\s*(.*?)[^>]*?>)";
		p_image = Pattern.compile(regEx_img, Pattern.CASE_INSENSITIVE);
		m_image = p_image.matcher(content);
		while (m_image.find()) {
			img = m_image.group();
			Matcher m = Pattern.compile("src\\s*=\\s*\"?(.*?)(\"|>|\\s+)").matcher(img);
			while (m.find()) {
				String tempSelected = m.group(1);

				if ("".equals(str)) {
					str = tempSelected;
				} else {
					String temp = tempSelected;
					str = str + "," + temp;
				}
			}
		}
		if (!"".equals(str)) {
			images = str.split(",");
		}
		return images;
	}