Java在做网络爬虫时，判断是否含有汉字

最新推荐文章于 2024-03-28 17:07:10 发布

凌晨之星

最新推荐文章于 2024-03-28 17:07:10 发布

阅读量101

点赞数

本文链接：https://blog.csdn.net/qq_43910202/article/details/105035029

版权

构造如下方法（可以直接拿我的代码）：

    /**
     * 判断是否有汉字
     * @param str
     * @return
     * @author：严天贺
     */
    public static boolean extractChinese(String str){
        String results = "";
        Pattern pattern = Pattern.compile("[\u4e00-\u9fa5]");
        Matcher m = pattern.matcher(str);
        if(m.find()){
            return true;
        }

        return false;
    }

在其他java类里可以直接调用这个方法

/**
     * aothor:严天贺
     */
    public static String rules1(String html) {
        String result = "";
        Document document = Jsoup.parse(html);
        Elements elements = document.select("#detail > div.main > div > div.vF_deail_maincontent > div > div.table > table > tbody > tr:nth-child(17) > td:nth-child(2)");
        for (Element element : elements) {
            result = element.text();

        }
        if (  SNRules.extractChinese(result)){
            result = "";
        }

        return result;
    }

小结：当我们遇到问题时，往往人脑会自动默认执行一些方法，但是程序还是不会的，它所使用的方法，都必须由我们提前写好，并且能够正确调用，才能达到我们预期的结果。

凌晨之星

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Java在做网络爬虫时，判断是否含有汉字

构造如下方法（可以直接拿我的代码）： /** * 判断是否有汉字 * @param str * @return * @author：严天贺 */ public static boolean extractChinese(String str){ String results = ""; Patt...
复制链接

扫一扫