天猫,淘宝,京东收货信息中,自动识别手机号、姓名、省市区

最近工作需要,需要从一串字符串中识别出姓名,手机号和省市区,例如下面的一些字符串:

橱之友,包卫贞,13600000000,浙江省宁波市,慈溪市,庵东镇 杭州湾新区世纪城翠湖苑5栋2000

【识别出】姓名为“包卫贞”,手机号为“13600000000”,省市区为“浙江省 宁波市 慈溪市”


marui谢sir,谢义海,13900000000,江苏省南京市,秦淮区,健康路1号水游城5楼1号

【识别出】姓名为“谢义海”,手机号为“13900000000”,省市区为“江苏省 南京市 秦淮区”


刘伟,13120000000,北京 北京市 朝阳区 东湖街道 利泽中园二区208号,000000

【识别出】姓名为“刘伟”,手机号为“13120000000”,省市区为“北京 北京市 朝阳区”

在网上找了很多资料,都没有解决,下面是我的详细解决思路和Java代码。

【解决思路】

1.最先识别出11位手机号,这个最简单

2.其次识别出姓名,姓名难度有点大,也没找到好的方法,我是从姓氏下手识别的。

3.最后识别出省市区,这个难度也很大,试了很多方法,最好找到一种方法,效果不错。

如果需要源码,可以去下载

 

【java代码】

1.识别手机号

        private static String getMobile(String addressInfo) {
            //*识别电话号码*/
            Pattern pattern = Pattern.compile("([1][3-9][\\d]{9})|(0\\d{2,4}-\\d{7,8})");
            Matcher matcher = pattern.matcher(addressInfo);
            StringBuffer bf = new StringBuffer(64);
            while (matcher.find()) {
                bf.append(matcher.group());
            }
            return bf.toString();
        }

2.识别姓名

        private static String getName(String addressInfo,String splitStr) {
            //*识别姓名*/
            String[] firstNames = {"赵","钱","孙","李","周","吴","郑","王","冯","陈","褚","卫","蒋","沈","韩","杨","朱","秦","尤","许","何","吕","施","张","孔","曹","严","华","金","魏","陶","姜","戚","谢","邹","喻","柏","水","窦","章","云","苏","潘","葛","奚","范","彭","郎","鲁","韦","昌","马","苗","凤","花","方","俞","任","袁","柳","酆","鲍","史","唐","费","廉","岑","薛","雷","贺","倪","汤","滕","殷","罗","毕","郝","邬","安","常","乐","于","时","傅","皮","卞","齐","康","伍","余","元","卜","顾","孟","平","黄","和","穆","萧","尹","姚","邵","湛","汪","祁","毛","禹","狄","米","贝","明","臧","计","伏","成","戴","谈","宋","茅","庞","熊","纪","舒","屈","项","祝","董","粱","杜","阮","蓝","闵","席","季","麻","强","贾","路","娄","危","江","童","颜","郭","梅","盛","林","刁","钟","徐","邱","骆","高","夏","蔡","田","樊","胡","凌","霍","虞","万","支","柯","昝","管","卢","莫","经","房","裘","缪","干","解","应","宗","丁","宣","贲","邓","郁","单","杭","洪","包","诸","左","石","崔","吉","钮","龚","程","嵇","邢","滑","裴","陆","荣","翁","荀","羊","於","惠","甄","麴","家","封","芮","羿","储","靳","汲","邴","糜","松","井","段","富","巫","乌","焦","巴","弓","牧","隗","山","谷","车","侯","宓","蓬","全","郗","班","仰","秋","仲","伊","宫","宁","仇","栾","暴","甘","钭","厉","戎","祖","武","符","刘","景","詹","束","龙","叶","幸","司","韶","郜","黎","蓟","薄","印","宿","白","怀","蒲","邰","从","鄂","索","咸","籍","赖","卓","蔺","屠","蒙","池","乔","阴","欎","胥","能","苍","双","闻","莘","党","翟","谭","贡","劳","逄","姬","申","扶","堵","冉","宰","郦","雍","舄","璩","桑","桂","濮","牛","寿","通","边","扈","燕","冀","郏","浦","尚","农","温","别","庄","晏","柴","瞿","阎","充","慕","连","茹","习","宦","艾","鱼","容","向","古","易","慎","戈","廖","庾","终","暨","居","衡","步","都","耿","满","弘","匡","国","文","寇","广","禄","阙","东","殴","殳","沃","利","蔚","越","夔","隆","师","巩","厍","聂","晁","勾","敖","融","冷","訾","辛","阚","那","简","饶","空","曾","毋","沙","乜","养","鞠","须","丰","巢","关","蒯","相","查","後","荆","红","游","竺","权","逯","盖","益","桓","公","万俟","司马","上官","欧阳","夏侯","诸葛","闻人","东方","赫连","皇甫","尉迟","公羊","澹台","公冶","宗政","濮阳","淳于","单于","太叔","申屠","公孙","仲孙","轩辕","令狐","钟离","宇文","长孙","慕容","鲜于","闾丘","司徒","司空","亓官","司寇","仉","督","子车","颛孙","端木","巫马","公西","漆雕","乐正","壤驷","公良","拓跋","夹谷","宰父","谷梁","晋","楚","闫","法","汝","鄢","涂","钦","段干","百里","东郭","南门","呼延","归","海","羊舌","微生","岳","帅","缑","亢","况","后","有","琴","梁丘","左丘","东门","西门","商","牟","佘","佴","伯","赏","南宫","墨","哈","谯","笪","年","爱","阳","佟"};
            String[] names=addressInfo.split(splitStr);
            String name="";
            for(int i=0;i<names.length;i++){
                String nametmp=names[i];
                //System.out.println(names.length+"//"+nametmp);
                if(nametmp==null||nametmp.length()<=1||nametmp.length()>4){
                    continue;
                }
                for(int j=0;j<firstNames.length;j++){
                    if(nametmp.startsWith(firstNames[j])){
                        name=nametmp;
                        break;
                    }
                }
                if(!"".equals(name)){
                    break;
                }
            }
            return name;
        }

3.识别省市区

    public static Map<String,String> getAddress(String address) throws MalformedURLException{
        //根据市和详细地址获取地址经纬度
        String LatitudeAndLongitude = getGeocoderLatitude(address, "");
        JsonNode jsonNode = getposition(LatitudeAndLongitude);
        String province = jsonNode.findValue("province").toString().replace("\"","");
        String city = jsonNode.findValue("city").toString().replace("\"","");
        String area = jsonNode.findValue("district").toString().replace("\"","");
        String street=jsonNode.findValue("street").toString().replace("\"","");
        Map<String,String> addMap = new HashMap<String,String>();
        addMap.put("province", province);
        addMap.put("city", city);
        addMap.put("area", area);
        addMap.put("street", street);
        return addMap;        
    }

    /**
     * 根据经纬度查询地址信息
     * @param
     * @throws MalformedURLException
     */
    public static JsonNode getposition(String LatitudeAndLongitude)
            throws MalformedURLException {
        BufferedReader in = null;
        JsonNode locationNode = null;
        /*URL tirc = new URL("http://api.map.baidu.com/geocoder?location="
                + LatitudeAndLongitude + "&output=json&key="
                + "E4805d16520de693a3fe707cdc962045");*/
        URL tirc = new URL("http://api.map.baidu.com/geocoder/v2/?location="
                + LatitudeAndLongitude + "&output=json&ak="
                + ak);
        try {
            in = new BufferedReader(new InputStreamReader(tirc.openStream(),
                    "UTF-8"));
            String res;
            StringBuilder sb = new StringBuilder("");
            while ((res = in.readLine()) != null) {
                sb.append(res.trim());
            }
            String str = sb.toString();
            ObjectMapper mapper = new ObjectMapper();
            if (StringUtils.isNotEmpty(str)) {
                JsonNode jsonNode = mapper.readTree(str);
                jsonNode.findValue("status").toString();
                JsonNode resultNode = jsonNode.findValue("result");
                /*JsonNode locationNode = resultNode
                        .findValue("formatted_address");*/
                locationNode = resultNode
                        .findValue("addressComponent");
                /*JsonNode area = locationNode.findValue("district");
                System.out.println(area);*/
            }

        } catch (Exception e) {

        }
        return locationNode;

    }
    

 

Java完整项目 源码下载

Python完整项目 源码下载

  • 3
    点赞
  • 18
    收藏
    觉得还不错? 一键收藏
  • 打赏
    打赏
  • 9
    评论
可以使用 Apache POI 库来实现 Java 解析 Excel 文件的功能,并使用正则表达式来匹配手机号姓名省市区信息。 首先,你需要导入 Apache POI 库,然后使用其提供的 API 来读取 Excel 文件的数据。 然后,你可以使用正则表达式来匹配手机号省市区信息,例如: 手机号:使用正则表达式 `\d{11}` 来匹配11位数字,即手机号省市区:你可以使用第三方库或者公开数据源来获取省市区信息,然后使用正则表达式来匹配相应的信息。 最后,将匹配到的信息保存到一个数据结构,例如一个 List 或者一个 Map。 以下是一个简单的示例代码,可以帮助你入门: ```java import java.io.FileInputStream; import java.util.ArrayList; import java.util.List; import java.util.regex.Matcher; import java.util.regex.Pattern; import org.apache.poi.ss.usermodel.Cell; import org.apache.poi.ss.usermodel.Row; import org.apache.poi.ss.usermodel.Sheet; import org.apache.poi.ss.usermodel.Workbook; import org.apache.poi.ss.usermodel.WorkbookFactory; public class ExcelParser { public static void main(String[] args) { try { // 读取 Excel 文件 Workbook workbook = WorkbookFactory.create(new FileInputStream("data.xlsx")); Sheet sheet = workbook.getSheetAt(0); // 定义正则表达式 Pattern phonePattern = Pattern.compile("\\d{11}"); Pattern areaPattern = Pattern.compile("上海市浦东新区"); // 定义数据结构 List<String> names = new ArrayList<>(); List<String> phones = new ArrayList<>(); List<String> areas = new ArrayList<>(); // 遍历所有行 for (Row row : sheet) { // 获取单元格数据 Cell nameCell = row.getCell(0); Cell phoneCell = row.getCell(1); Cell areaCell = row.getCell(2); // 匹配手机号 String phone = phoneCell.getStringCellValue(); Matcher phoneMatcher = phonePattern.matcher(phone); if (phoneMatcher.matches()) { phones.add(phone); } // 匹配省市区 String area = areaCell.getStringCellValue(); Matcher areaMatcher = areaPattern.matcher(area); if (areaMatcher.matches()) { areas.add(area); } // 匹配姓名 String name = nameCell.getStringCellValue(); if (!phoneMatcher.matches() && !areaMatcher.matches()) { names.add(name); } } // 输出结果 for (int i = 0; i < names.size(); i++) { System.out.println(names.get(i) + " " + phones.get(i) + " " + areas.get(i)); } // 关闭 Excel 文件 workbook.close(); } catch (Exception e) { e.printStackTrace(); } } } ``` 注意:上面的代码示例是针对 xlsx 格式的 Excel 文件,如果你要解析 xls 格式的文件,则需要使用不同的 API。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论 9
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

秋9

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值