现在假设有一个IT教育网站,有Java,PHP,net等多个栏目,下面是模拟实现的网站日志
第一个字段是访问日期,第二个字段是访问的URL,其中每个栏目有一个独立域名,如下:
java.aaaaaaa.cn
net.aaaaaaa.cn
php.aaaaaaa.cn
20160321101954 http://java.aaaaaaa.cn/java/course/javaeeadvanced.shtml
20160321101954 http://java.aaaaaaa.cn/java/course/javaee.shtml
20160321101954 http://java.aaaaaaa.cn/java/course/android.shtml
20160321101954 http://java.aaaaaaa.cn/java/video.shtml
20160321101954 http://java.aaaaaaa.cn/java/teacher.shtml
20160321101954 http://java.aaaaaaa.cn/java/course/android.shtml
20160321101954 http://php.aaaaaaa.cn/php/teacher.shtml
20160321101954 http://net.aaaaaaa.cn/net/teacher.shtml
20160321101954 http://java.aaaaaaa.cn/java/course/hadoop.shtml
20160321101954 http://java.aaaaaaa.cn/java/course/base.shtml
20160321101954 http://net.aaaaaaa.cn/net/course.shtml
20160321101954 http://php.aaaaaaa.cn/php/teacher.shtml
20160321101954 http://net.aaaaaaa.cn/net/video.shtml
20160321101954 http://java.aaaaaaa.cn/java/course/base.shtml
20160321101954 http://net.aaaaaaa.cn/net/teacher.shtml
20160321101954 http://java.aaaaaaa