If anything here is unclear or incomplete, feel free to point it out in the comments.
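SerDe
- SerDe is short for Serializer and Deserializer: it handles serialization and deserialization.
- It sits between data storage and the execution engine and decouples the two.
- Hive reads and writes table content through ROW FORMAT DELIMITED and ROW FORMAT SERDE.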
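To make the decoupling idea concrete, here is a minimal Python sketch. It is not Hive's SerDe interface; the class `DelimitedSerDe` and the function `count_rows_with_status` are hypothetical names invented for illustration. The point is only that the "engine" function works on rows, while the SerDe alone knows how a row is laid out in storage.

```python
from typing import List


class DelimitedSerDe:
    """Toy stand-in for a SerDe (hypothetical, not Hive's API)."""

    def __init__(self, field_delimiter: str = ",") -> None:
        self.field_delimiter = field_delimiter

    def deserialize(self, line: str) -> List[str]:
        # Storage -> engine: split one stored line into column values.
        return line.rstrip("\n").split(self.field_delimiter)

    def serialize(self, row: List[str]) -> str:
        # Engine -> storage: join column values back into one line.
        return self.field_delimiter.join(row)


def count_rows_with_status(lines: List[str], serde: DelimitedSerDe, status: str) -> int:
    # The "engine" only sees rows (lists of columns), never the raw layout.
    # Column 1 is assumed to hold the status in this toy example.
    return sum(1 for line in lines if serde.deserialize(line)[1] == status)


stored = ["/1.png,304\n", "/,200\n", "/1.css,304\n"]
print(count_rows_with_status(stored, DelimitedSerDe(","), "304"))
```

Swapping the storage layout (say, tab-separated instead of comma-separated) would only mean passing a different SerDe; the counting function stays unchanged.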
Example
Data
192.168.1.4 - - [20/Jul/2021:9:20:30 +0800] "GET /1.png HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:30 +0800] "GET /1.PNG HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:31 +0800] "GET /1.jpg HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:31 +0800] "GET /1.JPG HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:32 +0800] "GET /1.css HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:32 +0800] "GET / HTTP/1.1" 200 11217
192.168.1.4 - - [20/Jul/2021:9:20:33 +0800] "GET /1.mp4 HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:33 +0800] "GET /1.html HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:34 +0800] "GET /1.gif HTTP/1.1" 304 -
192.168.1.4 - - [20/Jul/2021:9:20:34 +0800] "GET / HTTP/1.1" 200 11217
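As a rough illustration of how lines like these can be split into columns with a regular expression, here is a small Python sketch. The pattern and the column names (`host`, `identity`, `user`, `time`, `request`, `status`, `size`) are assumptions made for this sketch; this is not the regex used in the Hive table of this post, and Python's `re` is not identical to the Java regex engine that Hive runs on.

```python
import re
from typing import Dict, Optional

# Assumed layout: host identity user [time] "request" status size
LOG_PATTERN = re.compile(
    r'^(\S+) (\S+) (\S+) \[([^\]]+)\] "([^"]*)" (\S+) (\S+)$'
)
COLUMNS = ("host", "identity", "user", "time", "request", "status", "size")


def parse_line(line: str) -> Optional[Dict[str, str]]:
    # Return a column-name -> value mapping, or None if the line does not match.
    match = LOG_PATTERN.match(line.strip())
    if match is None:
        return None
    return dict(zip(COLUMNS, match.groups()))


sample = '192.168.1.4 - - [20/Jul/2021:9:20:32 +0800] "GET / HTTP/1.1" 200 11217'
row = parse_line(sample)
print(row["host"], row["status"], row["size"])  # prints: 192.168.1.4 200 11217
```

Each capture group corresponds to one column, which is the same one-group-per-column idea a regex-based SerDe relies on.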
Hands-on
Note
The regex in this example was tested in several regex tools and produced no match there.
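One thing worth checking, offered as an assumption rather than a diagnosis: inside a Hive string literal, backslashes are usually written doubled (for example `\\[`), and the literal is unescaped before the regex engine sees it. Pasting the doubled form straight into a regex tool would then test a different pattern. The Python sketch below shows the effect with a hypothetical pattern (not the one from this post), and `unescape_literal` is a deliberately simplified stand-in for that unescaping step.

```python
import re

LINE = '192.168.1.4 - - [20/Jul/2021:9:20:32 +0800] "GET / HTTP/1.1" 200 11217'

# Hypothetical pattern as it might be typed inside a string literal,
# with each backslash doubled.
AS_WRITTEN = r'([^ ]*) ([^ ]*) ([^ ]*) \\[(.*)\\] "(.*)" ([^ ]*) ([^ ]*)'


def unescape_literal(text: str) -> str:
    # Simplified assumption: collapse each doubled backslash into one.
    return text.replace("\\\\", "\\")


# Pasted verbatim, "\\[" asks for a literal backslash, so nothing matches.
print(re.fullmatch(AS_WRITTEN, LINE))

# After unescaping, "\[" matches the literal bracket and the line parses.
match = re.fullmatch(unescape_literal(AS_WRITTEN), LINE)
print(match.groups())
```

If the regex from the table definition behaves the same way, testing the single-backslash form in the regex tool would be the fairer comparison.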