Part 3: Hive User-Defined Functions (UDF) and Transform
Before we start:
A quick way to connect with beeline:
./beeline -u jdbc:hive2://hadoop1:10000 -n hadoop
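1. User-Defined Functions (UDF)
When Hive's built-in functions cannot meet your business processing needs, consider writing a user-defined function (UDF).
A UDF operates on a single row and produces a single row as output, in the same way as the built-in math and string functions.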
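To make the "one row in, one row out" idea concrete, here is a minimal sketch of what the core of a string UDF might look like. The class name ToLowerCaseUdf is made up for illustration, and the class is written as plain Java so it runs without Hive on the classpath. In an actual Hive project it would be declared public and would extend org.apache.hadoop.hive.ql.exec.UDF from the hive-exec jar, which is an assumption about the setup rather than something shown in these notes.

```java
import java.util.Locale;

// Illustrative sketch only (hypothetical class name).
// Assumption: in a real Hive project this class would be public and would
// extend org.apache.hadoop.hive.ql.exec.UDF, and Hive would locate the
// evaluate method by reflection.
class ToLowerCaseUdf {

    // Called once per input row: one value in, one value out.
    // Returns null for null input, as most Hive string functions do.
    public String evaluate(String input) {
        if (input == null) {
            return null;
        }
        return input.toLowerCase(Locale.ROOT);
    }

    // Small local check so the logic can be tried without a Hive cluster.
    public static void main(String[] args) {
        ToLowerCaseUdf udf = new ToLowerCaseUdf();
        System.out.println(udf.evaluate("HELLO Hive")); // prints "hello hive"
    }
}
```

Once packaged into a jar, a class like this is typically made available to a session with Hive's ADD JAR and CREATE TEMPORARY FUNCTION statements, after which it can be called in a query like any built-in function.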
2. Development example
2.1 Raw data format
{"movie":"1193","rate":"5","timeStamp":"978300760","uid":"1"}
{"movie":"661","rate":"3","timeStamp":"978302109","uid":"1"}
{"movie":"914","rate":"3","timeStamp":"978301968","uid":"1"}
{"movie":"3408","rate":"4","timeStamp":"978300275","uid":"1"}
{"movie":"2355","rate":"5","time