Hive UDF Programming

你说_

wiki

Programming steps:

Extend org.apache.hadoop.hive.ql.exec.UDF
Implement an evaluate method; evaluate may be overloaded

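Notes:

A UDF must declare a return type. It may return null, but the return type cannot be void
UDFs usually work with Hadoop types such as Text and LongWritable; plain Java types are not recommended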
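The two notes above, together with overloading, can be illustrated without a Hive installation. The sketch below is only a stand-in: the class `Describe` and the helper `call` are hypothetical names, and `String`/`Long` are used in place of `Text`/`LongWritable` purely so the code runs with the JDK alone. It assumes, as the old UDF API does, that the `evaluate` method is located by name and argument type through reflection.

```java
import java.lang.reflect.Method;

public class EvaluateOverloadDemo {

    // Hypothetical stand-in for a UDF class. A real one would extend
    // org.apache.hadoop.hive.ql.exec.UDF and take Text / LongWritable arguments.
    public static final class Describe {
        // Overload for string input; returns null for null input, never void.
        public String evaluate(String s) {
            if (s == null) { return null; }
            return "text:" + s.toLowerCase();
        }

        // Overload for numeric input.
        public String evaluate(Long n) {
            if (n == null) { return null; }
            return "long:" + n;
        }
    }

    // Picks the evaluate overload that matches the runtime type of the argument.
    public static Object call(Object udf, Object arg) {
        try {
            Method m = udf.getClass().getMethod("evaluate", arg.getClass());
            return m.invoke(udf, arg);
        } catch (ReflectiveOperationException e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        Describe d = new Describe();
        System.out.println(call(d, "ABC")); // uses evaluate(String)
        System.out.println(call(d, 42L));   // uses evaluate(Long)
    }
}
```

Each overload returns a value or null, so every call yields something that can be placed in a result column; a void method would give the caller nothing to return.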

Example

Add the dependencies to pom.xml

<dependency>
    <groupId>org.apache.hive</groupId>
    <artifactId>hive-jdbc</artifactId>
    <version>2.3.3</version>
</dependency>
<dependency>
    <groupId>org.apache.hive</groupId>
    <artifactId>hive-exec</artifactId>
    <version>2.3.3</version>
</dependency>

Create the Java class Lower.java
- Package it as a jar and upload it to the local directory /opt/datas

package com.example.hive.udf;

import org.apache.hadoop.hive.ql.exec.UDF;
import org.apache.hadoop.io.Text;

// Converts a string column to lower case.
public final class Lower extends UDF {
  // Hive calls evaluate once per row; a null input yields a null output.
  public Text evaluate(Text s) {
    if (s == null) { return null; }
    return new Text(s.toString().toLowerCase());
  }
}

Create the function
- Load the jar into Hive
- add jar /opt/datas/hiveudf.jar;
- Register the function
- create temporary function my_lower as 'com.example.hive.udf.Lower';
Run
hive> select my_lower(title), sum(freq) from titles group by my_lower(title);

Ways to register a function:

create temporary function my_lower as 'com.example.hive.udf.Lower'; -- temporary function, visible only in the current session
create function my_db.my_lower as 'com.example.hive.udf.Lower'; -- permanent function in database my_db
CREATE FUNCTION myfunc AS 'myclass' USING JAR 'hdfs:///path/to/jar'; -- the jar must be stored on HDFS
- e.g.: CREATE FUNCTION mylower AS 'com.example.hive.udf.Lower' USING JAR 'hdfs://hdp-node-01:8020/user/root/hive/jar/hiveudf.jar';
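Since the pom.xml above also pulls in hive-jdbc, the same registration statements could be issued from Java over a JDBC connection. The following is a minimal sketch under that assumption: `RegisterUdfSketch`, `createFunctionSql`, and `register` are hypothetical names, and the code only builds and prints the statements; obtaining a `Connection` to HiveServer2 is left to the caller. The statements are built without a trailing semicolon, which the Hive CLI expects but JDBC statements normally omit.

```java
import java.sql.Connection;
import java.sql.SQLException;
import java.sql.Statement;

public class RegisterUdfSketch {

    // Builds a CREATE [TEMPORARY] FUNCTION statement.
    // jarUri may be null when the jar was already loaded with ADD JAR.
    public static String createFunctionSql(boolean temporary, String name,
                                           String className, String jarUri) {
        StringBuilder sql = new StringBuilder("CREATE ");
        if (temporary) {
            sql.append("TEMPORARY ");
        }
        sql.append("FUNCTION ").append(name)
           .append(" AS '").append(className).append("'");
        if (jarUri != null) {
            sql.append(" USING JAR '").append(jarUri).append("'");
        }
        return sql.toString();
    }

    // Executes one statement on an already opened connection (assumed to be
    // a HiveServer2 connection supplied by the caller).
    public static void register(Connection conn, String sql) throws SQLException {
        try (Statement st = conn.createStatement()) {
            st.execute(sql);
        }
    }

    public static void main(String[] args) {
        // Only prints the statements; no connection is opened here.
        System.out.println(createFunctionSql(
            true, "my_lower", "com.example.hive.udf.Lower", null));
        System.out.println(createFunctionSql(
            false, "myfunc", "myclass", "hdfs:///path/to/jar"));
    }
}
```

Keeping statement construction separate from execution makes it possible to check the generated SQL without a running cluster.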
