#!/usr/bin/python
#coding:utf8
import sys
def transform_line(line):
    """Rewrite one tab-separated record for the Hive TRANSFORM demo.

    Takes a raw input line (trailing newline allowed), applies the demo
    substitutions to the second column, and returns the first two columns
    joined by a tab.  Columns beyond the second are dropped, matching the
    original script.

    NOTE(review): the second .replace() also matches the "bian" inside the
    "biansutao" produced by the first, so "sutao" becomes "biansutaosutao".
    This chained behavior is preserved from the original on purpose —
    confirm against the blog post's intent before "fixing" it.

    Raises IndexError if the line has fewer than two tab-separated columns.
    """
    arr = line.strip('\n').split('\t')
    arr[1] = arr[1].replace("sutao", "biansutao").replace("bian", "biansutao")
    return '\t'.join([arr[0], arr[1]])


if __name__ == '__main__':
    # Hadoop-streaming style mapper: one tab-separated record per stdin line.
    # print(...) with a single argument is valid in both Python 2 and 3.
    for line in sys.stdin:
        print(transform_line(line))
# Example Hive usage for this script (kept verbatim from the original post):
# register the file with Hive, then run it over a table via TRANSFORM.
'''
add file /home/hadoop/demo.py
select transform(t.id,t.name) using '/usr/bin/python demo.py' as (a int,b string) from test t;
'''
# Second example: a full Hive streaming MAP / CLUSTER BY / REDUCE pipeline.
# mapper.py and reducer.py are separate scripts, not defined in this file.
'''
ADD FILE mapper.py;
ADD FILE reducer.py;
FROM (
FROM tweets_parsed
MAP tweets_parsed.time, tweets_parsed.id, tweets_parsed.tweet
USING 'python mapper.py'
AS word, count
CLUSTER BY word) map_output
INSERT OVERWRITE TABLE word_count
REDUCE map_output.word, map_output.count
USING 'python reducer.py'
AS word, count;
'''
# Blog footer (scrape artifact, kept as a comment so the file stays valid
# Python): posted 2012-03-09 18:05, 5077 views, comments section followed.