avrorecord.java,失败，但发生异常java.io.IOException：org.apache.avro.AvroTypeException：发现的很长，期望在配置单元中实现联合...

最新推荐文章于 2022-07-06 00:30:00 发布

weixin_39746229

最新推荐文章于 2022-07-06 00:30:00 发布

阅读量303

点赞数

文章标签： avrorecord.java

需要帮忙！！！

我正在使用flume将Twitter提要流式传输到hdfs中，并将其加载hive进行分析。

步骤如下：

我已经avro schema在avsc文件中描述了并将其放入hadoop：

{"type":"record",

"name":"Doc",

"doc":"adoc",

"fields":[{"name":"id","type":"string"},

{"name":"user_friends_count","type":["int","null"]},

{"name":"user_location","type":["string","null"]},

{"name":"user_description","type":["string","null"]},

{"name":"user_statuses_count","type":["int","null"]},

{"name":"user_followers_count","type":["int","null"]},

{"name":"user_name","type":["string","null"]},

{"name":"user_screen_name","type":["string","null"]},

{"name":"created_at","type":["string","null"]},

{"name":"text","type":["string","null"]},

{"name":"retweet_count","type":["boolean","null"]},

{"name":"retweeted","type":["boolean","null"]},

{"name":"in_reply_to_user_id","type":["long","null"]},

{"name":"source","type":["string","null"]},

{"name":"in_reply_to_status_id","type":["long","null"]},

{"name":"media_url_https","type":["string","null"]},

{"name":"expanded_url","type":["string","null"]}]}

我写了一个.hql文件来创建表并在其中加载数据：

create table tweetsavro

row format serde

'org.apache.hadoop.hive.serde2.avro.AvroSerDe'

stored as inputformat

'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat'

outputformat

'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat'

tblproperties ('avro.schema.url'='hdfs:///avro_schema/AvroSchemaFile.avsc');

load data inpath '/test/twitter_data/FlumeData.*' overwrite into table tweetsavro;

我已经成功运行.hql文件，但是当我select *from 在蜂巢中运行命令时，它显示以下错误：

tweetsavro的输出为：

hive> desc tweetsavro;

OK

id string

user_friends_count int

user_location string

user_description string

user_statuses_count int

user_followers_count int

user_name string

user_screen_name string

created_at string

text string

retweet_count boolean

retweeted boolean

in_reply_to_user_id bigint

source string

in_reply_to_status_id bigint

media_url_https string

expanded_url string

Time taken: 0.697 seconds, Fetched: 17 row(s)

weixin_39746229

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。