Error when running importtsv to import data

When running HBase's importtsv command via hadoop jar to import TSV data, a NoClassDefFoundError appears because the google-collect library is missing; that library has since been renamed to Guava. The fix is to copy guava-xx.jar from $HBASE_HOME/lib into $HADOOP_HOME/lib and rerun the command, which then complains about missing arguments. Specify the -Dimporttsv.columns option with the TSV column names, and set other options as needed (separator, skipping bad lines, etc.) to import the data successfully.

Running importtsv to import data fails with this error:

[hadoop@master ~]$ hadoop jar /usr/hbase/hbase-0.94.12-security.jar importtsv
Exception in thread "main" java.lang.NoClassDefFoundError: com/google/common/collect/Multimap
    at org.apache.hadoop.hbase.mapreduce.Driver.main(Driver.java:43)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
Caused by: java.lang.ClassNotFoundException: com.google.common.collect.Multimap
    at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
    at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
    ... 6 more
[hadoop@master ~]$

This happens because the jar is run without the google-collect library on the classpath. But "This library was renamed to Guava!", so the file to look for is the guava jar instead.

Since it is hadoop that executes the jar here, the cause is that the $HADOOP_HOME/lib directory is missing guava-xx.jar.

Copying guava-xx.jar from $HBASE_HOME/lib into $HADOOP_HOME/lib solves the problem:
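The copy is a one-liner; this sketch assumes $HBASE_HOME and $HADOOP_HOME point at the installs used above, and the wildcard picks up whichever Guava version your HBase release ships (check your lib directory for the exact jar name):

[hadoop@master ~]$ # put the Guava jar HBase ships onto Hadoop's classpath
[hadoop@master ~]$ cp $HBASE_HOME/lib/guava-*.jar $HADOOP_HOME/lib/

With the Guava jar in place, rerun the same command: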

[hadoop@master lib]$ hadoop jar /usr/hbase/hbase-0.94.12-security.jar importtsv
ERROR: Wrong number of arguments: 0
Usage: importtsv -Dimporttsv.columns=a,b,c <tablename> <inputdir>

Imports the given input directory of TSV data into the specified table.

The column names of the TSV data must be specified using the -Dimporttsv.columns
option. This option takes the form of comma-separated column names, where each
column name is either a simple column family, or a columnfamily:qualifier. The special
column name HBASE_ROW_KEY is used to designate that this column should be used
as the row key for each imported record. You must specify exactly one column
to be the row key, and you must specify a column name for every column that exists in the
input data. Another special column HBASE_TS_KEY designates that this column should be
used as timestamp for each record. Unlike HBASE_ROW_KEY, HBASE_TS_KEY is optional.
You must specify at most one column as timestamp key for each imported record.
Record with invalid timestamps (blank, non-numeric) will be treated as bad record.
Note: if you use this option, then 'importtsv.timestamp' option will be ignored.

By default importtsv will load data directly into HBase. To instead generate
HFiles of data to prepare for a bulk data load, pass the option:
  -Dimporttsv.bulk.output=/path/for/output
Note: if you do not use this option, then the target table must already exist in HBase

Other options that may be specified with -D include:
  -Dimporttsv.skip.bad.lines=false - fail if encountering an invalid line
  '-Dimporttsv.separator=|' - eg separate on pipes instead of tabs
  -Dimporttsv.timestamp=currentTimeAsLong - use the specified timestamp for the import
  -Dimporttsv.mapper.class=my.Mapper - A user-defined Mapper to use instead of org.apache.hadoop.hbase.mapreduce.TsvImporterMapper

For performance consider the following options:
  -Dmapred.map.tasks.speculative.execution=false
  -Dmapred.reduce.tasks.speculative.execution=false
[hadoop@master lib]$

The usage message printed above shows that importtsv is now working normally.
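For reference, a complete invocation might look like the following sketch; the table name (mytable), column family (cf), and HDFS input directory (/user/hadoop/tsv-input) are hypothetical placeholders, not values from this post. The first TSV field becomes the row key, the next two land in cf:col1 and cf:col2, and the target table must already exist since no bulk-output path is given:

[hadoop@master ~]$ hadoop jar /usr/hbase/hbase-0.94.12-security.jar importtsv \
    -Dimporttsv.columns=HBASE_ROW_KEY,cf:col1,cf:col2 \
    mytable /user/hadoop/tsv-input

If the input uses a delimiter other than tabs, add something like -Dimporttsv.separator=',' as described in the usage text above.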
