Hadoop Troubleshooting Notes

namenode.NameNode: java.io.IOException: Cannot create directory /export/home/dfs/name/current

Methods found via a Baidu search are below; neither of them solved the problem.

Fix 1: give other users write permission on /home/hadoop as well

    sudo chmod -R a+w /home/hadoop/tmp

Fix 2: point hadoop.tmp.dir at a path the current user can write to, by editing core-site.xml:

<property>
  <name>hadoop.tmp.dir</name>
  <value>/tmp/hadoop-${user.name}</value>
</property>


Before formatting the NameNode, make sure the directory specified by dfs.name.dir does not exist.
Hadoop does this to prevent an existing cluster from being formatted by mistake.
Deleted the entire /data0/hadoop-name directory, then formatted again. Success!
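For reference, the sequence was roughly the following, assuming dfs.name.dir points at /data0/hadoop-name as above (on 1.x releases the command is hadoop namenode -format):

rm -rf /data0/hadoop-name    # remove the old name directory so the pre-format check passes
hdfs namenode -format        # reformat the NameNode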
-----------

I googled "Too many failed volumes" and then configured dfs.datanode.failed.volumes.tolerated in hdfs-site.xml (the default should be 0, so a single failed disk keeps the DataNode from starting).

Finally figured out the cause: dfs.datanode.failed.volumes.tolerated had been set higher than the number of disks in dfs.datanode.data.dir. The parameter applies to a single DataNode, not to the total number of failed disks in the cluster. After setting dfs.datanode.failed.volumes.tolerated to 1, the error above went away and the DataNode came up. The second disk still could not join DFS, because I had set its owner and group to admin; but since each DataNode had only one bad disk, which is within the tolerance of 1, the node starts normally. All that remained was to fix the owner and permissions on that second disk.
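A sketch of the relevant hdfs-site.xml entries, assuming two data directories per DataNode (the paths here are illustrative, not from the original setup):

<property>
  <name>dfs.datanode.data.dir</name>
  <value>/data1/hdfs/data,/data2/hdfs/data</value>
</property>
<property>
  <!-- per DataNode; must be smaller than the number of entries in dfs.datanode.data.dir -->
  <name>dfs.datanode.failed.volumes.tolerated</name>
  <value>1</value>
</property>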

------------------------

A newly added disk was formatted as root; after mounting, the mount point was owned by root, so the hadoop user had no permission to open it!

ls -ral /mnt/hadoop/
chown -R hadoop:hadoop /mnt/hadoop/data/
Check directory ownership and permissions carefully and precisely!

========================
Container [...] is running beyond virtual memory limits.

Application containers are not actually using much physical memory, but still have a large virtual memory size. By default, Hadoop will kill containers that use a great deal more virtual memory than physical memory, as determined by yarn.nodemanager.vmem-pmem-ratio. We actually recommend disabling this check by setting yarn.nodemanager.vmem-check-enabled to false as there is reason to believe the virtual/physical ratio is exceptionally high with some versions of Java / Linux.
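The corresponding yarn-site.xml entries would look roughly like this (standard YARN property names; the values are illustrative, so either disable the check, or keep it and raise the ratio from its 2.1 default):

<property>
  <name>yarn.nodemanager.vmem-check-enabled</name>
  <value>false</value>
</property>
<property>
  <!-- only matters while the vmem check stays enabled -->
  <name>yarn.nodemanager.vmem-pmem-ratio</name>
  <value>4</value>
</property>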
==========

Third-party jars:


The files/archives can be distributed by setting the property mapreduce.job.cache.{files|archives}. If more than one file/archive has to be distributed, they can be added as comma-separated paths. The properties can also be set by the APIs Job.addCacheFile(URI)/Job.addCacheArchive(URI) and Job.setCacheFiles(URI[])/Job.setCacheArchives(URI[]), where URI is of the form hdfs://host:port/absolute-path#link-name. In Streaming, the files can be distributed through the command line options -cacheFile/-cacheArchive.

The DistributedCache can also be used as a rudimentary software distribution mechanism for use in the map and/or reduce tasks. It can be used to distribute both jars and native libraries. The Job.addArchiveToClassPath(Path) or Job.addFileToClassPath(Path) API can be used to cache files/jars and also add them to the classpath of the child JVM. The same can be done by setting the configuration properties mapreduce.job.classpath.{files|archives}. Similarly, the cached files that are symlinked into the working directory of the task can be used to distribute native libraries and load them.
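For the usual case here (shipping a third-party jar to the map/reduce tasks), a minimal sketch using the generic -libjars/-files options; myjob.jar, com.example.MyJob and the paths are placeholders, and the job driver has to go through ToolRunner/GenericOptionsParser for these options to be picked up:

hadoop jar myjob.jar com.example.MyJob \
  -libjars /local/path/third-party.jar \
  -files /local/path/lookup.txt#lookup \
  /input /output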


==============

<property>
  <name>mapreduce.map.java.opts</name>
  <value>
  -Xmx512M -Djava.library.path=/home/mycompany/lib -verbose:gc -Xloggc:/tmp/@taskid@.gc
  -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.ssl=false
  </value>
</property>

<property>
  <name>mapreduce.reduce.java.opts</name>
  <value>
  -Xmx1024M -Djava.library.path=/home/mycompany/lib -verbose:gc -Xloggc:/tmp/@taskid@.gc
  -Dcom.sun.management.jmxremote.authenticate=false -Dcom.sun.management.jmxremote.ssl=false
  </value>
</property>

Fixing "unable to load hadoop native":
hadoop checknative -a
If the error says the .so file is too short (a broken symlink/placeholder):
mv libhadoop.so libhadoop.so.bak
ln -s libhadoop.so.1.*** libhadoop.so
If it reports a glibc version problem, download and build the required version (Hadoop 2.7.1 needs glibc 2.14.1):

yum -y install gcc gettext texinfo autoconf   # gettext provides msgfmt, texinfo provides makeinfo
export CFLAGS="-g -O2"
mkdir glibc-build && cd glibc-build           # glibc must be configured outside its source tree
../glibc-2.14.1/configure --prefix=/usr --disable-profile --enable-add-ons --with-headers=/usr/include --with-binutils=/usr/bin
make
make install
Installing glibc 2.14.1 on CentOS 6 ran into trouble, though.
In the end, simply copying over the native directory used by Hadoop 2.6.0 did the trick:
cp -R /usr/hadoop/hadoop-2.6.0/lib/native/ /usr/hadoop/hadoop-2.7.1/lib/
cp -R /usr/hadoop/hadoop-2.6.0-old/lib/native/ /usr/hadoop/hadoop-2.7.1/lib/
chmod -R 755 /usr/spark/spark-1.4.1/sbin /usr/spark/spark-1.4.1/bin
mv /usr/spark/spark-1.4.0 /usr/spark/spark-1.4.0-old
mv /usr/hadoop/hadoop-2.7.1 /usr/hadoop/hadoop-2.7.1-old

------------------------
: Name or service not knownstname slave006
: Name or service not knownstname slave002
: Name or service not knownstname slave011
--
Deleted the slaves file and recreated it from the command line; the old slaves file had presumably been polluted with stray formatting.
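One hedged way to check for that kind of pollution (stray carriage returns are a common cause of this mangled "Name or service not known" output) and rebuild the file; the hostnames are the ones from the errors above:

cat -A slaves                                   # stray ^M at line ends means DOS carriage returns
tr -d '\r' < slaves > slaves.clean && mv slaves.clean slaves
# or simply recreate it:
printf '%s\n' slave002 slave006 slave011 > slaves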
