BigData
markix
什么问题,什么结果,预期结果?
展开
-
hadoop本地调试环境配置
在Windows 上调试MapReduce下载hadoop-<version>.tar.gz (版本随意)三方镜像地址:http://mirror.bit.edu.cn/apache/hadoop/common/解压添加环境变量,将Hadoop的bin目录添加到环境变量右键“计算机”-> 高级系统设置原创 2018-11-15 00:32:43 · 714 阅读 · 1 评论 -
无法停止hadoop集群(stop-all.sh)
执行 ./bin/stop-all.sh 脚本一直提示没有可停止的namenode、datanode、secondarynode。可是输入 jps 命令,发现hadoop 已经启动。[root@xxxxxx src]# bash hadoop-2.6.5/sbin/stop-all.sh This script is Deprecated. Instead use stop-dfs.sh a...原创 2019-01-05 14:04:00 · 7602 阅读 · 0 评论 -
pyspark
python环境、jdk环境、spark配置环境变量新建SPARK_HOME=E:\Hadoop\spark-2.1.3-bin-hadoop2.6PYSPARK_PYTHON=E:\ProgramData\Anaconda3\envs\py27\python.exe添加PATH=%SPARK_HOME%\bin将E:\Hadoop\spark-2.1.3-bin-hadoop2.6...原创 2019-01-15 00:53:12 · 1777 阅读 · 1 评论 -
pyspark java.net.SocketException: Connection reset by peer
在window、运行pyspark训练模型,报错Caused by: java.net.SocketException: Connection reset by peer: socket write errorpy4j.protocol.Py4JJavaError: An error occurred while calling z:org.apache.spark.api.python.Py...原创 2019-01-20 11:29:45 · 3286 阅读 · 0 评论 -
spark source
spark-submit.shorg/apache/spark/deploy/SparkSubmit.scalamainsubmitdoRunMainprepareSubmitisStandaloneCluster => childMainClass = “org.apache.spark.deploy.Client”isYarnCluster => childMainC...原创 2019-10-08 23:11:49 · 496 阅读 · 0 评论