Reading HDFS data with TensorFlow in a Jupyter environment

Reading HDFS data with TensorFlow

Once the CLASSPATH variable is set, the following script, test.py, can be run directly:

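import tensorflow as tf

# Placeholder HDFS path; replace it with your own file or glob pattern.
file_path = "hdfs:///a/b/.csv"
# List the HDFS files that match the path.
files = tf.io.gfile.glob(file_path)
print(files)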

There are two ways to load the CLASSPATH.

1. Run the script from the shell like this

CLASSPATH=$($HADOOP_HDFS_HOME/bin/hadoop classpath --glob) python test.py

2. Set the environment variables in .bashrc

export JAVA_HOME=/usr/local/jdk1.8.0_131
export HADOOP_HDFS_HOME=/usr/local/hadoop-2.7.3
# hadoop-config.sh is a script, not a directory, so it is sourced rather than added to PATH
source $HADOOP_HDFS_HOME/libexec/hadoop-config.sh
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$JAVA_HOME/jre/lib/amd64/server
export PATH=$PATH:$HADOOP_HDFS_HOME/bin:$HADOOP_HDFS_HOME/sbin
export CLASSPATH="$(hadoop classpath --glob)"
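
Before running test.py, it can help to confirm that the variables from the .bashrc lines above are actually visible to Python. The following is only a minimal sketch: the helper name `missing_hdfs_vars` and the list `REQUIRED_VARS` are made up for illustration, and the check only tests that each variable is non-empty, not that its value is correct.

```python
import os

# Variable names taken from the .bashrc lines above.
REQUIRED_VARS = ("JAVA_HOME", "HADOOP_HDFS_HOME", "LD_LIBRARY_PATH", "CLASSPATH")


def missing_hdfs_vars(environ=None):
    """Return the names in REQUIRED_VARS that are unset or empty.

    `environ` defaults to os.environ; passing a plain dict makes the
    function easy to try out without touching the real environment.
    """
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_hdfs_vars()
    if missing:
        print("Missing or empty:", ", ".join(missing))
    else:
        print("All HDFS-related variables are set.")
```

If the script reports missing variables after editing .bashrc, the shell probably has not re-read the file yet; opening a new shell or running `source ~/.bashrc` is the usual fix.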

 

Reading HDFS data with TensorFlow in a Jupyter environment

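Running a script this way makes it impossible to inspect intermediate results, for example what a tensor looks like, how many dimensions it has, or whether it is the result you wanted. Jupyter is the natural choice here, since it makes displaying the value of every variable very convenient.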

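However, CLASSPATH is normally set in a shell script, which is awkward to do for Jupyter. After struggling with it for most of a day, I finally got it working.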

1. Find the value of CLASSPATH and call it a

2. In the notebook, set os.environ["CLASSPATH"]=a
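
The two steps above can also be combined in one notebook cell. This is a hedged sketch, not the method used in this post (which copies the string by hand): the helper name `load_classpath` is hypothetical, and it assumes the Jupyter kernel can execute the `hadoop` binary and can see `HADOOP_HDFS_HOME`. It also assumes that CLASSPATH has to be in place before TensorFlow first touches an hdfs:// path, so it is safest to run it before importing tensorflow.

```python
import os
import subprocess


def load_classpath(command):
    """Run `command`, store its stripped stdout in os.environ["CLASSPATH"].

    `command` is a list such as [hadoop_bin, "classpath", "--glob"].
    Raises subprocess.CalledProcessError if the command fails.
    """
    result = subprocess.run(command, check=True, capture_output=True, text=True)
    classpath = result.stdout.strip()
    os.environ["CLASSPATH"] = classpath
    return classpath


# Example use in a notebook cell (assumes HADOOP_HDFS_HOME is set for the kernel):
#
#   hadoop_bin = os.path.join(os.environ["HADOOP_HDFS_HOME"], "bin", "hadoop")
#   load_classpath([hadoop_bin, "classpath", "--glob"])
#   import tensorflow as tf
```

Passing the command as a list avoids shell quoting issues and makes it easy to point the helper at a different Hadoop installation.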

Step 1: find the value of CLASSPATH

Write a shell script, search_classpath.sh:

CLASSPATH=$($HADOOP_HDFS_HOME/bin/hadoop classpath --glob)
echo $CLASSPATH

Then run

bash search_classpath.sh

This prints a very long string. Mine is

/data/server/hadoop-2.6.0-cdh5.12.0/etc/hadoop:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-compress-1.4.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jersey-server-1.9.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jackson-jaxrs-1.8.8.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jettison-1.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/hadoop-annotations-2.6.0-cdh5.12.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/slf4j-api-1.7.5.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/curator-client-2.7.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/servlet-api-2.5.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jackson-xc-1.8.8.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jaxb-api-2.2.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-logging-1.1.3.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/guava-11.0.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-cli-1.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/api-asn1-api-1.0.0-M20.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/hadoop-auth-2.6.0-cdh5.12.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-codec-1.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/apacheds-kerberos-codec-2.0.0-M15.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-el-1.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jetty-util-6.1.26.cloudera.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/logredactor-1.0.3.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jasper-runtime-5.5.23.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/zookeeper-3.4.5-cdh5.12.0.jar:/da
ta/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-digester-1.8.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jersey-json-1.9.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/mockito-all-1.8.5.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-io-2.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/htrace-core4-4.0.1-incubating.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-math3-3.1.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jetty-6.1.26.cloudera.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/curator-recipes-2.7.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/log4j-1.2.17.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-net-3.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-beanutils-1.9.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/httpcore-4.2.5.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/httpclient-4.2.5.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-httpclient-3.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-configuration-1.6.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/gson-2.2.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/asm-3.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jsr305-3.0.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/xmlenc-0.52.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jersey-core-1.9.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/netty-3.10.5.Final.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/activation-1.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/api-util-1.0.0-M20.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/snappy-java-1.0.4.1.jar:/data/server/hadoop-2.6.0-cdh5.12.
0/share/hadoop/common/lib/jets3t-0.9.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jsp-api-2.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-lang-2.6.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/java-xmlbuilder-0.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/paranamer-2.3.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/protobuf-java-2.5.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/xz-1.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jaxb-impl-2.2.3-1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-beanutils-core-1.8.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/hamcrest-core-1.3.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/junit-4.11.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/commons-collections-3.2.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jsch-0.1.42.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/stax-api-1.0-2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/avro-1.7.6-cdh5.12.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/apacheds-i18n-2.0.0-M15.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/curator-framework-2.7.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/lib/jasper-compiler-5.5.23.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/hadoop-common-2.6.0-cdh5.12.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/hadoop-common-2.6.0-cdh5.12.0-tests.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/common/hadoop-nfs-2.6.0-cdh5.12.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/xml-apis-1.3.04.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jersey-server-1.9.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/servlet-api-2.5
.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/commons-logging-1.1.3.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/guava-11.0.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/commons-cli-1.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/commons-daemon-1.0.13.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jackson-mapper-asl-1.8.8.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/commons-codec-1.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/commons-el-1.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jetty-util-6.1.26.cloudera.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jasper-runtime-5.5.23.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/commons-io-2.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/htrace-core4-4.0.1-incubating.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jetty-6.1.26.cloudera.4.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/log4j-1.2.17.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/asm-3.2.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jsr305-3.0.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/xmlenc-0.52.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jersey-core-1.9.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/netty-3.10.5.Final.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/xercesImpl-2.9.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/leveldbjni-all-1.8.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jackson-core-asl-1.8.8.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/jsp-api-2.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/commons-lang-2.6.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/lib/protobuf-java-2.5.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/hadoop-hdfs-2
.6.0-cdh5.12.0-tests.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/hadoop-hdfs-2.6.0-cdh5.12.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/hdfs/hadoop-hdfs-nfs-2.6.0-cdh5.12.0.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/yarn/lib/commons-compress-1.4.1.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/yarn/lib/jersey-server-1.9.jar:/data/server/hadoop-2.6.0-cdh5.12.0/share/hadoop/yarn/lib/jackson-jaxrs-1.8.8.jar:/data/server/hadoop-2.6.
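
A colon-separated string this long is hard to check by eye, and a copy-paste can easily introduce stray line breaks in the middle of a path. The sketch below splits such a string and lists the entries that do not exist on the local machine. The function names `split_classpath` and `missing_entries` are made up for illustration, and the check only tests for existence, not for whether the jars are the right ones.

```python
import os


def split_classpath(classpath):
    """Split a colon-separated CLASSPATH string into its non-empty entries."""
    return [entry for entry in classpath.split(":") if entry]


def missing_entries(classpath):
    """Return the entries that do not exist on the local filesystem."""
    return [entry for entry in split_classpath(classpath) if not os.path.exists(entry)]


if __name__ == "__main__":
    value = os.environ.get("CLASSPATH", "")
    print(len(split_classpath(value)), "entries")
    for entry in missing_entries(value):
        print("not found:", entry)
```

An entry reported as not found that looks like half of a path usually means the string was broken by a line wrap when it was copied.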