tf_spark: Run MNIST example

Reference: https://github.com/yahoo/TensorFlowOnSpark/wiki/GetStarted_YARN


Run MNIST example

Download/zip the MNIST dataset

# Download the MNIST dataset
mkdir mnist
cd mnist
curl -O "http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz"
curl -O "http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz"
curl -O "http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz"
curl -O "http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz"
zip -r mnist.zip *  # create mnist.zip

# Upload to HDFS
hdfs dfs -mkdir mnist
hdfs dfs -put mnist.zip mnist  # upload into the mnist directory on HDFS

hdfs dfs -ls mnist  # list the directory
# hdfs dfs -rm -r mnist  # delete it
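Before uploading, it can help to sanity-check the downloaded files. They use the IDX binary format: a big-endian header of two zero bytes, a type code, and the number of dimensions, then one 32-bit size per dimension, followed by the raw data bytes. A minimal parser sketch (the function names here are illustrative, not part of TensorFlowOnSpark):

```python
import gzip
import struct

def parse_idx(buf):
    """Parse the IDX binary format used by the MNIST files."""
    zeros, dtype, ndim = struct.unpack_from(">HBB", buf, 0)
    dims = struct.unpack_from(">" + "I" * ndim, buf, 4)
    data = buf[4 + 4 * ndim:]
    return dims, data

def read_idx(path):
    """Read a plain or gzip-compressed IDX file from disk."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rb") as f:
        return parse_idx(f.read())

# For train-images-idx3-ubyte.gz this yields dims == (60000, 28, 28)
# plus 60000*28*28 bytes of pixel data.
```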

Convert the MNIST.zip files into HDFS files

# save images and labels as CSV files
spark-submit \
--master yarn \
--deploy-mode cluster \
--queue default \
--num-executors 45 \
--executor-memory 2G \
--driver-memory 12G \
--conf spark.dynamicAllocation.enabled=false \
--conf spark.yarn.maxAppAttempts=1 \
--jars hdfs://xxxx:xx/spark-tensorflow/spark-tensorflow-connector-1.0-SNAPSHOT.jar \
--archives hdfs://xxxx:xx/user/root/mnist/mnist.zip#mnist \
TensorFlowOnSpark/examples/mnist/mnist_data_setup.py \
--output hdfs://xxxx:xx/user/root/mnist/csv \
--format csv

# save images and labels as pickle files
spark-submit \
--master yarn \
--deploy-mode cluster \
--queue default \
--num-executors 45 \
--executor-memory 2G \
--driver-memory 12G \
--conf spark.dynamicAllocation.enabled=false \
--conf spark.yarn.maxAppAttempts=1 \
--jars hdfs://xxxx:xx/spark-tensorflow/spark-tensorflow-connector-1.0-SNAPSHOT.jar \
--archives hdfs://xxxx:xx/user/root/mnist/mnist.zip#mnist \
TensorFlowOnSpark/examples/mnist/mnist_data_setup.py \
--output hdfs://xxxx:xx/user/root/mnist/pickle \
--format pickle 
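mnist_data_setup.py defines the exact on-disk layout; as a rough sketch of the CSV idea, assuming each image becomes one row of 784 comma-separated pixel values and each label becomes a one-hot vector (the helper names below are hypothetical):

```python
def image_to_csv(pixels):
    """Serialize one image's pixel values as a single CSV row."""
    return ",".join(str(p) for p in pixels)

def csv_to_image(line):
    """Parse a CSV row back into a list of pixel values."""
    return [int(p) for p in line.split(",")]

def label_to_onehot_csv(label, num_classes=10):
    """Serialize a digit label as a one-hot CSV row."""
    return ",".join("1.0" if i == label else "0.0" for i in range(num_classes))
```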

Run distributed MNIST training (using feed_dict)

${SPARK_HOME}/bin/spark-submit \
--master yarn \
--deploy-mode cluster \
--queue default \
--num-executors 4 \
--executor-memory 27G \
--py-files TensorFlowOnSpark/tfspark.zip,TensorFlowOnSpark/examples/mnist/spark/mnist_dist.py \
--conf spark.dynamicAllocation.enabled=false \
--conf spark.yarn.maxAppAttempts=1 \
TensorFlowOnSpark/examples/mnist/spark/mnist_spark.py \
--images hdfs://xxx:xx/user/root/mnist/pickle/train/images \
--labels hdfs://xxx:xx/user/root/mnist/pickle/train/labels \
--mode train \
--model hdfs://xxx:xx/user/root/mnist_model
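In feed_dict mode, each Spark executor streams partitions of the images/labels RDDs into the TensorFlow graph batch by batch. The batching idea can be sketched in plain Python with no TensorFlow dependency (illustrative only; the real loop lives in mnist_dist.py):

```python
def batches(images, labels, batch_size):
    """Yield (image_batch, label_batch) pairs the way a feed_dict
    training loop would consume them."""
    for i in range(0, len(images), batch_size):
        yield images[i:i + batch_size], labels[i:i + batch_size]

# Each yielded pair would be fed as
# sess.run(train_op, feed_dict={x: image_batch, y_: label_batch})
```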

Run distributed MNIST inference (using feed_dict)

${SPARK_HOME}/bin/spark-submit \
--master yarn \
--deploy-mode cluster \
--queue default \
--num-executors 4 \
--executor-memory 27G \
--py-files TensorFlowOnSpark/tfspark.zip,TensorFlowOnSpark/examples/mnist/spark/mnist_dist.py \
--conf spark.dynamicAllocation.enabled=false \
--conf spark.yarn.maxAppAttempts=1 \
TensorFlowOnSpark/examples/mnist/spark/mnist_spark.py \
--images hdfs://xxx:xx/user/root/mnist/pickle/test/images \
--labels hdfs://xxx:8020/user/root/mnist/pickle/test/labels \
--mode inference \
--model hdfs://xxx:8020/user/root/mnist_model \
--output hdfs://xxx:8020/user/root/predictions
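The inference job writes its predictions under --output. Assuming each output line pairs a true label with a predicted label, e.g. `... Label: 7, Prediction: 7` (the actual format is set by mnist_spark.py, so treat this pattern as an assumption), accuracy can be computed from the downloaded output:

```python
import re

# Hypothetical line format; check the real output before relying on it.
LINE = re.compile(r"Label:\s*(\d+),\s*Prediction:\s*(\d+)")

def accuracy(lines):
    """Fraction of lines where the prediction matches the label."""
    pairs = [m.groups() for m in map(LINE.search, lines) if m]
    correct = sum(1 for label, pred in pairs if label == pred)
    return correct / len(pairs) if pairs else 0.0
```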

Appendix: Hadoop file operation commands

hdfs dfs -ls  # list a directory
hdfs dfs -ls xxx/ | wc -l  # count files and subdirectories under xxx
hdfs dfs -mkdir xxx  # create a directory
hdfs dfs -rm -r xxx  # delete a file or directory
hdfs dfs -put xxx data  # upload xxx into the data directory on HDFS
hdfs dfs -get xxx ./  # copy xxx (file or directory) from HDFS to the local machine

yarn application -kill application_1502181070712_0574  # kill a YARN application

spark-submit test.py  # run the script test.py
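When these HDFS operations need to be scripted, a small Python wrapper (a convenience sketch, not part of the example) can build and run `hdfs dfs` commands:

```python
import subprocess

def hdfs_cmd(*args):
    """Build the argument list for an `hdfs dfs` invocation."""
    return ["hdfs", "dfs", *args]

def hdfs(*args):
    """Run the command; requires the Hadoop client on this machine."""
    return subprocess.run(hdfs_cmd(*args), check=True)

# e.g. hdfs("-ls", "mnist") lists the HDFS mnist directory.
```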