想在JupyterNotebook中执行scala的原因是为了方便笔记和快速调试代码
找了蛮久,下面的可行
环境
- 环境:windows10
- python :Anaconda
- spark:spark-2.4.4-bin-hadoop2.7.tgz
- hadoop:hadoop-2.7.0.tar
- scala:scala-2.11.12
安装
本地要安装好上述环境,有问题可以参考:
https://zhuanlan.zhihu.com/p/23141422
https://zhuanlan.zhihu.com/p/23141509
下面执行对应程序:
# Step 1: Install spylon kernel
pip install spylon-kernel
# Step 2: create a kernel spec
python -m spylon_kernel install
# Step 3: start jupyter notebook
jupyter notebook
坑
如果JupyterNotebook安装了代码格式化插件,可能会报错。不管他,不影响代码执行
另外环境变量要显示的设置SPARK_HOME D:\code\spark
当scala的kernel启动时,findspark Will first check the SPARK_HOME env variable, and otherwise search common installation locations, e.g. from homebrew
所以要先设置,否则找到spark环境变量会报错
Ref
https://stackoverflow.com/questions/35563545/how-do-i-install-scala-in-jupyter-ipython-notebook
2019-12-06于南京市江宁区九龙湖