Prerequisite: a Spark on Hadoop environment is already installed.
Versions:
spark-3.2.1-bin-hadoop3.2
hadoop-3.2.2
Operating system:
Ubuntu 18.04
Installation steps:
1. Install pip3
apt-get update
apt-get upgrade
apt-get update
apt install python3-pip
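As a quick sanity check after the pip3 step above, the following minimal sketch asks the current Python interpreter for its pip version. The helper name pip_version is hypothetical and not part of the original steps. The sketch calls pip as a module (python -m pip) and avoids keyword arguments that need Python 3.7+, on the assumption that the stock Ubuntu 18.04 python3 is used.

```python
import subprocess
import sys


def pip_version():
    """Return the output of `python -m pip --version`, or None if pip is unavailable.

    pip_version is a hypothetical helper used only for illustration.
    """
    result = subprocess.run(
        [sys.executable, "-m", "pip", "--version"],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,  # decode output as text (works on Python 3.6)
    )
    if result.returncode != 0:
        return None
    return result.stdout.strip()


print(pip_version() or "pip is not available for this interpreter")
```

If this prints the fallback message, the python3-pip package is probably not installed for the interpreter that ran the script.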
2. Download and install notebook
pip3 install --user jupyterlab notebook
pip3 install --user findspark
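Once the two pip3 commands above have finished, a short check like the one below can confirm that the packages are importable. This is a minimal sketch: the helper name missing_packages is hypothetical, and the sketch assumes it is run by the same user and the same python3 that pip3 installed into, since --user installs go into that user's own site-packages directory.

```python
import importlib.util


def missing_packages(names):
    """Return the subset of top-level module names that cannot be found.

    missing_packages is a hypothetical helper used only for illustration.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


# Module names of the packages installed in this step.
missing = missing_packages(["jupyterlab", "notebook", "findspark"])
if missing:
    print("Not importable:", ", ".join(missing))
else:
    print("jupyterlab, notebook and findspark are importable")
```

Note that --user installs usually place launcher scripts such as jupyter under ~/.local/bin, so that directory may need to be on PATH before the commands can be run by name.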
Console output from the install commands in step 2:
root@zw001:/opt# pip3 install --user jupyterlab notebook -i https://pypi.org/simple
Collecting jupyterlab
Using cached https://files.pythonhosted.org/packages/4b/0d/03deff4501e9ffafe755e561e375ffa9f5822fec93a09ce1c7c5147bdcb3/jupyterlab-3.2.9-py3-none-any.whl
Collecting notebook
Downloading https://files.pythonhosted.org/packages/27/b7/7e602dc8b868bba8a542269205237b400be3427d8489b5851de5f7587996/notebook-6.4.10-py3-none-any.whl (9.9MB)
51% |████████████████▍ | 5.1MB 120kB/s eta 0:00:41
root@zw001:/usr/local# pip3 install --user findspark
WARNING: pip is being invoked by an old script wrapper. This will fail in a future version of pip.
Please see https://github.com/pypa/pip/issues/5599 for advice on fixing the underlying issue.
To avoid this problem you can invoke Python with '-m pip' instead of running pip directly.
Collecting fin