安装spconv库遇到的疑难杂症和解决方法

spconv安装步骤:

 $ sudo apt-get install libboost-all-dev
 $ git clone https://github.com/traveller59/spconv.git --recursive
 $ cd spconv && git checkout 7342772
 $ python setup.py bdist_wheel
 $ cd ./dist && pip install *

第一步:执行:sudo apt-get install libboost-all-dev

提示:

/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_ops_infer.so.8 is not a symbolic link

/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_adv_train.so.8 is not a symbolic link

/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_adv_infer.so.8 is not a symbolic link

/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_cnn_train.so.8 is not a symbolic link

/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_cnn_infer.so.8 is not a symbolic link

/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn.so.8 is not a symbolic link

/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_ops_train.so.8 is not a symbolic link

解决:
终端输入:

sudo ldconfig -v
(cp2) twilight@ROG: ~/project/CenterPoint/apex$ sudo ldconfig -v
/sbin/ldconfig.real: Can't stat /usr/local/lib/x86_64-linux-gnu: No such file or directory
/sbin/ldconfig.real: Path `/lib/x86_64-linux-gnu' given more than once
/sbin/ldconfig.real: Path `/usr/lib/x86_64-linux-gnu' given more than once
/usr/local/cuda-11.0/targets/x86_64-linux/lib:
	libnppim.so.11 -> libnppim.so.11.1.0.218
	libcublasLt.so.11 -> libcublasLt.so.11.1.0.229
	libnvjpeg.so.11 -> libnvjpeg.so.11.1.0.218
	libnvblas.so.11 -> libnvblas.so.11.1.0.229
	libcurand.so.10 -> libcurand.so.10.2.1.218
/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_ops_infer.so.8 is not a symbolic link

	libcudnn_ops_infer.so.8 -> libcudnn_ops_infer.so.8.0.5
	libnppist.so.11 -> libnppist.so.11.1.0.218
/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_adv_train.so.8 is not a symbolic link

	libcudnn_adv_train.so.8 -> libcudnn_adv_train.so.8.0.5
	libcusolver.so.10 -> libcusolver.so.10.5.0.218
	libnppial.so.11 -> libnppial.so.11.1.0.218
	libnppidei.so.11 -> libnppidei.so.11.1.0.218
	libnvrtc.so.11.0 -> libnvrtc.so.11.0.194
	libnppicc.so.11 -> libnppicc.so.11.1.0.218
	libaccinj64.so.11.0 -> libaccinj64.so.11.0.194
	libnppisu.so.11 -> libnppisu.so.11.1.0.218
	libnppig.so.11 -> libnppig.so.11.1.0.218
	libnppitc.so.11 -> libnppitc.so.11.1.0.218
	libcublas.so.11 -> libcublas.so.11.1.0.229
	libcufftw.so.10 -> libcufftw.so.10.2.0.218
	libcuinj64.so.11.0 -> libcuinj64.so.11.0.194
	libcudart.so.11.0 -> libcudart.so.11.0.194
/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_adv_infer.so.8 is not a symbolic link

	libcudnn_adv_infer.so.8 -> libcudnn_adv_infer.so.8.0.5
/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_cnn_train.so.8 is not a symbolic link

	libcudnn_cnn_train.so.8 -> libcudnn_cnn_train.so.8.0.5
/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_cnn_infer.so.8 is not a symbolic link

	libcudnn_cnn_infer.so.8 -> libcudnn_cnn_infer.so.8.0.5
	libnpps.so.11 -> libnpps.so.11.1.0.218
	libnppif.so.11 -> libnppif.so.11.1.0.218
	libnvToolsExt.so.1 -> libnvToolsExt.so.1.0.0
	libnvrtc-builtins.so.11.0 -> libnvrtc-builtins.so.11.0.194
/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn.so.8 is not a symbolic link

	libcudnn.so.8 -> libcudnn.so.8.0.5
	libcusolverMg.so.10 -> libcusolverMg.so.10.5.0.218
/sbin/ldconfig.real: /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn_ops_train.so.8 is not a symbolic link

	libcudnn_ops_train.so.8 -> libcudnn_ops_train.so.8.0.5
	libcufft.so.10 -> libcufft.so.10.2.0.218
	libnppc.so.11 -> libnppc.so.11.1.0.218
	libcusparse.so.11 -> libcusparse.so.11.1.0.218
	libOpenCL.so.1 -> libOpenCL.so.1.0.0

找到这一行错误:libcudnn.so.8 -> libcudnn.so.8.0.5
是这个链接错误,然后在终端输入:

sudo ln -sf /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn.so.8.0.5 /usr/local/cuda-11.0/targets/x86_64-linux/lib/libcudnn.so.8

##第二步: 执行:git clone https://github.com/traveller59/spconv.git --recursive

问题:第三方库下载不下来
解决:分别用git clone下载第三方库,放到指定文件夹中。
其中 pybind11 会提前下载好一个空文件,要先删除,然后分别执行以下命令:

git clone https://github.com.cnpmjs.org/NVIDIA/cutlass
git clone https://github.com.cnpmjs.org/boostorg/mp11
git clone https://github.com.cnpmjs.org/pybind/pybind11.git

第四步:执行:python setup.py bdist_wheel

先进入到cuDNN的安装文件夹

 cd cudnn

确保把cudnn中的cudnn_version.h复制到了cuda目录(新版本cudnn的版本信息包含在cudnn_version.h而不是cudnn.h,安装cudnn时把所有cudnn开头的都复制过去)

sudo cp cuda/include/cudnn* /usr/local/cuda/include

找到cuda.cmake文件

locate  cuda.cmake

我这里cuda.cmake的目录是: /home/twilight/.conda/envs/cp2/lib/python3.7/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake

 code /home/twilight/.conda/envs/cp2/lib/python3.7/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake

替换 : file(READ ${CUDNN_INCLUDE_PATH}/cudnn.h CUDNN_HEADER_CONTENTS)
为 : file(READ ${CUDNN_INCLUDE_PATH}/cudnn_version.h CUDNN_HEADER_CONTENTS)

然后再执行:

python setup.py bdist_wheel

报错:

/home/twilight/project/CenterPoint0/spconv/src/spconv/all.cc:20:91: error: no matching function for call to ‘torch::jit::RegisterOperators::RegisterOperators(const char [28], <unresolved overloaded function type>)’
     torch::jit::RegisterOperators("spconv::get_indice_pairs_2d", &spconv::getIndicePair<2>)

修改:

code /home/twilight/project/CenterPoint0/spconv/src/spconv/all.cc

参考:
替换:torch::jit::RegisterOperators
为:torch::RegisterOperators

参考链接:
– Found cuDNN: v? (include: /usr/include, library: /usr/lib/x86_64-linux-gnu/libcudnn.so) CMake Er

  • 6
    点赞
  • 20
    收藏
    觉得还不错? 一键收藏
  • 1
    评论
在使用Docker时,可能会遇到一些疑难杂症。其中,一些常见的问题及解决办法如下: 1. 运行docker version时报错"Cannot connect to the Docker daemon at unix:///var/run/docker.sock. Is the docker daemon running?"这个错误通常是由于Docker守护进程未启动引起的。可以通过运行以下命令来启动守护进程:`sudo systemctl start docker`(适用于基于systemd的Linux发行版)。如果您不是使用systemd,请根据您的操作系统和版本来启动Docker守护进程。 2. 使用yum安装Docker时报错"Cannot retrieve metalink for repository: epel. Please verify its path and try again."这个错误通常是由于epel源(Extra Packages for Enterprise Linux)未正确安装或配置引起的。您可以尝试以下解决办法: - 首先,确保您的系统与互联网连接正常。 - 检查您的操作系统和版本,并根据官方文档正确安装epel源。 - 如果您已经安装了epel源,但仍然遇到这个错误,请尝试更新epel源并再次运行安装命令。 这些是一些常见的Docker疑难杂症及其解决办法。当然,Docker的使用过程中可能还会遇到其他问题,您可以参考官方文档、社区论坛或搜索引擎来寻找更多解决办法。<span class="em">1</span><span class="em">2</span><span class="em">3</span> #### 引用[.reference_title] - *1* [docker 疑难杂症](https://blog.csdn.net/weixin_33805992/article/details/92266045)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_1"}}] [.reference_item style="max-width: 50%"] - *2* *3* [docker常见疑难杂症](https://blog.csdn.net/weixin_45776707/article/details/103142818)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_1"}}] [.reference_item style="max-width: 50%"] [ .reference_list ]

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值