Reference project: https://github.com/zhongTao99/ollama
Pull the image directly with Docker and save it as a tarball:
docker pull leopony/ollama:latest
docker save -o ollama.tar leopony/ollama:latest
Convert the Docker image to a Singularity image:
singularity build ollama.sif docker-archive:ollama.tar
Start a shell in the container with the NPU devices and driver files bound in:
singularity exec \
--contain \
--bind /dev/davinci0:/dev/davinci0 \
--bind /dev/davinci_manager:/dev/davinci_manager \
--bind /dev/devmm_svm:/dev/devmm_svm \
--bind /dev/hisi_hdc:/dev/hisi_hdc \
--bind /usr/local/dcmi:/usr/local/dcmi \
--bind /usr/local/bin/npu-smi:/usr/local/bin/npu-smi \
--bind /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
--bind /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
--bind /etc/ascend_install.info:/etc/ascend_install.info \
--bind /share/home/lyzeng24/.ollama:/share/home/lyzeng24/.ollama \
--network-args "portmap=11434:11434/tcp" \
ollama.sif /bin/bash \
-c "source /usr/local/Ascend/ascend-toolkit/set_env.sh && /bin/bash"
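The command above binds many host paths, and a missing bind source typically makes the launch fail. A minimal sketch of a pre-flight check, assuming a POSIX shell on the host; the helper name missing_paths is hypothetical and the path list is copied from the command above:

```shell
#!/bin/sh
# Print every argument that does not exist on the host, one per line.
# (missing_paths is a hypothetical helper, not part of Singularity or Ollama.)
missing_paths() {
    for p in "$@"; do
        [ -e "$p" ] || printf '%s\n' "$p"
    done
}

# Example usage with the bind sources from the singularity exec command:
# missing_paths \
#     /dev/davinci0 /dev/davinci_manager /dev/devmm_svm /dev/hisi_hdc \
#     /usr/local/dcmi /usr/local/bin/npu-smi \
#     /usr/local/Ascend/driver/lib64/ \
#     /usr/local/Ascend/driver/version.info \
#     /etc/ascend_install.info \
#     /share/home/lyzeng24/.ollama
```

An empty result means every bind source is present; any path printed should be fixed or dropped from the bind list before launching.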
Inside that shell, start the Ollama server in the background:
ollama serve &
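Once the server is up, a single non-streaming request to Ollama's /api/generate endpoint shows whether inference actually works, separately from device detection and model loading. The sketch below is a rough check under assumptions: the variable OLLAMA_URL and the helper names are hypothetical, the server is assumed to listen on the default port 11434, and the model name passed in must be one that is already pulled.

```shell
#!/bin/sh
# Base URL of the server (hypothetical variable; 11434 is Ollama's default port).
OLLAMA_URL="${OLLAMA_URL:-http://localhost:11434}"

# Build the JSON body for a non-streaming /api/generate request.
# Note: no JSON escaping is done, so keep the model name and prompt free of quotes.
generate_payload() {
    printf '{"model": "%s", "prompt": "%s", "stream": false}' "$1" "$2"
}

# Send one short prompt. curl exits non-zero on connection errors,
# on HTTP status >= 400 (--fail), or when the request exceeds 120 seconds.
try_generate() {
    curl --silent --show-error --fail --max-time 120 \
        -H 'Content-Type: application/json' \
        -d "$(generate_payload "$1" "$2")" \
        "$OLLAMA_URL/api/generate"
}

# Example usage (replace <model> with a model that is already pulled):
# try_generate "<model>" "Say hello" || echo "inference request failed"
```

If the request hangs or fails while `ollama serve` is running, the server log from that shell is the place to look for the underlying error.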
Although the Huawei Ascend NPU is detected correctly and the model loads successfully, inference does not work.
Possible cause:
ollama_Atlas_A2_series_cann8.0.rc2.bin.gz
This binary may not be compatible with the 910B3; it was built for the Atlas A2 series.
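To check this hypothesis, it helps to confirm which chip the host actually reports. A minimal sketch, assuming `npu-smi info` prints the chip name (for example 910B3) somewhere in its output; the helper name chip_names is hypothetical and the pattern is a guess at the naming scheme, not an official format:

```shell
#!/bin/sh
# Extract the distinct Ascend 910-family chip names (e.g. 910B3)
# from text on stdin. Prints nothing if no such name is found.
chip_names() {
    grep -oE '910[A-Za-z0-9]*' | sort -u
}

# Example usage on the host or inside the container:
# npu-smi info | chip_names
```

The reported name can then be compared against the hardware list that the ollama_Atlas_A2_series_cann8.0.rc2.bin.gz build claims to support.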