cd ..
uv venv ./venv --python 3.11--python-preference=only-managed
source venv/bin/activate
实验性使用flashinfer替代triton
uv pip install flashinfer-python
# uv pip install flashinfer-python
Using Python 3.11.11 environment at: venv
Resolved 26 packages in 2.16s
Built flashinfer-python==0.2.2.post1
Prepared 1package in 3.25s
░░░░░░░░░░░░░░░░░░░░ [0/26] Installing wheels... warning: Failed to hardlink files; falling back to full copy. This may lead to degraded performance.
If the cache and target directories are on different filesystems, hardlinking may not be supported.
If this is intentional, set `export UV_LINK_MODE=copy` or use `--link-mode=copy` to suppress this warning.
Installed 26 packages in 14.92s
+ filelock==3.17.0+ flashinfer-python==0.2.2.post1
+ fsspec==2025.2.0+ jinja2==3.1.5+ markupsafe==3.0.2+ mpmath==1.3.0+ networkx==3.4.2+ ninja==1.11.1.3+ numpy==2.2.3+ nvidia-cublas-cu12==12.4.5.8+ nvidia-cuda-cupti-cu12==12.4.127+ nvidia-cuda-nvrtc-cu12==12.4.127+ nvidia-cuda-runtime-cu12==12.4.127+ nvidia-cudnn-cu12==9.1.0.70+ nvidia-cufft-cu12==11.2.1.3+ nvidia-curand-cu12==10.3.5.147+ nvidia-cusolver-cu12==11.6.1.9+ nvidia-cusparse-cu12==12.3.1.170+ nvidia-cusparselt-cu12==0.6.2+ nvidia-nccl-cu12==2.21.5+ nvidia-nvjitlink-cu12==12.4.127+ nvidia-nvtx-cu12==12.4.127+ sympy==1.13.1+ torch==2.6.0+ triton==3.2.0+ typing-extensions==4.12.2