1. nvcc fatal : Unsupported gpu architecture ‘compute_86’

# vim ~/.bashrc
export TORCH_CUDA_ARCH_LIST="8.0"  # 因为是CUDA11.0,对应的算力为8.0
# source  ~/.bashrc

需要这个的原因,可能是下面的nvidia-cuda-toolkit残留,导致被判定为cuda11.0了,不过设置了后,问题也不出现了

2.fatal error: cuda_fp8.h: No such file or directory

解决办法是:
在这里插入图片描述
因为系统上 nvidia-cuda-toolkit 也 存在 NVCC 在 /usr/bin下面

which nvcc
dpkg -L nvidia-cuda-toolkit

如果上面两个命令都有nvcc路径,并且不一致,就是不行的,需要卸载nvidia-cuda-toolkit

sudo apt-get remove --purge nvidia-cuda-toolkit

3. pip 安装中 git下载失败的问题

Cloning into ‘cutlass-src’…
remote: Enumerating objects: 27428, done.
remote: Counting objects: 16% (1/6)
remote: Counting objects: 33% (2/6)
remote: Counting objects: 50% (3/6)
remote: Counting objects: 66% (4/6)
remote: Counting objects: 83% (5/6)
remote: Counting objects: 100% (6/6)
remote: Counting objects: 100% (6/6), done.
remote: Compressing objects: 16% (1/6)
remote: Compressing objects: 33% (2/6)
remote: Compressing objects: 50% (3/6)
remote: Compressing objects: 66% (4/6)
remote: Compressing objects: 83% (5/6)
remote: Compressing objects: 100% (6/6)
remote: Compressing objects: 100% (6/6), done.
Receiving objects: 0% (1/27428)
Receiving objects: 1% (275/27428)
Receiving objects: 2% (549/27428), 188.01 KiB | 358.00 KiB/s
Receiving objects: 2% (565/27428), 316.01 KiB | 276.00 KiB/s
Receiving objects: 2% (620/27428), 428.01 KiB | 258.00 KiB/s
。。。
Receiving objects: 22% (6143/27428), 14.37 MiB | 142.00 KiB/s
error: RPC failed; curl 92 HTTP/2 stream 0 was not closed cleanly: CANCEL (err 8)
error: 39 bytes of body are still expected
fetch-pack: unexpected disconnect while reading sideband packet
fatal: early EOF
fatal: fetch-pack: invalid index-pack output
Had to git clone more than once: 3 times.
CMake Error at cutlass-subbuild/cutlass-populate-prefix/tmp/cutlass-populate-gitclone.cmake:50 (message):
Failed to clone repository: ‘https://github.com/nvidia/cutlass.git’

设置GIT_CURL_VERBOSE=1就行,此处是禁用 HTTP/2

MAX_JOBS=8 GIT_CURL_VERBOSE=1 pip install -e .
Logo

欢迎来到FlagOS开发社区,这里是一个汇聚了AI开发者、数据科学家、机器学习爱好者以及业界专家的活力平台。我们致力于成为业内领先的Triton技术交流与应用分享的殿堂,为推动人工智能技术的普及与深化应用贡献力量。

更多推荐