I installed Open MPI. When I run my program on two nodes with mpirun, I get the error below. Can anyone help me figure out what is causing it?
shell$: /usr/local/openmpi/bin/mpiexec -np 2 --hostfile nodeinfo ./test
Error message:
Primary job terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
./test: error while loading shared libraries: libcudart.so.9.0: cannot open shared object file: No such file or directory
./test: error while loading shared libraries: libcudart.so.9.0: cannot open shared object file: No such file or directory
mpiexec detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:
Process name: [[65150,1],0]