【cudaFree() failed. Reason: driver shutting down 】

问题说明：
定义检测网络时，定义为全局变量会引发问题，但是定义为局部变量就没有这个问题。哪位大神遇到过类似情况，请求指教。。

//net_type detect_net;
//anet_type feature_net;

//示例代码：
#include
#include
#include
#include
//#include

#include

using namespace std;
using namespace dlib;

// ----------------------------------------------------------------------------------------
//hog face detecting

template class,int,typename> class block, int N, templateclass BN, typename SUBNET>
using residual = add_prev1>>;

template class,int,typename> class block, int N, templateclass BN, typename SUBNET>
using residual_down = add_prev2>>>>>;

template class BN, int stride, typename SUBNET>
using block = BN>>>>;

template using ares = relu>;
template using ares_down = relu>;

template using alevel0 = ares_down;
template using alevel1 = ares>>;
template using alevel2 = ares>>;
template using alevel3 = ares>>>;
template using alevel4 = ares>>;

using anet_type = loss_metric alevel0 alevel1 alevel2 alevel3 alevel4 max_pool input_rgb_image_sized
>>>>>>>>>>>>;

// ------------------------------------------------------------------------------------------------------------
// dnn face detecting

template using con5d = con;
template using con5 = con;

template using downsampler = relu>>>>>>>>;
template using rcon5 = relu>>;

using net_type = loss_mmod>>>>>>>;

// ------------------------------------------------------------------------------------------------------------
// define the global variable

//shape_predictor sp;
//net_type detect_net;
//anet_type feature_net;

void sc_face_recog_init();
void sc_face_recog_init()
{
/*
shape_predictor sp;
net_type detect_net;
anet_type feature_net;

*/

deserialize("mmod_human_face_detector.dat") >> detect_net;
deserialize("shape_predictor_5_face_landmarks.dat") >> sp;
deserialize("dlib_face_recognition_resnet_model_v1.dat") >> feature_net;

}

int main(int argc, char *argv[])
{
printf("hello\n\n\n\n");
//运行是出现错误 cudaFree() failed. Reason: driver shutting down
// cudaFreeHost() failed. Reason: driver shutting down
sc_face_recog_init();
detect_net.clean();

return 0;

}

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
KMCDTC 2018-01-30 08:57
关注
驱动缺少，不可以把他重复使用

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

【tensorrt】——全局静态变量释放cudastream时，报driver shutting down问题?
2020-10-20 18:04

农夫山泉2号的博客声明一个全局类，在类的析构函数中会释放显存。 ... int main(int argc, char* argv[]){ std::string engine = "model/yolov4....CUDA error driver shutting down at /home/data/CM/3_image_classification/pytorch_
ubuntu darknet: ./src/cuda.c:36: check_error: Assertion `0' failed.
2020-03-10 14:09

荪荪的博客报错如上图所示 0 CUDA Error: unknown error darknet: ./src/cuda.c:36: check_error: Assertion `0’ failed. 解决方法：加sudo权限：即加一个sudo 【编译的时候，用的root权限的原因吧】 ...
ctx-＞cvdl-＞cuvidGetDecoderCaps(&ctx-＞caps8) failed -＞ CUDA_ERROR_DEINITIALIZED: driver shutting down
2020-11-15 08:14

柳鲲鹏的博客 0x7f38c8028180] ctx->cvdl->cuvidGetDecoderCaps(&ctx->caps8) failed -> CUDA_ERROR_DEINITIALIZED: driver shutting down [h264_cuvid @ 0x7f38c8028180] ctx->cvdl->cuvidGetDecoderCaps(&ctx->caps10) failed ...
linux cuda 异常退出,cudaErrorCudartUnloading问题排查及建议方案
2021-05-19 02:48

酱婆的美学的博客原文请猛戳这里敲黑板划重点——顺求异构计算/高性能计算/CUDA/ARM优化类开发职位最近一段时间一直在负责做我厂神经网络前向框架库的优化，前几天接了一个...due to "driver shutting down" on CUDA API call to cud...
超详细版讲解importerror: libcudart.so.11.0的各种触发场景
2026-01-18 07:22

Neo-ke的博客 cudaMalloc , cudaFree ） - 管理流（Streams）与事件（Events）当你 import 一个支持 CUDA 的 Python 包时，它的底层 .so 模块会尝试加载 libcudart.so.11.0 。如果失败，Python 就抛出那个熟悉的 ImportError 。 ...
CUDA_CHECK(cudaFree(...))报错CUDA error 1
2023-08-02 10:05

我现在强的可怕~的博客 GPT-3.5太好用了，报错情况如下：总结一下, 在使用cudaFree释放之前cudaMalloc()分配的GPU内存时，报错cuda error,最有可能的几个原因就是：试图释放已经释放的gpu内存，在调用cudafree（）时确保没有重复释放相同...
解决报错ImportError: unique_cuda.cpython-37m-x86_64-linux-gnu.so: undefined symbol: _ZN6caffe28T
2021-07-30 23:05

雨•人的博客最近在调试有关pytorch-geometric包的代码的时候遇到了这个错误，具体报错如下所示： ImportError: /home/amax/.conda/envs/SGE/lib/python3.7/site-packages/torch_sparse/unique_cuda.cpython-37m-x86_64-linux-...
main.cpp:(.text+0x1180): undefined reference to `cv::randn(cv::_InputOutputArray const&, cv::_InputA
2020-04-28 11:30

up_up_Rui的博客在运行KalmanFilter的时候需要用到opencv的库，按照之前的方式在CMakeLists.txt中： target_link_libraries(KalmanFilter ${OPENCV_LIBS} ) 但是发现make的时候找不到opencv，报错如下： main.cpp:(.text+0x1eb): ...
pointnet2_cuda.cpython-36m-x86_64-linux-gnu.so: undefined symbol: 的可能原因
2021-02-25 09:18

York1996的博客编译完的pointNet模块找不到，提示错误 Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/public/home/G19940018/3DGroup/Yaochun/PointRCNN/point...module>...
cuda thrust编译出错：tmpxft_testTrust.cudafe1.cpp:undefined reference to `operator delete(void*, unsigned
2019-03-29 21:37

乐观就完事了的博客在编译一个需要thrust里的vector的cu文件时，报如下的错： > nvcc testTrust.cu -o testT /tmp/tmpxft_00005f37_00000000-10_testTrust.o: In function `thrust::system::error_category::~error_category()': ...
【别折腾显卡驱动了】ImportError: libcudart.so.10.2: cannot open shared object file: No such file or directory
2024-01-15 18:57

幽蓝的天skylight的博客笔者的DL项目，在本地 PC (cuda 11.2)上运行良好，在服务器 (cuda 11.8)上报上面的错。排查发现，如果把DL项目里的模块B换成原作者使用的模块A，则运行顺利。单独运行模块B发现，问题出在模块B使用了dgl上（划重点：...
解决方法：Makefile:77: recipe for target ‘darknet‘ failed make: *** [darknet] Error 1
2021-07-26 16:26

Shadownow的博客 /usr/bin/ld: warning: libcudart.so.9.0, needed by /usr/local/lib/libopencv_core.so, not found (try using -rpath or -rpath-link) /usr/local/lib/libopencv_core.so: undefined reference to cudaFree@...
sampleMNIST.obj : error LNK2019: 无法解析的外部符号 cudaStreamCreate
2021-07-15 15:00

Mr.Q的博客 doInference@@YAXAEAVIExecutionContext@nvinfer1@@PEAM1H@Z) 中引用了该符号 1>sampleMNIST.obj : error LNK2019: 无法解析的外部符号 cudaFree，函数 "void __cdecl doInference(class nvinfer1::...
1>main.cu.obj : error LNK2005: _main 已经在 kernel.cu.obj 中定义
2016-10-10 16:22

qing101hua的博客 1>kernel.cu.obj : error LNK2019: 无法解析的外部符号 _cudaFree@4，该符号在函数 "enum cudaError __cdecl addWithCuda(int *,int const *,int const *,unsigned int)" (?addWithCuda@@YA?AW4cudaError@@PAHPBH1I@...
undefined symbol：__cuda……
2020-04-22 13:29

philipwelia的博客 undefined symbol：__cuda…… 解决方法： ①在出错文件夹下添加make.sh文件 ②保证编译pytorch的cuda版本与系统的cuda版本一致（例如：pytorch0.3.1对应Torchvision0.2.1对应cuda8.0） ...
解决libtorch安装编译链接时出错
2022-06-23 10:26

ゞωáиɡホ辛鴻ゾ的博客解决libtorch编译链接报错
如何在其他torch和cuda框架下安装pytorch3d（运行报错：undefined symbol: _ZNK2at10TensorBase8data_ptrIdEEPT_v）
2022-12-06 21:33

求卓的博客如何在非官方指定的gpu环境安装pytorch3d：下载原始代码，自行安装。
mma.sync.aligned.m16n8k16.row.col.f16.f16.f16.f16测试
2024-08-09 16:28

Hi20240217的博客本文演示了如何按PTX指令文档中的layout格式要求,加载数据,执行mma指令,并且跟numpy对比结果的一致性
报错：CUDA 运行时未能正确初始化，导致后续所有 CUDA 操作（如释放显存、销毁流）都失败
2025-11-19 14:22

ssliq的博客 Reason: initialization error cudaFree() failed. Reason: initialization error cudaFreeHost() failed. Reason: initialization error cudaStreamDestroy() failed. Reason: initialization error cudaFree() ...
一次惨痛的debug的经历-RuntimeError: CUDA error: an illegal memory access was encountered
2021-07-29 20:05

YongjieShi的博客之所以说惨痛是有原因的。这个错误有人严重怀疑是显卡和pytorch二者之一有一个是有问题的，也曾经想一度放弃，最后还是分享我的解决方法是啥，不确定对大家都适用。一开始遇到这个错误，报的是我写的一个模块内的：...
没有解决我的问题, 去提问

【cudaFree() failed. Reason: driver shutting down 】

3条回答 默认 最新

3条回答默认最新