Tony Einstein 2021-11-13 10:01 采纳率: 45%
浏览 44
已结题

和鲸社区的GPU环境出现报错

报错场景:
运行torch1.8.0

报错内容:


Args in experiment:
Namespace(activation='gelu', attn='prob', batch_size=32, c_out=1, checkpoints='./checkpoints/', cols=None, d_ff=2048, d_layers=1, d_model=512, data='chicken', data_path='月均价.csv', dec_in=1, des='test', detail_freq='m', devices='0,1,2,3', distil=True, do_predict=True, dropout=0.1, e_layers=2, embed='timeF', enc_in=1, factor=5, features='S', freq='m', gpu=0, inverse=True, itr=100, label_len=6, learning_rate=0.0001, loss='mse', lradj='type1', mix=True, model='informer', n_heads=8, num_workers=0, output='./output', output_attention=False, padding=0, patience=5, pred_len=1, random_choos=True, root_path='./data/chicken/', s_layers=[3, 2, 1], seed=12345, seq_len=12, target='price', train_epochs=100, use_amp=False, use_gpu=True, use_multi_gpu=False)
提示:由于未来还没有发生,在真实值数据中没有这个月份数据,故而无法画出未来预测值~未来值的对比图!
Program to continue!>>>
Use GPU: cuda:0
>>>>>>>start training :  informer_chicken_ftS_sl12_ll6_pl1_dm512_nh8_el2_dl1_df2048_atprob_fc5_ebtimeF_dtTrue_mxTrue_test_0  >>>>>>>>>>>>>>>>>>>>>>>>>>
train 104
val 18
test 33
Traceback (most recent call last):
  File "main_informer.py", line 289, in <module>
    model,info_dict,all_epoch_train_loss,all_epoch_vali_loss,all_epoch_test_loss,epoch_count = exp.train(setting,info_dict,run_name_dir_ckp,run_ex_dir)
  File "/home/mw/project/exp/exp_informer.py", line 240, in train
    pred, true = self._process_one_batch(train_data, batch_x, batch_y, batch_x_mark, batch_y_mark)
  File "/home/mw/project/exp/exp_informer.py", line 498, in _process_one_batch
    outputs = self.model(batch_x, batch_x_mark, dec_inp, batch_y_mark)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/mw/project/models/model.py", line 69, in forward
    enc_out = self.enc_embedding(x_enc, x_mark_enc)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/mw/project/models/embed.py", line 107, in forward
    x = self.value_embedding(x) + self.position_embedding(x) + self.temporal_embedding(x_mark)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/mw/project/models/embed.py", line 37, in forward
    x = self.tokenConv(x.permute(0, 2, 1)).transpose(1,2)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/conv.py", line 263, in forward
    return self._conv_forward(input, self.weight, self.bias)
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/conv.py", line 256, in _conv_forward
    return F.conv1d(F.pad(input, self._reversed_padding_repeated_twice, mode=self.padding_mode),
RuntimeError: cuDNN error: CUDNN_STATUS_NOT_INITIALIZED

  • 写回答

0条回答 默认 最新

    报告相同问题?

    问题事件

    • 系统已结题 11月21日
    • 创建了问题 11月13日

    悬赏问题

    • ¥20 西门子S7-Graph,S7-300,梯形图
    • ¥50 用易语言http 访问不了网页
    • ¥50 safari浏览器fetch提交数据后数据丢失问题
    • ¥15 matlab不知道怎么改,求解答!!
    • ¥15 永磁直线电机的电流环pi调不出来
    • ¥15 用stata实现聚类的代码
    • ¥15 请问paddlehub能支持移动端开发吗?在Android studio上该如何部署?
    • ¥20 docker里部署springboot项目,访问不到扬声器
    • ¥15 netty整合springboot之后自动重连失效
    • ¥15 悬赏!微信开发者工具报错,求帮改