亚大伯斯 2025-06-20 13:40 采纳率: 98%

已采纳

PyTorch报错：RuntimeError: Expected all tensors to be on the same device，如何解决？

在PyTorch开发中，经常遇到`RuntimeError: Expected all tensors to be on the same device`错误。该问题通常发生在模型、输入张量或优化器涉及多个设备（如CPU和GPU）时，部分张量未被正确分配到同一设备。 **解决方法：** 1. 确保模型和输入数据在同一设备上。例如，将模型和张量移动到GPU（若有）： ```python device = torch.device("cuda" if torch.cuda.is_available() else "cpu") model.to(device) input_tensor = input_tensor.to(device) ``` 2. 检查模型参数、优化器状态是否一致。如果模型迁移到GPU，优化器中的参数也需要同步更新： ```python optimizer.state = {k: v.cuda() for k, v in optimizer.state.items()} ``` 3. 遍历复杂数据结构（如列表或字典），确保所有张量都在同一设备上。通过以上步骤，可有效避免设备不匹配问题。

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

Qianwei Cheng 2025-06-20 13:41

关注

1. 问题概述

在PyTorch开发过程中，RuntimeError: Expected all tensors to be on the same device 是一个常见的错误。该问题通常发生在模型、输入张量或优化器涉及多个设备（如CPU和GPU）时，部分张量未被正确分配到同一设备。

以下是该问题的常见场景：

模型定义在GPU上，但输入数据仍位于CPU。
使用了多个GPU进行训练，但张量未正确同步。
优化器的状态字典未与模型参数保持一致。

2. 错误分析

当PyTorch执行计算图时，要求所有参与计算的张量必须位于同一设备上。如果模型的一部分在GPU上，而另一部分在CPU上，则会导致设备不匹配错误。

以下是一个简单的代码示例：


import torch
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = MyModel().to(device)  # 模型加载到GPU
input_tensor = torch.randn(1, 3, 224, 224)  # 输入张量默认在CPU
output = model(input_tensor)  # 报错：设备不匹配

上述代码中，input_tensor 仍在CPU上，而模型已加载到GPU，因此触发错误。

3. 解决方案

为了解决设备不匹配问题，可以按照以下步骤逐步排查和修复：

3.1 确保模型和输入数据在同一设备上

将模型和张量显式地移动到相同的设备上：


device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model.to(device)
input_tensor = input_tensor.to(device)

通过这种方式，可以确保模型和输入数据都在同一设备上。

3.2 同步优化器状态

如果模型从CPU迁移到GPU，优化器中的参数也需要同步更新：


for state in optimizer.state.values():
    for k, v in state.items():
        if isinstance(v, torch.Tensor):
            state[k] = v.to(device)

这段代码遍历优化器的状态字典，并将所有张量移动到目标设备。

3.3 遍历复杂数据结构

在实际项目中，输入数据可能是嵌套的列表或字典。此时需要递归地检查并移动所有张量：


def move_to_device(obj, device):
    if isinstance(obj, torch.Tensor):
        return obj.to(device)
    elif isinstance(obj, dict):
        return {k: move_to_device(v, device) for k, v in obj.items()}
    elif isinstance(obj, list):
        return [move_to_device(v, device) for v in obj]
    else:
        return obj

data = {"tensor1": torch.randn(3, 3), "tensor2": torch.randn(2, 2)}
data = move_to_device(data, device)

此函数支持递归处理嵌套数据结构，确保每个张量都正确移动到目标设备。

4. 流程图

以下是解决设备不匹配问题的流程图：

graph TD;
    A[开始] --> B[检查模型设备];
    B --> C{模型是否在GPU？};
    C --否--> D[将模型移动到GPU];
    C --是--> E[检查输入数据设备];
    E --> F{输入数据是否在GPU？};
    F --否--> G[将输入数据移动到GPU];
    F --是--> H[检查优化器状态];
    H --> I{优化器状态是否一致？};
    I --否--> J[同步优化器状态];
    I --是--> K[完成];

5. 总结性表格

以下是常见问题及解决方案的对比表：

问题类型	原因	解决方案
模型与输入设备不匹配	模型在GPU，输入在CPU	使用`.to(device)`统一设备
优化器状态不一致	模型迁移后优化器未更新	手动同步优化器状态
复杂数据结构问题	嵌套数据中部分张量未移动	递归遍历并移动张量

本回答被题主选为最佳回答 , 对您是否有帮助呢?

报告相同问题？

关注问题

报错：RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda
2024-11-06 17:55

小白阿伟2024版的博客在使用transformers中自带的类SuppressTokensLogitsProcessor出现报错，显示张量不在一个设备上。我特地将代码中的张量都加上to(device)仍然报错。
Pytorch——报错解决：RuntimeError: Expected all tensors to be on the same device, but found at least two
2022-07-12 16:31

William.csj的博客 tensors 一会在CPU运算，一会在GPU运算。都改成在GPU运算。
【python报错已解决】RuntimeError: Expected all tensors to be on the same device, but found at least two dev
2024-08-15 09:32

鸽芷咕的博客在运行Python深度学习代码时，你是否遇到过`RuntimeError: Expected all tensors to be on the same device, but found at least two devices`这样的报错？这通常是由于在代码中混合使用了CPU和GPU运算，或者不同...
optimizer.step()处报错：RuntimeError: Expected all tensors to be on the same device, but found cuda:0
2023-02-16 16:11

Matrix_1127的博客 pytorch在加载预训练模型续训时在optimizer.step()处报错：RuntimeError: Expected all tensors to be on the same device, but found cuda:0 and cpu!
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cud
2024-07-29 20:14

Yan-英杰的博客 RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cud
【模型训练】解决问题：RuntimeError: Expected all tensors to be on the same device, but......
2024-07-12 23:06

子非鱼921的博客【代码】【模型训练】解决问题：RuntimeError: Expected all tensors to be on the same device, but......
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0
2024-08-30 20:31

zhangfeng1133的博客 RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument tensors in method wrapper_CUDA_cat) 这个错误再次指出了...
猫头虎分享如何解决 RuntimeError: Expected all tensors to be on the same device, but found at least two device
2024-06-10 23:14

虎头金猫的博客这个错误通常出现在深度学习模型训练过程中，尤其是在使用 ...问题原因解决方法避免措施张量分布在不同设备上使用to()方法将张量同步到同一设备养成良好的编码习惯，定期代码审查模型和数据未同步到同一设备使用to()
【pytorch】RuntimeError: Expected all tensors to be on the same device, but found at least two devices
2023-08-26 11:54

qq_44862379的博客学习pytorch途中将cpu代码转移到gpu上时遇到的一些错误。
解决RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cp
2022-05-26 09:00

AiCharm的博客 RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat2 in method wrapper_mm) 原因代码中的Tensor**，...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
已采纳回答 10月23日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 6月20日