解决PyTorch模型加载时的设备不匹配错误

在Easy-Wav2Lip项目中，我遇到了典型的设备不匹配问题。它表明模型权重（weight）和输入数据（input）不在同一个设备上，一个在CPU，另一个在GPU。

melonbo

328人浏览 · 2025-09-29 00:26:15

melonbo · 2025-09-29 00:26:15 发布

在Easy-Wav2Lip项目中，我遇到了典型的设备不匹配问题。
RuntimeError: Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same
它表明模型权重（weight）和输入数据（input）不在同一个设备上，一个在CPU，另一个在GPU。

🔧 问题排查

检查inference.py中哪里可能导致权重加载错误。

修改_load 函数

def _load(checkpoint_path):
    print(f"[DEBUG] 当前设备设置: {device}")
    print(f"[DEBUG] GPU ID: {gpu_id}")
    
    if device != "cpu":
        print(f"[DEBUG] 尝试加载到GPU/MPS设备")
        # 明确指定设备映射
        if device == 'cuda':
            checkpoint = torch.load(checkpoint_path, map_location='cuda')
        elif device == 'mps':
            checkpoint = torch.load(checkpoint_path, map_location='mps')
        else:
            checkpoint = torch.load(checkpoint_path)
    else:
        print(f"[DEBUG] 加载到CPU设备")
        checkpoint = torch.load(
            checkpoint_path, map_location=lambda storage, loc: storage
        )
    
    print(f"[DEBUG] 加载的checkpoint设备信息: {next(iter(checkpoint['state_dict'].values())).device if 'state_dict' in checkpoint else '未知'}")
    return checkpoint

修改 do_load 函数：

def do_load(checkpoint_path):
    global model, detector, detector_model
    
    print(f"[DEBUG] === 开始加载模型 ===")
    print(f"[DEBUG] 目标设备: {device}")
    
    model = load_model(checkpoint_path)
    
    # 添加模型设备检查
    print(f"[DEBUG] 主模型加载完成，检查设备:")
    if hasattr(model, 'parameters') and len(list(model.parameters())) > 0:
        first_param = next(model.parameters())
        print(f"[DEBUG] 模型参数设备: {first_param.device}")
    else:
        print(f"[DEBUG] 模型参数设备: 无法检测")
    
    detector = RetinaFace(
        gpu_id=gpu_id, model_path="checkpoints/mobilenet.pth", network="mobilenet"
    )
    detector_model = detector.model
    
    print(f"[DEBUG] === 模型加载完成 ===\n")

在 main 函数开始处添加设备信息：

def main():
    print(f"[SYSTEM] 最终使用的设备: {device}")
    print(f"[SYSTEM] CUDA可用: {torch.cuda.is_available()}")
    print(f"[SYSTEM] MPS可用: {torch.backends.mps.is_available() if hasattr(torch.backends, 'mps') else 'N/A'}")
    print(f"[SYSTEM] GPU ID: {gpu_id}")
    
    # 原有的main函数代码...

问题定位

运行代码，定位到函数do_load 。

(easy_wav) D:\work\easy-Wav2Lip\Easy-Wav2Lip>call run_loop.bat
opening GUI
Saving config
starting Easy-Wav2Lip...
Processing full.mp4 using playlist-file.wav for audio
imports loaded!
[DEBUG] === 开始加载模型 ===
[DEBUG] 目标设备: cuda
[DEBUG] 主模型加载完成，检查设备:
[DEBUG] 模型参数设备: cpu
[DEBUG] === 模型加载完成 ===

[SYSTEM] 最终使用的设备: cuda
[SYSTEM] CUDA可用: True
[SYSTEM] MPS可用: False
[SYSTEM] GPU ID: 0

解决办法

def do_load(checkpoint_path):
    global model, detector, detector_model
    
    # 获取当前设备配置
    device = 'cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu'
    gpu_id = 0 if torch.cuda.is_available() else -1
    
    print(f"[DEBUG] 当前设备: {device}, GPU ID: {gpu_id}")
    
    # 修改_load函数以正确处理设备映射
    def _load(checkpoint_path):
        if device == 'cuda' and torch.cuda.is_available():
            map_location = f'cuda:{gpu_id}' if gpu_id >= 0 else 'cuda'
        elif device == 'mps' and hasattr(torch.backends, 'mps') and torch.backends.mps.is_available():
            map_location = 'mps'
        else:
            map_location = 'cpu'
        
        print(f"[DEBUG] 使用设备映射: {map_location}")
        return torch.load(checkpoint_path, map_location=map_location)
    
    # 加载主模型
    checkpoint = _load(checkpoint_path)
    model.load_state_dict(checkpoint)
    
    # 确保模型在正确的设备上
    if device == 'cuda' and torch.cuda.is_available():
        model = model.cuda(gpu_id if gpu_id >= 0 else None)
    elif device == 'mps' and hasattr(torch.backends, 'mps') and torch.backends.mps.is_available():
        model = model.to('mps')
    
    print(f"[DEBUG] 主模型设备: {next(model.parameters()).device}")
    
    return model

这个解决方案的关键点在于：

动态设备检测：自动识别可用的计算设备
正确的map_location设置：确保权重加载到目标设备
设备一致性检查：验证模型和数据的设备一致性

火山引擎 ADG 社区

火山引擎开发者社区是火山引擎打造的AI技术生态平台，聚焦Agent与大模型开发，提供豆包系列模型（图像/视频/视觉）、智能分析与会话工具，并配套评测集、动手实验室及行业案例库。社区通过技术沙龙、挑战赛等活动促进开发者成长，新用户可领50万Tokens权益，助力构建智能应用。

更多推荐

Chess用户界面设计：Tailwind CSS样式系统和组件库

GitHub推荐项目精选中的ch/chess是一个类似chess.com的多人在线象棋平台，它采用现代化的前端技术栈构建，尤其在用户界面设计上通过Tailwind CSS样式系统和组件库实现了优雅且功能丰富的交互体验。本文将深入探讨该项目如何利用Tailwind CSS打造一致的设计语言和高效的组件系统，为象棋爱好者提供沉浸式的游戏界面。## 🎨 Tailwind CSS样式系统：构建统一视

火山引擎 ADG 社区

终极指南：GPT-Engineer如何通过AI自动发现代码问题并提升质量

GPT-Engineer是一款强大的AI驱动代码工具，它能帮助开发者自动检测潜在代码问题、优化代码质量，让编程效率提升3倍以上。无论是新手还是资深开发者，都能通过这款工具轻松发现代码中的隐藏缺陷，减少调试时间，释放更多精力在创造性工作上。## 一键发现代码问题：GPT-Engineer的AI审查魔力GPT-Engineer的核心能力在于其内置的智能代码分析系统。通过集成Python代码格式

火山引擎 ADG 社区

SatDump中的纠错编码技术：从RS码到Turbo码的完整实现指南

在卫星数据传输过程中，信号往往会受到各种干扰，导致数据错误。SatDump作为一款通用卫星数据处理软件，集成了多种先进的纠错编码技术，确保从卫星接收到的数据能够准确解码。本文将深入解析SatDump中从Reed-Solomon（RS）码到Turbo码的实现细节，帮助读者理解这些技术如何保障卫星通信的可靠性。## 为什么纠错编码对卫星数据至关重要？卫星与地面站之间的通信链路面临着空间辐射、大