tjdnbj · 2025-03-12 20:31

DeepSeek-VL2 deployment question

I'm trying to deploy DeepSeek-VL2. When I run the sample from the GitHub repo, I get the error below. How can I fix it?

import torch
from transformers import AutoModelForCausalLM

from deepseek_vl2.models import DeepseekVLV2Processor, DeepseekVLV2ForCausalLM
from deepseek_vl2.utils.io import load_pil_images


# specify the path to the model
model_path = "deepseek-ai/deepseek-vl2-small"
vl_chat_processor: DeepseekVLV2Processor = DeepseekVLV2Processor.from_pretrained(model_path)
tokenizer = vl_chat_processor.tokenizer

vl_gpt: DeepseekVLV2ForCausalLM = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)
vl_gpt = vl_gpt.to(torch.bfloat16).cuda().eval()

## single image conversation example
conversation = [
    {
        "role": "<|User|>",
        "content": "<image>\n<|ref|>The giraffe at the back.<|/ref|>.",
        "images": ["./images/visual_grounding.jpeg"],
    },
    {"role": "<|Assistant|>", "content": ""},
]

## multiple images (or in-context learning) conversation example
# conversation = [
#     {
#         "role": "User",
#         "content": "<image_placeholder>A dog wearing nothing in the foreground, "
#                    "<image_placeholder>a dog wearing a santa hat, "
#                    "<image_placeholder>a dog wearing a wizard outfit, and "
#                    "<image_placeholder>what's the dog wearing?",
#         "images": [
#             "images/dog_a.png",
#             "images/dog_b.png",
#             "images/dog_c.png",
#             "images/dog_d.png",
#         ],
#     },
#     {"role": "Assistant", "content": ""}
# ]

# load images and prepare for inputs
pil_images = load_pil_images(conversation)
prepare_inputs = vl_chat_processor(
    conversations=conversation,
    images=pil_images,
    force_batchify=True,
    system_prompt=""
).to(vl_gpt.device)

# run image encoder to get the image embeddings
inputs_embeds = vl_gpt.prepare_inputs_embeds(**prepare_inputs)

# run the model to get the response
outputs = vl_gpt.language_model.generate(
    inputs_embeds=inputs_embeds,
    attention_mask=prepare_inputs.attention_mask,
    pad_token_id=tokenizer.eos_token_id,
    bos_token_id=tokenizer.bos_token_id,
    eos_token_id=tokenizer.eos_token_id,
    max_new_tokens=512,
    do_sample=False,
    use_cache=True
)

answer = tokenizer.decode(outputs[0].cpu().tolist(), skip_special_tokens=True)
print(f"{prepare_inputs['sft_format'][0]}", answer)


OSError                                   Traceback (most recent call last)
Cell In[2], line 10
      8 # specify the path to the model
      9 model_path = "deepseek-ai/deepseek-vl2-small"
---> 10 vl_chat_processor: DeepseekVLV2Processor = DeepseekVLV2Processor.from_pretrained(model_path)
     11 tokenizer = vl_chat_processor.tokenizer
     13 vl_gpt: DeepseekVLV2ForCausalLM = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)

File ~/autodl-tmp/conda/envs/DeepSeek_VL2/lib/python3.8/site-packages/transformers/processing_utils.py:465, in ProcessorMixin.from_pretrained(cls, pretrained_model_name_or_path, cache_dir, force_download, local_files_only, token, revision, **kwargs)
    462 if token is not None:
    463     kwargs["token"] = token
--> 465 args = cls._get_arguments_from_pretrained(pretrained_model_name_or_path, **kwargs)
    466 processor_dict, kwargs = cls.get_processor_dict(pretrained_model_name_or_path, **kwargs)
    468 return cls.from_args_and_dict(args, processor_dict, **kwargs)

File ~/autodl-tmp/conda/envs/DeepSeek_VL2/lib/python3.8/site-packages/transformers/processing_utils.py:511, in ProcessorMixin._get_arguments_from_pretrained(cls, pretrained_model_name_or_path, **kwargs)
    508 else:
    509     attribute_class = getattr(transformers_module, class_name)
--> 511 args.append(attribute_class.from_pretrained(pretrained_model_name_or_path, **kwargs))
    512 return args

File ~/autodl-tmp/conda/envs/DeepSeek_VL2/lib/python3.8/site-packages/transformers/tokenization_utils_base.py:2032, in PreTrainedTokenizerBase.from_pretrained(cls, pretrained_model_name_or_path, cache_dir, force_download, local_files_only, token, revision, trust_remote_code, *init_inputs, **kwargs)
   2026     logger.info(
   2027         f"Can't load following files from cache: {unresolved_files} and cannot check if these "
   2028         "files are necessary for the tokenizer to operate."
   2029     )
   2031 if all(full_file_name is None for full_file_name in resolved_vocab_files.values()):
-> 2032     raise EnvironmentError(
   2033         f"Can't load tokenizer for '{pretrained_model_name_or_path}'. If you were trying to load it from "
   2034         "'https://huggingface.co/models', make sure you don't have a local directory with the same name. "
   2035         f"Otherwise, make sure '{pretrained_model_name_or_path}' is the correct path to a directory "
   2036         f"containing all relevant files for a {cls.__name__} tokenizer."
   2037     )
   2039 for file_id, file_path in vocab_files.items():
   2040     if file_id not in resolved_vocab_files:

OSError: Can't load tokenizer for 'deepseek-ai/deepseek-vl2-small'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure 'deepseek-ai/deepseek-vl2-small' is the correct path to a directory containing all relevant files for a LlamaTokenizerFast tokenizer.


4 answers

  • 道友老李 · 2025-03-12 20:31
    (This answer was drafted with the help of GPT.)
    This error is raised when the tokenizer files for the given model path cannot be found. Make sure the path in model_path is correct and, if you are loading a pretrained model from the Hugging Face Hub, that the model files have actually been downloaded (which requires that the machine can reach huggingface.co). Also, before running sample code from GitHub, read the repository's README and confirm that all required dependencies are installed. Possible fixes and code snippets follow; I hope they help, and let me know if you need further assistance.
    1. Make sure the pretrained model path is specified correctly:
    model_path = "deepseek-ai/deepseek-vl2-small"
    
    2. Make sure the model has actually been downloaded, e.g. with the huggingface_hub CLI (assuming a recent version of huggingface_hub is installed):
    huggingface-cli download deepseek-ai/deepseek-vl2-small
    
    3. Alternatively, try a different pretrained model path, or download the model into a local directory and point model_path at it; see the sketch below. The documentation of DeepseekVLV2Processor.from_pretrained() describes further parameters.
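    A minimal sketch of that local-download approach, using huggingface_hub.snapshot_download; the local_dir value and the mirror URL are illustrative assumptions, not part of the original question:
    
    import os
    from huggingface_hub import snapshot_download
    
    # Optional: if huggingface.co is unreachable from the machine, point
    # huggingface_hub at a mirror before downloading. The URL below is an
    # assumption; use whatever endpoint works in your environment.
    # os.environ["HF_ENDPOINT"] = "https://hf-mirror.com"
    
    # Fetch the whole repository (tokenizer, processor config, weights)
    # into a local directory, then load from that directory so that
    # from_pretrained never has to resolve files over the network.
    local_dir = snapshot_download(
        repo_id="deepseek-ai/deepseek-vl2-small",
        local_dir="./deepseek-vl2-small",  # hypothetical target directory
    )
    
    model_path = local_dir  # use this value in the original script
    
    Loading from a fully downloaded local directory also rules out the "local directory with the same name" shadowing case that the error message mentions.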
