springAI，vllm，调用mcp

我使用springAI，调用vllm，让大模型调用mcp工具，一直报400错误，以下是我编写的代码
我添加了依赖：


<dependency>
            <groupId>org.springframework.ai</groupId>
            <artifactId>spring-ai-openai-spring-boot-starter</artifactId>
            <version>1.0.0-SNAPSHOT</version>
        </dependency>
        <dependency>
            <groupId>org.springframework.ai</groupId>
            <artifactId>spring-ai-starter-mcp-client-webflux</artifactId>
            <version>1.0.0-M7</version>
        </dependency>
<dependencyManagement>
        <dependencies>
            <dependency>
                <groupId>org.springframework.ai</groupId>
                <artifactId>spring-ai-bom</artifactId>
                <version>1.0.0-M7</version>
                <type>pom</type>
                <scope>import</scope>
            </dependency>
        </dependencies>
    </dependencyManagement>

定义了chatClient:

@Bean
    ChatClient chatClient(ChatModel chatModel, List<McpSyncClient> mcpClients) {
        var toolCallbackProvider = new SyncMcpToolCallbackProvider(mcpClients);
        OpenAiChatOptions options = OpenAiChatOptions.builder()
                    .model("/home/ai/models/Qwen/Qwen2.5-VL-7B-Instruct-AWQ")
                    .temperature(0.7)
                    .maxTokens(500)
                    .build();

        return ChatClient
                .builder(chatModel)
                .defaultSystem("你是一个专业的金融领域教授")
                .defaultTools(toolCallbackProvider.getToolCallbacks())
                .defaultOptions(options)
                .build();
    }

然后我定义了controller用于接口暴露

public McpController(ChatClient chatClient) {
        this.chatClient = chatClient;
    }

    @RequestMapping(value = "/generate_stream", method = RequestMethod.GET)
    public Flux<ServerSentEvent<Object>> generateStream(HttpServletResponse response,
                                                        @RequestParam("id") String id,
                                                        @RequestParam("prompt") String prompt) {
        response.setCharacterEncoding("UTF-8");
        var messageChatMemoryAdvisor = new MessageChatMemoryAdvisor(chatMemory, id, 10);
        return this.chatClient
                .prompt(prompt)
                .advisors(messageChatMemoryAdvisor)
                .stream()
                .chatResponse()
                .map(data->{
                    String text=data.getResult().getOutput().getText();
                    return ServerSentEvent.builder()
                            .data(text)
                            .build();
                }).doOnError(error->{
                    System.out.println(error);
                });
    }

然后是application.yml文件配置了mcp工具调用：

spring:
  ai:
    openai:
      base-url: http://192.168.8.4:8002  # vLLM ip
      api-key: sk-no-key-required
    mcp:
      client:
        type: SYNC
        enabled: true
        name: call-mcp-server
        request-timeout: 30s
        stdio:
          servers-configuration: classpath:mcp-server.json

接着是mcp-server.json文件，调用的高德mcp

{
    "mcpServers": {
        "amap-maps": {
            "command": "cmd",
            "args": ["/c","npx","-y",
            "@amap/amap-maps-mcp-server"],
            "env": {
                "AMAP_MAPS_API_KEY": "e0a48e0ccdf0a45d3e4019809eb79633"
            }
        }
    }
}

这一套结束之后，我调用接口localhost:8080/generate_stream?id=2&prompt=@agen 规划去西安游玩路线，vllm就会直接报400错误，但是我不加defaultTools调用大模型没有问题，或者我用通义千问也可以回调mcp，就是用我自己的模型不行，有遇见相似问题的吗，共同探讨的吗

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

4条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
紫雾凌寒新星创作者: 人工智能技术领域 2025-04-24 10:59
关注
让 “紫雾凌寒” 来帮你回答,本回答结合 DeepSeek

如果还有问题，请评论或点击头像私信，如果有帮助到您, 请关注、采纳、点赞

关键点分析：
- 目前已知使用Spring AI调用VLLM时出现400错误，但提供的代码片段不完整，无法全面了解具体的调用逻辑和参数设置等情况。不过从报错400来看，很可能是请求参数格式不正确、参数缺失或者参数值不符合接收方的要求等原因导致。
解决方案：
- 首先，确保你的Spring AI配置和VLLM集成正确。假设你有一个简单的调用示例代码如下（这里只是示例框架，实际需根据你的业务逻辑和VLLM接口完善）：
```java
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestBody;
import org.springframework.web.bind.annotation.RestController;

import com.theokanning.openai.completion.CompletionRequest;
import com.theokanning.openai.service.OpenAiService;

@RestController
public class VllmController {

@Autowired private OpenAiService openAiService; @PostMapping("/vllm") public String callVllm(@RequestBody String input) { CompletionRequest completionRequest = CompletionRequest.builder() .prompt(input) .maxTokens(100) .build(); try { var completions = openAiService.createCompletion(completionRequest); return completions.getChoices().get(0).getText(); } catch (Exception e) { e.printStackTrace(); return "Error calling VLLM: " + e.getMessage(); } }

}
- 关于调用MCP工具部分，你需要明确MCP工具接收的请求格式和参数要求。如果MCP工具通过HTTP接口接收请求，你可能需要按照以下方式构建请求：java
import org.springframework.http.HttpEntity;
import org.springframework.http.HttpHeaders;
import org.springframework.http.MediaType;
import org.springframework.http.ResponseEntity;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RequestParam;
import org.springframework.web.bind.annotation.RestController;
import org.springframework.web.client.RestTemplate;

@RestController
public class McpController {

@GetMapping("/mcp") public String callMcp(@RequestParam String input) { RestTemplate restTemplate = new RestTemplate(); HttpHeaders headers = new HttpHeaders(); headers.setContentType(MediaType.APPLICATION_JSON); String requestBody = "{\"input\":\"" + input + "\"}";//假设MCP工具接收JSON格式请求，这里根据实际情况调整 HttpEntity<String> entity = new HttpEntity<>(requestBody, headers); ResponseEntity<String> response = restTemplate.postForEntity("MCP_TOOL_URL", entity, String.class); return response.getBody(); }

}
- 然后在你的业务逻辑中，将VLLM的输出作为输入传递给MCP工具的调用接口：java
@PostMapping(“/vllmAndMcp”)
public String callVllmAndMcp(@RequestBody String input) {
String vllmOutput = callVllm(input);
if (!vllmOutput.startsWith(“Error”)) {
return callMcp(vllmOutput);
} else {
return vllmOutput;
}
}
`` 3. **多种解决方案的优缺点**： - **优点**： - **代码分层清晰**：上述代码示例将VLLM调用、MCP工具调用以及业务逻辑处理分别封装在不同的方法中，便于维护和扩展。例如，如果VLLM的调用逻辑发生变化，只需要修改callVllm方法，不会影响到其他部分的代码。 - **便于调试**：每个方法都有明确的功能，当出现问题时，可以方便地在每个方法内部进行调试，查看具体的参数传递和调用结果。比如在callVllm方法中打印completionRequest的内容，以检查请求参数是否正确。 - **缺点**： - **依赖外部服务**：过度依赖VLLM和MCP工具的接口稳定性。如果这些外部服务出现故障或者接口发生变化，可能需要对代码进行较大的修改。例如，MCP工具的请求格式改变，就需要同时修改callMcp`方法中的请求构建部分。
- 性能开销：每次调用VLLM和MCP工具都需要进行网络请求，可能会带来一定的性能开销。特别是在高并发场景下，频繁的网络调用可能会影响系统的整体性能。
4. 总结：
- 解决这个问题的关键在于仔细检查请求参数的格式和内容是否符合VLLM和MCP工具的要求。通过完善代码逻辑，逐步排查问题所在，比如在调用VLLM时确保输入参数正确，获取到正确的输出后再以合适的格式传递给MCP工具。同时，要注意外部服务的稳定性和接口变化，及时调整代码以适应这些情况。

请注意，以上代码是基于常见的Spring和HTTP调用场景的示例，实际应用中需要根据VLLM和MCP工具的具体文档和接口进行详细调整。

希望以上解答对您有所帮助。如果您有任何疑问，欢迎在评论区提出。
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

使用springAI，调用vllm，让大模型调用mcp工具，一直报400错误，如何解决？
2025-04-30 19:15

bug菌¹的博客原问题描述我使用springAI，调用vllm，让大模型调用mcp工具，一直报400错误，以下是我编写的代码，展示部分，看看哪里有问题？如下是我添加的依赖： <dependency> <groupId>org.springframework.ai</groupId> ...
Spring AI MCP实战（1）调用高德api查询天气
2025-11-13 19:09

HHYYKKKKK的博客有一些模型不支持mcp：ollama中的gemma和deepseek现在我使用的支持的：ollama中的qwen3， gpt， openai(deepseek)
Spring Ai （Function Calling / Tool Calling）工具调用
2025-08-25 22:15

Dajiaonew的博客本文介绍了大语言模型(LLM)工具调用的实现方法。工具调用允许LLM在生成回答时决定是否需要调用外部函数获取信息或执行操作，如联网搜索、网页抓取等。文章详细讲解了工具调用的流程：用户提问→LLM判断是否需要工具...
基于 Spring AI + MCP + DeepSeek-R1-7B 构建企业级智能 Agent 工具调用系统
2025-04-23 14:01

奔向理想的星辰大海的博客模块技术说明模型推理Tool Calling 支持良好工具服务快速构建工具 API调度管理Ragflow可选，增强上下文能力将 MCP 工具封装为微服务，提高可复用性使用数据库、搜索引擎作为 Agent 工具数据源在企业应用中使用 ...
Spring AI + bge-large + Milvus 构建私有化语义内容检索方案
2025-06-17 19:50

python_知世的博客无论您是科研人员、工程师，还是对AI大模型感兴趣的爱好者，这套报告合集都将为您提供宝贵的信息和启示。：大数据时代，越来越多的企业和机构需要处理海量数据，利用大模型技术可以更好地处理这些数据，提高数据...
java 使用 spring AI 实战MCP_springai mcp，零基础入门到精通，收藏这篇就够了
2026-01-07 20:16

黑客大白的博客最近在腾讯云edgeone的直播中了解到了MCP，随着了解发现MCP确实是一个未来发展的趋势：全称 Model Context Protocol 是一种专为人工智能模型设计的通信协议，于2024年11月由Anthropic推出的开放标准。它旨在解决复杂...
AI + Spring 新玩法：告别API，MCP协议让它们直接“对话”！
2025-08-05 10:44

AGI大模型学习的博客主要内容包括：1)构建SpringBoot应用，将CRM用户查询功能作为AI可调用的工具；2)使用@Tool注解暴露业务方法；3)支持STDIO和SSE两种通信方式；4)配置CursorIDE连接MCP服务进行测试。文章还分享了AI大模型学习路径，...
Java开发者LLM实战指南：SpringAI vs LangChain4j
2025-06-12 09:37

和老莫一起学AI的博客尽管Python在AI研究领域仍占据主导地位，但Java不仅“活得很好”，还在企业级AI部署中默默支撑着大量应用。如今两大框架正逐步崛起，让Java开发者无需走出舒适区，即可轻松接入大语言模型（LLM）：
使用springai接入阿里各种ai模型
2025-04-09 19:09

凉小刀的博客首先注册阿里云百炼然后获取一个key 创建一个...ai-bom ${spring-ai.version} pom import org.springframework.boot spring-boot-maven-plugin yaml文件为: spring: ai: openai: api-key: 你的key base-url: ...
使用SpringAI实现MCP服务并与Qwen集成使用
2025-04-21 20:26

智泊AI官网的博客 MCP（Model Context Protocol，模型上下文协议）是一种开放协议，旨在实现大型语言模型（LLM）应用与外部数据源、工具和服务之间的无缝集成，类似于网络中的 HTTP 协议或邮件中的 SMTP 协议。MCP 协议通过标准化...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
已结题（查看结题原因） 4月24日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
修改了问题 4月24日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 4月24日

springAI，vllm，调用mcp

4条回答 默认 最新

问题事件

4条回答默认最新