feat: Add reasoning effort control for Claude 3.7 #2540

DeJeune · 2025-02-28T05:25:44Z

Add reasoning effort settings with low/medium/high options
Implement reasoning effort for Claude 3.7 Sonnet models with OpenAI and Anthropic Provider
Update localization tips for reasoning effort
Enhance provider handling of reasoning effort parameters

Resolve #2304


DeJeune · 2025-02-28T06:01:16Z

A brief explanation: the handling of reasoning-related parameters reuses the new OpenRouter API: https://openrouter.ai/docs/use-cases/reasoning-tokens

{
  "model": "your-model",
  "messages": [],
  "reasoning": {
    // One of the following (not both):
    "effort": "high", // Can be "high", "medium", or "low" (OpenAI-style)
    "max_tokens": 2000, // Specific token limit (Anthropic-style)

    // Optional: Default is false. All models support this.
    "exclude": false // Set to true to exclude reasoning tokens from response
  }
}
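To make the "one of the following (not both)" constraint concrete, here is a minimal TypeScript sketch of the request shape quoted above. The type and function names (ReasoningOptions, ChatRequest, buildRequest) are hypothetical and are not taken from this PR's code; only the field names come from the snippet.

```typescript
// Hypothetical types mirroring the request shape quoted above.
type ReasoningEffort = 'low' | 'medium' | 'high'

// A union type so that a caller can set either `effort` or `max_tokens`, never both.
type ReasoningOptions =
  | { effort: ReasoningEffort; max_tokens?: never; exclude?: boolean }
  | { max_tokens: number; effort?: never; exclude?: boolean }

interface ChatRequest {
  model: string
  messages: unknown[]
  reasoning?: ReasoningOptions
}

// Builds a request body with an empty message list, as in the snippet.
function buildRequest(model: string, reasoning: ReasoningOptions): ChatRequest {
  return { model, messages: [], reasoning }
}

// OpenAI-style: choose an effort level.
const byEffort = buildRequest('your-model', { effort: 'high' })

// Anthropic-style: choose an explicit reasoning token limit.
const byBudget = buildRequest('your-model', { max_tokens: 2000, exclude: false })
```

With these types, passing both effort and max_tokens in the same object is rejected at compile time, which matches the "not both" rule in the comment.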

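reasoning_effort follows the OpenAI style. If an effort value is set for a model that does not support effort, such as Anthropic's, it is converted into a token budget by percentage: budget_tokens = max(min(global_max_tokens * {effort_ratio}, 32000), 1024), where the effort ratio is 0.2, 0.5, or 0.8 for low, medium, and high.
The max_tokens inside reasoning is the maximum number of reasoning tokens, which corresponds to Anthropic's budget_tokens. For models that do not support it, such as OpenAI's, it is converted into an effort level based on max_tokens / global_max_tokens.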
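The two conversions above can be sketched as follows. This is an illustration under stated assumptions, not the PR's actual code: the function names are hypothetical, rounding down with Math.floor is my own choice, and the comment does not say how a token ratio is mapped back to an effort level, so picking the nearest ratio is only one plausible rule.

```typescript
type ReasoningEffort = 'low' | 'medium' | 'high'

// Effort ratios as stated in the comment: 0.2, 0.5, 0.8.
const EFFORT_RATIO: Record<ReasoningEffort, number> = { low: 0.2, medium: 0.5, high: 0.8 }

// budget_tokens = max(min(global_max_tokens * effort_ratio, 32000), 1024)
// Assumption: the product is rounded down to a whole number of tokens.
function effortToBudgetTokens(effort: ReasoningEffort, globalMaxTokens: number): number {
  const raw = Math.floor(globalMaxTokens * EFFORT_RATIO[effort])
  return Math.max(Math.min(raw, 32000), 1024)
}

// Reverse direction: max_tokens / global_max_tokens -> effort.
// Assumption: choose the effort whose ratio is closest to the requested ratio.
function budgetTokensToEffort(budgetTokens: number, globalMaxTokens: number): ReasoningEffort {
  const ratio = budgetTokens / globalMaxTokens
  const efforts: ReasoningEffort[] = ['low', 'medium', 'high']
  return efforts.reduce((best, candidate) =>
    Math.abs(EFFORT_RATIO[candidate] - ratio) < Math.abs(EFFORT_RATIO[best] - ratio) ? candidate : best
  )
}
```

For example, with global_max_tokens at 8192, medium effort gives a budget of 4096 tokens; with 4096, low effort would give 819, which the lower bound raises to 1024; with 128000, high effort would give 102400, which the upper bound caps at 32000.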

So we only need to pick one of the two. The best approach is to reuse the existing UI, so that a single setting covers both cases.

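Here, for the o3 series and o1 models, or for claude-sonnet-3.7 on OpenRouter, reasoning_effort is used directly.
For claude-sonnet-3.7 through other relay providers or the native Anthropic API, it is converted into budget_tokens.
In the code I marked all o3-series and o1 models, as well as claude-sonnet-3.7, as Reasoning models. That is what routes them into the logic described above.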
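The routing described above can be sketched roughly like this. Everything here is a hypothetical illustration rather than the PR's implementation: the ModelInfo shape, the provider id 'openrouter', the model-id patterns, and the function names are assumptions. The output fields reasoning_effort (OpenAI-style) and thinking with budget_tokens (Anthropic-style) follow the public request formats of those APIs.

```typescript
type ReasoningEffort = 'low' | 'medium' | 'high'

// Hypothetical model descriptor; the real project has its own model type.
interface ModelInfo {
  id: string
  provider: string
}

interface ReasoningParams {
  reasoning_effort?: ReasoningEffort
  thinking?: { type: 'enabled'; budget_tokens: number }
}

const EFFORT_RATIO: Record<ReasoningEffort, number> = { low: 0.2, medium: 0.5, high: 0.8 }

// Assumed id patterns, e.g. "o1", "o3-mini", "openai/o3-mini".
const isOpenAIoSeries = (id: string): boolean => /(^|\/)o[13](-|$)/.test(id)

// Assumed id patterns, e.g. "claude-3-7-sonnet-20250219", "anthropic/claude-3.7-sonnet".
const isClaude37Sonnet = (id: string): boolean => /claude-3[.-]7-sonnet/.test(id)

function getReasoningParams(model: ModelInfo, effort: ReasoningEffort, maxTokens: number): ReasoningParams {
  const claude37 = isClaude37Sonnet(model.id)

  // o-series models, or Claude 3.7 Sonnet via OpenRouter: pass the effort through unchanged.
  if (isOpenAIoSeries(model.id) || (claude37 && model.provider === 'openrouter')) {
    return { reasoning_effort: effort }
  }

  // Claude 3.7 Sonnet via other relays or the native Anthropic API: convert to a token budget.
  if (claude37) {
    const raw = Math.floor(maxTokens * EFFORT_RATIO[effort])
    return { thinking: { type: 'enabled', budget_tokens: Math.max(Math.min(raw, 32000), 1024) } }
  }

  // Not a reasoning model: send no reasoning parameters.
  return {}
}
```

Keeping the branch decision in one function means the UI only has to expose a single effort setting, and the provider layer decides which wire format to emit.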

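To enable reasoning for Claude, just turn on the chain-of-thought length setting.

With chain of thought disabled:

With chain of thought enabled:

For full-strength thinking, you can raise max_tokens to the officially recommended 128k tokens.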

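As for the issue @preszzz raised about sending requests directly from the renderer process, I don't think Claude 3.7 needs to be handled separately. If it really does need handling, all API requests would have to move to the main process, with API keys stored encrypted to guard against malware. That could be considered later as an overall change.

In short, this keeps the front-end changes minimal and leaves the use of budget_tokens to the program, which also prevents users from setting a budget_tokens value larger than max_tokens.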

DeJeune · 2025-02-28T06:02:08Z

@ousugo Could you take a look? Thanks 🙏

DeJeune · 2025-02-28T06:18:11Z

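Below are some test results.

Test of the native Anthropic interface. This was done by changing the aihubmix provider's type in the code, since I don't have an Anthropic API key 😭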

ousugo · 2025-02-28T07:57:56Z

I think this works 👍. Just have 猫哥 take another look and it should be good to go.

DeJeune added 4 commits February 28, 2025 13:23

fix: Extract o1-mini and o1-preview

a15c74f

fix: Add OpenAI o-series model to ReasoningModel

b8edeae

fix: Improve OpenAI o-series model detection

572b4b5

style: Reduce font size

8496a6b

fix: Add default token handling using DEFAULT_MAX_TOKENS

a8edb36

DeJeune marked this pull request as ready for review February 28, 2025 06:25
