Stable Function Calling in open-r1 #63
Replies: 3 comments 2 replies
-
I can support adding Code agentic behaviour! Rationale for using code rather than the JSON blobs traditionally used in function calling are here: https://huggingface.co/docs/smolagents/en/conceptual_guides/intro_agents#code-agents |
Beta Was this translation helpful? Give feedback.
-
Have you considered integrating function calling capabilities within long CoT? An r1 model that can obtain tool results in real-time during the internal thinking process would be exciting, although it might be challenging. |
Beta Was this translation helpful? Give feedback.
-
I built a web app to test the tools plan and tool use capabilities of different LLMs. I found that DeepSeek v3 is also prone to errors. For example, it is very easy to fall into a function call loop, which Claude 3.5 Sonnet will not make. As DeepSeek said, function calls need to be improved. |
Beta Was this translation helpful? Give feedback.
-
Hello Everyone,
My goal is to help create stable Function Calling in open-R1 so model pipelines can properly and reliably integrate external tools.
This is the current status of Function Calling in deepseek-chat that can use as a foundation to start: https://api-docs.deepseek.com/guides/function_calling.
Look forward to collaborating on this with the community so feel free to drop me a message
Beta Was this translation helpful? Give feedback.
All reactions