It is unlikely that a prompt that should result in a tool call will misfire with gpt-4o; at least according to this blog, which uses Gorilla to test tool calls, the success rate is very high.
However, even if we don't see gpt-4o mistaking a tool call for a question in practice (and returning text instead), it is certainly possible. It is much more likely with local inference, where roughly 1 in 10 calls can misfire, or a model may misfire because it doesn't interpret the tools or prompts exactly the way GPT does.
I suggest an approach where, when we know a tool call is expected, we retry if the LLM's response is text instead. Right now we have retries, but only on HTTP failure. This retry would live at the application layer and would help make local LLMs more usable.
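To make the proposal concrete, here's a minimal sketch of what that application-layer retry could look like. Everything here is hypothetical: `call_llm` stands in for whatever client function we use, and the response dicts mimic the OpenAI chat-completion message shape (`tool_calls` vs. `content`); the real integration would plug into our existing request path.

```python
def retry_for_tool_call(call_llm, messages, max_retries=3):
    """Retry when a tool call is expected but the model returns plain text.

    `call_llm` is a hypothetical client: it takes a message list and returns
    a response dict containing either 'tool_calls' (success) or 'content'
    (a text misfire).
    """
    last = None
    for _ in range(max_retries):
        last = call_llm(messages)
        if last.get("tool_calls"):
            return last  # model produced a tool call; done
        # Misfire: the model answered in prose. Append a corrective nudge
        # and try again (one possible strategy; plain re-ask also works).
        messages = messages + [
            {"role": "assistant", "content": last.get("content", "")},
            {"role": "user", "content": "Respond with a tool call, not text."},
        ]
    return last  # retries exhausted; caller decides how to surface the text
```

This keeps the retry orthogonal to HTTP-level retries: the transport succeeded, but the payload wasn't what the application expected, so the application retries.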