Use native tool calling for dspy.ReAct #3921

chenmoneygithub · 2025-01-25T03:03:53Z

Support native tool calling in DSPy. This is a relatively big change that involves multiple modules:

dspy.LM: Add an explicit tools arg, and parse the LLM response's tool_calls part as a special field in the LM output.
Adapter: adapter is modified accordingly to handle the tool calls.
dspy.ReAct: In addition to the custom tool calling (as of dspy==2.5.42), we support native tool calling by allowing users to define the tools, and inside the implementation we do automatic input formatting and output parsing in order to execute the tools.
dspy.Tool: we improve the dspy.Tool abstraction to facilitate the tool calling process.

Custom testing/benchmarking script:

import dspy

lm_4o_mini = dspy.LM("openai/gpt-4o-mini")
lm_4o = dspy.LM("openai/gpt-4o")
dspy.configure(lm=lm_4o_mini)


import litellm

from dspy.datasets import DataLoader

litellm.cache = None

kwargs = dict(fields=("claim", "supporting_facts", "hpqa_id", "num_hops"), input_keys=("claim",))
hover = DataLoader().from_huggingface(dataset_name="hover-nlp/hover", split="train", trust_remote_code=True, **kwargs)

hpqa_ids = set()
filtered_hover = []
for x in hover:
    if x["num_hops"] == 3 and x["hpqa_id"] not in hpqa_ids:
        hpqa_ids.add(x["hpqa_id"])
        filtered_hover.append(
            dspy.Example(claim=x.claim, titles=list(set([y["key"] for y in x.supporting_facts]))).with_inputs("claim")
        )
hover = filtered_hover

trainset, devset, testset = hover[:100], hover[100:200], hover[650:]

example = trainset[0]

print("Claim:", example.claim)
print("Pages that must be retrieved:", example.titles)

DOCS = {}


def search(query: str, k: int) -> list[str]:
    results = dspy.ColBERTv2(url="http://20.102.90.50:2017/wiki17_abstracts")(query, k=k)
    results = [x["text"] for x in results]

    for result in results:
        title, text = result.split(" | ", 1)
        DOCS[title] = text

    return results


def search_wikipedia(query: str) -> list[str]:
    """Returns top-5 results and then the titles of the top-5 to top-30 results."""

    topK = search(query, 30)
    titles, topK = [f"`{x.split(' | ')[0]}`" for x in topK[5:30]], topK[:5]
    return topK + [f"Other retrieved pages have titles: {', '.join(titles)}."]


def lookup_wikipedia(title: str) -> str:
    """Returns the text of the Wikipedia page, if it exists."""

    if title in DOCS:
        return DOCS[title]

    results = [x for x in search(title, 10) if x.startswith(title + " | ")]
    if not results:
        return f"No Wikipedia page found for title: {title}"
    return results[0]


instructions = "Find all Wikipedia titles relevant to verifying (or refuting) the claim."
signature = dspy.Signature("claim -> titles: list[str]", instructions)
tools = [dspy.Tool.from_function(search_wikipedia), dspy.Tool.from_function(lookup_wikipedia)]

react = dspy.ReAct(signature, tools=tools, max_iters=20, use_litellm_tool_calling=True)

output = react(claim="David Gregory was born in 1625.")
print(output)

dspy.inspect_history(n=2)


def top5_recall(example, pred, trace=None):
    gold_titles = example.titles
    recall = sum(x in pred.titles[:5] for x in gold_titles) / len(gold_titles)

    # If we're "bootstrapping" for optimization, return True if and only if the recall is perfect.
    if trace is not None:
        return recall >= 1.0

    # If we're just doing inference, just measure the recall.
    return recall


evaluate = dspy.Evaluate(devset=devset[:10], metric=top5_recall, num_threads=10, display_progress=True, display_table=5)


def safe_react(claim: str):
    try:
        return react(claim=claim)
    except Exception:
        return dspy.Prediction(titles=[])


evaluate(safe_react)

chenmoneygithub marked this pull request as draft January 25, 2025 03:03

chenmoneygithub changed the title ~~Use native tool calling for dspy.ReAct~~ [WIP] Use native tool calling for dspy.ReAct Jan 25, 2025

chenmoneygithub and others added 10 commits January 26, 2025 21:10

Add tools calling for dspy.LM

5dea33f

fix tests

594a7c5

fix the adapter and predict

42adaf8

Add PredictWithTools

7a51abf

init

50e05db

increment

370e849

fix tests

44443bd

increment

6d5e363

fix

dd4c27b

minor

e9c83ec

chenmoneygithub force-pushed the dspy-tool-def branch from 27c6ee9 to e9c83ec Compare January 27, 2025 05:12

fix arg type thing

39c720b

chenmoneygithub mentioned this pull request Jan 27, 2025

Add tools calling for dspy.LM #2023

Closed

fix tests

755269a

chenmoneygithub changed the title ~~[WIP] Use native tool calling for dspy.ReAct~~ Use native tool calling for dspy.ReAct Jan 27, 2025

chenmoneygithub marked this pull request as ready for review January 27, 2025 06:52

chenmoneygithub changed the title ~~Use native tool calling for dspy.ReAct~~ [WIP] Use native tool calling for dspy.ReAct Jan 27, 2025

chenmoneygithub added 2 commits January 27, 2025 13:18

fix json adapter

0ad665e

hack

19884f4

chenmoneygithub force-pushed the dspy-tool-def branch from 1c1d88d to 19884f4 Compare January 28, 2025 00:49

change defaults

77af440

chenmoneygithub force-pushed the dspy-tool-def branch from 11d9403 to bc41cb7 Compare January 29, 2025 04:57

chenmoneygithub changed the title ~~[WIP] Use native tool calling for dspy.ReAct~~ Use native tool calling for dspy.ReAct Jan 29, 2025

chenmoneygithub requested a review from okhat January 29, 2025 05:03

add test

72c342b

chenmoneygithub force-pushed the dspy-tool-def branch from bc41cb7 to 72c342b Compare January 29, 2025 05:04

improve dspy.Tool

bd2d7cf

chenmoneygithub force-pushed the dspy-tool-def branch from 1975352 to bd2d7cf Compare January 29, 2025 18:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use native tool calling for dspy.ReAct #3921

Use native tool calling for dspy.ReAct #3921

chenmoneygithub commented Jan 25, 2025 •

edited

Loading

Use native tool calling for dspy.ReAct #3921

Are you sure you want to change the base?

Use native tool calling for dspy.ReAct #3921

Conversation

chenmoneygithub commented Jan 25, 2025 • edited Loading

chenmoneygithub commented Jan 25, 2025 •

edited

Loading