Implemented Recursive Criticism and Iteration in Task Creation to Verify the output of the agents #785

chandrakanth137 · 2024-06-18T10:28:00Z

RCI Documentation

Documentation for Recursive Criticism and Iteration (RCI) Methods

Overview

Recursive Criticism and Iteration (RCI) is a systematic process used to iteratively enhance the quality of outputs generated by a language model (LLM). It involves three main steps: Critique, Validate, and Improve. Each step is designed to ensure that the final output is accurate, logically sound, and free of factual errors.

This documentation explains the implementation of RCI in CrewAI library using LangChain and Ollama LLM. The code defines three methods: critique, validate, and improve. Each method leverages prompt templates to interact with the LLM and achieve the desired processing at each stage.

Methods

1. `critique`

This method generates a critique of the given output based on the task description. The critique focuses solely on logical or factual inaccuracies, avoiding grammatical rephrasing or paraphrasing.

Parameters:

agent: The agent performing the task, which includes the agent's backstory.
task: The task description provided to the model.
output: The output generated by the LLM for the given task.
llm: The language model used to process the prompt and generate the critique.

2. `validate`

This method determines if the critique suggests significant changes to the output. It analyzes the critique to see if it indicates that substantial revisions are necessary.

Parameters:

task: The task description provided to the model.
critique: The critique generated by the critique method.
output: The original output generated by the LLM.
llm: The language model used to process the prompt and validate the critique.

Returns:

validate_response: A single word response, either "True" or "False", indicating whether significant changes are required.

3. `improve`

This method refines the original output based on the critique. It rewrites the output to address the errors identified in the critique, ensuring the format specified in the task description is maintained.

Parameters:

task: The task description provided to the model.
output: The original output generated by the LLM.
critique: The critique generated by the critique method.
llm: The language model used to process the prompt and improve the output.

Returns:

improve_response: The improved output generated by the LLM.

Summary

The RCI methods (critique, validate, and improve) form a robust framework for iteratively refining LLM outputs. By identifying and correcting logical or factual errors and validating the significance of required changes, this approach ensures high-quality and accurate outputs tailored to specific task requirements.

To test the functioning RCI in the modified CrewAI

Note: The test code written is suited for local development scenario, kindly modify the code in necessary ways for your working.

Move the src directory
Run the crew_ai_base_test.py to check if your initial setup works
Run this Python notebook for advanced test case: YT_Email_Reply_Llama3_CrewAI_+_Groq.ipynb
- For the above code to work, Groq API is required, make sure you have one before starting
To try out custom test cases, RCI can be enabled while creating an instance of the Task Class
- Set rci=True if you want to use RCI, the default value is True.
- The # of iterations can be modified using the rci_depth parameter, which takes an integer value. The default value is set to 1.

Example:

task = Task(
    description= """ Your Task Description""",
    agent= your_agent,
    expected_output= "However you wish",
    rci=True
    rci_depth=3
)

mbarnathan · 2024-06-27T04:08:47Z

src/crewai/task.py

        result = agent.execute_task(
            task=task,
            context=context,
            tools=tools,
        )

+        # To perform RCI if rci is set to True
+        llm = ChatOllama(model="llama3")


I'm really glad to see this capability coming to CrewAI! This will probably need to be more generic; I doubt a hardcoded dependency on Ollama will get approved since the library doesn't have one anywhere else.

I will try making it more generic. I only had access to local LLMs, so that piece of code ended up there. I will modify it to use the user preferred LLM and make the code more generic. Thanks for the review !

joaomdmoura · 2024-06-27T05:44:06Z

Just got my eyes on this! Very curious about it, little busy today/tomorrow, but bumping this to the top of the list so either myself or someone on the team looks at it!

clearing up local llm changes

clearing up local llm setup

chandrakanth137 · 2024-06-27T06:36:27Z

As @mbarnathan pointed out to make the code more generic, I have modified the code to use the existing LLM from the user provided which would usually be initialized using crew.kickoff().

As I have access only to local LLMs, it would be great if anyone can run the internal tests. I tried using the local LLMs but it eventually came to do modifying the test cases to setup for local LLMs.

I will probably try to modify the test cases to even work with local LLMs.

github-actions · 2024-09-25T12:17:10Z

This PR is stale because it has been open for 45 days with no activity.

chandrakanth137 and others added 4 commits June 14, 2024 11:54

critique added

f03376f

rci using langchain

89a9e6b

Merge branch 'joaomdmoura:main' into dev

7772e99

tests included

442b395

mbarnathan reviewed Jun 27, 2024

View reviewed changes

Merge branch 'joaomdmoura:main' into dev

443dade

chandrakanth137 and others added 6 commits June 27, 2024 11:18

Merge branch 'joaomdmoura:main' into dev

2689504

Delete src/llama3_connector.sh

d00042b

clearing up local llm changes

Delete src/crew_ai_base_test.py

3f9a449

clearing up local llm setup

Delete src/YT_Email_Reply_Llama3_CrewAI_+_Groq.ipynb

9b753c3

clearing up local llm setup

rci with generic llm

1cf9533

generic llm added

946d471

theCyberTech added the feature-request New feature or request label Aug 10, 2024

github-actions bot added the no-pr-activity label Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented Recursive Criticism and Iteration in Task Creation to Verify the output of the agents #785

Implemented Recursive Criticism and Iteration in Task Creation to Verify the output of the agents #785

chandrakanth137 commented Jun 18, 2024

mbarnathan Jun 27, 2024

chandrakanth137 Jun 27, 2024

joaomdmoura commented Jun 27, 2024

chandrakanth137 commented Jun 27, 2024

github-actions bot commented Sep 25, 2024

Implemented Recursive Criticism and Iteration in Task Creation to Verify the output of the agents #785

Are you sure you want to change the base?

Implemented Recursive Criticism and Iteration in Task Creation to Verify the output of the agents #785

Conversation

chandrakanth137 commented Jun 18, 2024

RCI Documentation

Documentation for Recursive Criticism and Iteration (RCI) Methods

Overview

Methods

1. critique

2. validate

3. improve

Summary

To test the functioning RCI in the modified CrewAI

Note: The test code written is suited for local development scenario, kindly modify the code in necessary ways for your working.

mbarnathan Jun 27, 2024

Choose a reason for hiding this comment

chandrakanth137 Jun 27, 2024

Choose a reason for hiding this comment

joaomdmoura commented Jun 27, 2024

chandrakanth137 commented Jun 27, 2024

github-actions bot commented Sep 25, 2024

1. `critique`

2. `validate`

3. `improve`