Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evidence - FeedbackEvaluator Storage & Basic UI #12758

Open
wants to merge 41 commits into
base: develop
Choose a base branch
from

Conversation

dandrabik
Copy link
Member

@dandrabik dandrabik commented Jan 31, 2025

WHAT

Run the FeedbackEvaluator after trials complete, store in a DB table and roll-up, show in UI.

WHY

We want to show this info in the UI of the research tool to make prompt engineering on the feedback prompt more efficient.

HOW

Create a new table evidence_research_gen_ai_feedback_evaluations and run evaluations when GEvalScores are calculated. Show in UI.

Screenshots

Screenshot 2025-01-31 at 5 09 24 PM
Screenshot 2025-01-31 at 5 09 12 PM### Notion Card Links
https://www.notion.so/quill/Feedback-Evaluation-Metrics-Storage-185d42e6f941807191e3ddc2da872cd7

What have you done to QA this feature?

I've run some tests locally in the console while connected to the staging DB. I also ran the metrics on staging and seemed to work properly. Viewed the UI locally to ensure the metrics show properly.

PR Checklist Your Answer
Have you added and/or updated tests? Yes
Have you deployed to Staging? YES
Self-Review: Have you done an initial self-review of the code below on Github? Yes

@dandrabik dandrabik marked this pull request as draft January 31, 2025 20:19
@dandrabik dandrabik changed the title Evidence - FeedbackEvaluator Storage Evidence - FeedbackEvaluator Storage & Basic UI Jan 31, 2025
@dandrabik dandrabik marked this pull request as ready for review January 31, 2025 22:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant