Releases · illuin-tech/grouse · GitHub

20 Nov 13:40

0.4.2 Latest

Fixed

Fixed completion cost computation to support more models through LiteLLM, such as vertex_ai models.

29 Sep 20:06

0.4.1

Fixed

Fix API Connection Errors with Semaphore
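
The note above does not say how the semaphore is used. One common pattern, shown here as a minimal sketch and not as grouse's actual code, is to cap the number of in-flight requests with `asyncio.Semaphore` so that a burst of concurrent calls does not exhaust the connection pool. The names `fake_llm_call`, `run_all`, and `MAX_CONCURRENT_CALLS` are hypothetical, and the sleep stands in for a real API call.

```python
import asyncio

MAX_CONCURRENT_CALLS = 3  # assumed limit for illustration, not a grouse setting


async def fake_llm_call(i, semaphore, stats):
    # Only MAX_CONCURRENT_CALLS coroutines can be inside this block at once.
    async with semaphore:
        stats["running"] += 1
        stats["peak"] = max(stats["peak"], stats["running"])
        await asyncio.sleep(0.01)  # placeholder for the real network request
        stats["running"] -= 1
        return f"response-{i}"


async def run_all(n_calls):
    # Create the semaphore inside the running event loop.
    semaphore = asyncio.Semaphore(MAX_CONCURRENT_CALLS)
    stats = {"running": 0, "peak": 0}
    results = await asyncio.gather(
        *(fake_llm_call(i, semaphore, stats) for i in range(n_calls))
    )
    return results, stats["peak"]


results, peak = asyncio.run(run_all(10))
```

All ten calls complete, but `peak` never exceeds the configured limit, which is the property that keeps the number of open connections bounded.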

24 Sep 20:50

0.4.0

Added

Added o1 support
Added meta evaluation pipeline to Python package

Fixed

Changed Jinja template file extensions

20 Sep 16:18

0.3.1

Fixed

Fixed newlines in prompts
Removed the latest structured generation because the faithfulness and usefulness DTOs did not support it
Fixed dataset loading in plot function

11 Sep 12:08

0.3.0

Added

Add flag to use the training dataset in meta evaluation.

05 Sep 09:16

0.2.1

Fixed

Add missing model register file.

05 Sep 09:02

0.2.0

Added

Register Fireworks Llama 3.1 8b and 70b prices with litellm to better support these models as evaluators.
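
To illustrate why registered prices matter for cost tracking, here is a standard-library sketch of a per-token price table and a cost lookup. It does not use litellm, and it is not how grouse or litellm implement this. The function names, the model name, and the prices are all hypothetical placeholders.

```python
# Hypothetical price table: model name -> per-token prices in USD.
PRICES = {}


def register_price(model, input_cost_per_token, output_cost_per_token):
    """Record per-token prices for a model so its cost can be computed later."""
    PRICES[model] = {
        "input": input_cost_per_token,
        "output": output_cost_per_token,
    }


def completion_cost(model, prompt_tokens, completion_tokens):
    """Return the cost of one completion, or fail if the model is unknown."""
    try:
        price = PRICES[model]
    except KeyError:
        raise ValueError(f"No price registered for model {model!r}") from None
    return prompt_tokens * price["input"] + completion_tokens * price["output"]


# Placeholder model name and prices, chosen only for the example.
register_price("example-provider/example-model", 1e-6, 2e-6)
cost = completion_cost("example-provider/example-model", 1000, 500)
```

A model with no entry in the table cannot be priced at all, which is the kind of gap that registering the missing prices closes.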

Changed

Removed the instructor package to make it clearer what is actually sent to the LLM. All LLM generations now go through litellm.acompletion.
Removed black from justfile and dev dependencies as ruff plays the same role.

23 Aug 08:21

0.1.0

Added

Created GroundedQAEvaluator that evaluates six metrics per sample: answer relevancy, completeness, faithfulness, usefulness, negative rejection and positive acceptance.
Created MetaEvaluator to evaluate evaluators on GroUSE unit tests.
Added plot sub-command to show the matrices
