Releases: illuin-tech/grouse

0.4.2

20 Nov 13:40

Fixed

  • Fix completion cost computation to support more litellm models, such as vertex_ai models

0.4.1

29 Sep 20:06
a3ed631

Fixed

  • Fix API connection errors with a semaphore
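
The release note does not say how the semaphore is used. A common pattern for avoiding connection errors when many asynchronous API calls are issued at once is to cap the number of requests in flight with `asyncio.Semaphore`. The sketch below illustrates that general pattern only; it is not taken from the grouse code base. `bounded_gather`, `FakeApi`, and `max_concurrency` are hypothetical names, and `FakeApi.call` stands in for a real LLM request.

```python
import asyncio


async def bounded_gather(worker, items, max_concurrency=4):
    """Run worker(item) for every item, with at most max_concurrency running at once."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def guarded(item):
        # Each task waits here until one of the max_concurrency slots is free.
        async with semaphore:
            return await worker(item)

    # gather returns results in the same order as the inputs.
    return await asyncio.gather(*(guarded(item) for item in items))


class FakeApi:
    """Hypothetical stand-in for a remote API; records peak concurrency."""

    def __init__(self):
        self.in_flight = 0
        self.peak = 0

    async def call(self, item):
        self.in_flight += 1
        self.peak = max(self.peak, self.in_flight)
        await asyncio.sleep(0.01)  # simulate network latency
        self.in_flight -= 1
        return item * 2


if __name__ == "__main__":
    api = FakeApi()
    results = asyncio.run(bounded_gather(api.call, list(range(10)), max_concurrency=3))
    print(results, "peak concurrency:", api.peak)
```

Limiting concurrency this way trades some throughput for stability: the tasks are all created up front, but only a bounded number hold a connection at any moment.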

0.4.0

24 Sep 20:50
1969e27

Added

  • Added o1 support
  • Added meta evaluation pipeline to Python package

Fixed

  • Change jinja template extensions

0.3.1

20 Sep 16:18
089a3dd

Fixed

  • Fixed prompts newlines
  • Removed latest structured generation because faithfulness and usefulness DTOs did not support it
  • Fixed dataset loading in plot function

0.3.0

11 Sep 12:08

Added

  • Add flag to use the training dataset in meta evaluation.

0.2.1

05 Sep 09:16

Fixed

  • Add missing model register file.

0.2.0

05 Sep 09:02
1d8a011

Added

  • Register Fireworks Llama 3.1 8b and 70b prices with litellm to better support these models as evaluators.

Changed

  • Remove instructor package to better understand what is really sent to the LLM. All the LLM generations are simply done using litellm.acompletion.
  • Removed black from justfile and dev dependencies as ruff plays the same role.

0.1.0

23 Aug 08:21
3976008

Added

  • Created GroundedQAEvaluator that evaluates six metrics per sample: answer relevancy, completeness, faithfulness, usefulness, negative rejection and positive acceptance.
  • Created MetaEvaluator to evaluate evaluators on GroUSE unit tests.
  • Added plot sub-command to show the matrices