Mini Evaluation dataset for Home Assistant assist actions

This is a variation on the assist dataset that tests far fewer entities, devices, and areas at once. This is designed given poor performance of local models on the assist baseline to help with even smaller tasks to get right.

This is a dataset for the Home Assistant LLM API (blog post).

See the home-assistant-datasets assist command for more details on how to run evaluations.

See ../assist/README.md for details on how the dataset is configured including the inventory fixtures, and the eval tasks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Mini Evaluation dataset for Home Assistant assist actions

Files

README.md

Latest commit

History

README.md

File metadata and controls

Mini Evaluation dataset for Home Assistant assist actions