- Implement HermesClient for interacting with the Hermes CLI.
- Create judge module for grading QA outputs from Hermes memory.
- Develop LoCoMo dataset parsing and formatting utilities.
- Introduce run_eval script to facilitate memory evaluation using LoCoMo-style datasets.
3 lines
41 B
Python
"""Hermes memory evaluation helpers."""