Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
#3797
This workflow is awaiting approval from a maintainer in #2520
Triggered via pull request: November 26, 2024 19:47
Status: Action required
Total duration: –
Artifacts: –
new_tasks.yml
on: pull_request
Scan for changed tasks