Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #3770
This workflow is awaiting approval from a maintainer in #2520
Triggered via pull request on November 26, 2024 19:51
Status: Action required
unit_tests.yml
on: pull_request
External LM Tests
Linters
Matrix: CPU Tests
Waiting for pending jobs