Add `--examples` Argument for Fine-Grained Task Evaluation in `lm-evaluation-harness`. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #3797

Sign in to view logs

This workflow is awaiting approval from a maintainer in #2520

Triggered via pull request November 26, 2024 19:47

opened #2520

mirianfsilva:examples-arg

Status Action required

Total duration –

Artifacts –

This workflow is awaiting approval from a maintainer in #2520

new_tasks.yml

on: pull_request

Scan for changed tasks