Add `--examples` Argument for Fine-Grained Task Evaluation in `lm-evaluation-harness`. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #3770

This workflow is awaiting approval from a maintainer in #2520

Triggered via pull request November 26, 2024 19:51

synchronize #2520

Status: Action required

Total duration: –

Artifacts: –

unit_tests.yml

on: pull_request

Matrix: CPU Tests

Waiting for pending jobs