Skip to content

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #3770

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #3770

This workflow is awaiting approval from a maintainer in #2520
Triggered via pull request November 26, 2024 19:51
Status Action required
Total duration
Artifacts
This workflow is awaiting approval from a maintainer in #2520

unit_tests.yml

on: pull_request
External LM Tests
External LM Tests
Linters
Linters
Matrix: CPU Tests
Waiting for pending jobs
Fit to window
Zoom out
Zoom in