Skip to content

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #3798

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]

Add --examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2] #3798

This workflow is awaiting approval from a maintainer in #2520
Triggered via pull request November 26, 2024 19:51
Status Action required
Total duration
Artifacts
This workflow is awaiting approval from a maintainer in #2520

new_tasks.yml

on: pull_request
Scan for changed tasks
Scan for changed tasks
Fit to window
Zoom out
Zoom in