Skip to content

Actions: EleutherAI/lm-evaluation-harness

Unit Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,053 workflow runs
3,053 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Biology ds
Unit Tests #3685: Pull request #2486 opened by deema-A
November 13, 2024 02:36 Action required deema-A:biology_ds
November 13, 2024 02:36 Action required
wandb logger fix, added pre-commit (#2484)
Unit Tests #3684: Commit 67db63a pushed by baberabb
November 12, 2024 17:34 6m 35s main
November 12, 2024 17:34 6m 35s
fix wandb
Unit Tests #3681: Pull request #2485 opened by baberabb
November 12, 2024 17:10 6m 15s wandb
November 12, 2024 17:10 6m 15s
MILU dataset from AI4Bharat for Indic LLM eval
Unit Tests #3679: Pull request #2482 opened by abhinand5
November 12, 2024 03:55 6m 11s abhinand5:main
November 12, 2024 03:55 6m 11s
change warning to debug (#2481)
Unit Tests #3678: Commit 6b628d9 pushed by baberabb
November 11, 2024 22:25 6m 26s main
November 11, 2024 22:25 6m 26s
release kbl-v0.1
Unit Tests #3677: Pull request #2476 synchronize by whwang299
November 11, 2024 20:47 6m 26s lbox-kr:kbl-release-v0.1
November 11, 2024 20:47 6m 26s
change warning to debug
Unit Tests #3676: Pull request #2481 opened by baberabb
November 11, 2024 20:30 6m 34s debug
November 11, 2024 20:30 6m 34s
release kbl-v0.1
Unit Tests #3675: Pull request #2476 synchronize by whwang299
November 11, 2024 20:18 6m 24s lbox-kr:kbl-release-v0.1
November 11, 2024 20:18 6m 24s
Fix chat template; fix leaderboard math (#2475)
Unit Tests #3674: Commit 77c811e pushed by baberabb
November 11, 2024 17:01 6m 42s main
November 11, 2024 17:01 6m 42s
release kbl-v0.1
Unit Tests #3673: Pull request #2476 synchronize by whwang299
November 11, 2024 07:23 6m 46s lbox-kr:kbl-release-v0.1
November 11, 2024 07:23 6m 46s
mlx Model (loglikelihood & generate_until)
Unit Tests #3671: Pull request #1902 synchronize by chimezie
November 10, 2024 18:46 Action required chimezie:mlx
November 10, 2024 18:46 Action required
mlx Model (loglikelihood & generate_until)
Unit Tests #3670: Pull request #1902 synchronize by chimezie
November 10, 2024 17:44 Action required chimezie:mlx
November 10, 2024 17:44 Action required
release kbl-v0.1
Unit Tests #3669: Pull request #2476 opened by whwang299
November 10, 2024 01:13 5m 42s lbox-kr:kbl-release-v0.1
November 10, 2024 01:13 5m 42s
Fix chat template; fix leaderboard math
Unit Tests #3668: Pull request #2475 opened by baberabb
November 9, 2024 20:56 6m 45s fix-chat-template
November 9, 2024 20:56 6m 45s
Ifeval: Dowload punkt_tab on rank 0 (#2267)
Unit Tests #3667: Commit bd80a6c pushed by baberabb
November 9, 2024 12:23 6m 26s main
November 9, 2024 12:23 6m 26s
OpenAI ChatCompletions: switch max_tokens (#2443)
Unit Tests #3666: Commit 060e876 pushed by baberabb
November 9, 2024 12:19 7m 5s main
November 9, 2024 12:19 7m 5s
OpenAI ChatCompletions: switch max_tokens
Unit Tests #3665: Pull request #2443 synchronize by baberabb
November 9, 2024 12:19 6m 24s openaichat
November 9, 2024 12:19 6m 24s
Update citation
Unit Tests #3664: Pull request #2474 opened by Sypherd
November 8, 2024 12:40 6m 36s Sypherd:update-citation
November 8, 2024 12:40 6m 36s
Use global filter alias
Unit Tests #3663: Pull request #2473 opened by Sypherd
November 8, 2024 11:57 6m 9s Sypherd:use-global-filter
November 8, 2024 11:57 6m 9s
pass device_map other than auto for parallelize (#2457)
Unit Tests #3662: Commit 4155ec7 pushed by baberabb
November 7, 2024 17:44 5m 49s main
November 7, 2024 17:44 5m 49s
pass device_map other than auto for parallelize
Unit Tests #3661: Pull request #2457 synchronize by baberabb
November 7, 2024 17:37 5m 50s devicemap
November 7, 2024 17:37 5m 50s
pass device_map other than auto for parallelize
Unit Tests #3660: Pull request #2457 synchronize by baberabb
November 7, 2024 17:33 2m 34s devicemap
November 7, 2024 17:33 2m 34s