Actions: EleutherAI/lm-evaluation-harness

Showing runs from all workflows
5,679 workflow runs

Adding the Evalita-LLM benchmark
Unit Tests #4174: Pull request #2671 synchronize by m-resta
February 7, 2025 08:47 Action required hltfbk:main
Adding the Evalita-LLM benchmark
Tasks Modified #4202: Pull request #2671 synchronize by m-resta
February 7, 2025 08:47 Action required hltfbk:main
Adding the Evalita-LLM benchmark
Tasks Modified #4201: Pull request #2671 synchronize by m-resta
February 7, 2025 08:45 Action required hltfbk:main
Adding the Evalita-LLM benchmark
Unit Tests #4173: Pull request #2671 synchronize by m-resta
February 7, 2025 08:45 Action required hltfbk:main
[MM] Ai2d
Tasks Modified #4199: Pull request #2542 synchronize by baberabb
February 6, 2025 14:37 2m 2s ai2d
[MM] Ai2d
Unit Tests #4171: Pull request #2542 synchronize by baberabb
February 6, 2025 14:37 7m 29s ai2d
[MM] Ai2d
Unit Tests #4170: Pull request #2542 synchronize by baberabb
February 6, 2025 11:27 7m 29s ai2d
[MM] Ai2d
Tasks Modified #4198: Pull request #2542 synchronize by baberabb
February 6, 2025 11:27 1m 47s ai2d
[MM] Ai2d
Tasks Modified #4197: Pull request #2542 synchronize by baberabb
February 6, 2025 11:24 1m 44s ai2d
[MM] Ai2d
Unit Tests #4169: Pull request #2542 synchronize by baberabb
February 6, 2025 11:24 6m 35s ai2d
[MM] Ai2d
Tasks Modified #4196: Pull request #2542 synchronize by baberabb
February 6, 2025 10:21 2m 3s ai2d
[MM] Ai2d
Unit Tests #4168: Pull request #2542 synchronize by baberabb
February 6, 2025 10:21 7m 30s ai2d
[MM] Ai2d
Unit Tests #4167: Pull request #2542 synchronize by baberabb
February 6, 2025 10:05 7m 8s ai2d
[MM] Ai2d
Tasks Modified #4195: Pull request #2542 synchronize by baberabb
February 6, 2025 10:05 1m 52s ai2d
[MM] Ai2d
Unit Tests #4166: Pull request #2542 synchronize by baberabb
February 6, 2025 09:14 7m 1s ai2d
[MM] Ai2d
Tasks Modified #4194: Pull request #2542 synchronize by baberabb
February 6, 2025 09:14 1m 49s ai2d
[MM] Chartqa
Unit Tests #4165: Pull request #2544 synchronize by baberabb
February 6, 2025 08:53 7m 7s chartqa
[MM] Chartqa
Tasks Modified #4193: Pull request #2544 synchronize by baberabb
February 6, 2025 08:53 1m 49s chartqa
Convert gen tasks to multiple_choice
Unit Tests #4164: Pull request #2670 synchronize by baberabb
February 6, 2025 08:43 7m 7s convert_gen
Convert gen tasks to multiple_choice
Tasks Modified #4192: Pull request #2670 synchronize by baberabb
February 6, 2025 08:43 2m 31s convert_gen
[MM] Chartqa
Unit Tests #4163: Pull request #2544 synchronize by baberabb
February 6, 2025 08:38 7m 10s chartqa
[MM] Chartqa
Tasks Modified #4191: Pull request #2544 synchronize by baberabb
February 6, 2025 08:38 1m 39s chartqa
[MM] Chartqa
Tasks Modified #4190: Pull request #2544 synchronize by baberabb
February 6, 2025 07:51 1m 55s chartqa
[MM] Chartqa
Unit Tests #4162: Pull request #2544 synchronize by baberabb
February 6, 2025 07:51 6m 57s chartqa
fix early return for multuple dict (#2673)
Tasks Modified #4189: Commit 144a1e5 pushed by baberabb
February 6, 2025 06:32 2m 0s main