Skip to content

v0.8.0

Latest
Compare
Choose a tag to compare
@NathanHB NathanHB released this 24 Mar 10:46
· 14 commits to main since this release

What's new

Tasks

Metrics

Features

Better logging

Inference providers

Load details to be evaluated

  • Implemented the possibility to load predictions from details files and continue evaluating from there by @JoelNiklaus in #488

sglang support

Bug Fixes and refacto

Thanks

Huge thanks to Hyneck, Lewis, Ben, Agustín, Elie and everyone helping and and giving feedback 💙

Significant community contributions

The following contributors have made significant changes to the library over the last release:

  • @hynky1999
    • Extractive Match metric (#495)
    • Fix math extraction (#503)
    • Bump up the latex2sympy2_extended version + more tests (#510)
    • Math extraction - allow only trying the first match, more customizable latex extraction + bump deps (#522)
    • add missing inits (#524)
    • Sync Math-verify (#535)
    • Make BLEURT lazy (#536)
    • Bug fix extractive match (#540)
    • Fix VLLM data-parallel (#541)
    • VLLM + Math-Verify fixes (#603)
  • @plaguss
    • Add extended task for LiveCodeBench codegeneration (#548)
    • Add subsets for lcb (#587)
  • @Jayon02
    • Let lighteval support sglang (#552)
  • @NathanHB
    • adds olympiad bench (#521)
    • Fix loading of vllm model from files (#533)
    • [VLLM] Allows for max tokens to be set in model config file (#547)
    • allows better flexibility for litellm endpoints (#549)
    • raise exception when generation size is more than model length (#571)
    • Push details without converting fields to str (#572)
    • adds aime24, 25 and math500 (#586)
    • adds inference providers support (#616)
    • vllm fix sampling params (#625)
    • relax deps for tgi (#626)
    • log model config (#627)