Release with a pip package
What's Changed
- Added BLEU metric by @ArtemVazh in #213
- New model loading scripts by @rvashurin in #211
- Claim-level evaluation for Chinese and Arabic languages by @alfekka and @ruixing76
- Instruct-tuned models support and reflexive methods by @rvashurin in #221
- Blackbox models support via OpenAI API by @yobeen in #228
- Rejection rate limit for PRR calculation by @ArtemVazh in #230
Full Changelog: https://github.com/IINemo/lm-polygraph/compare/v0.3.0..v0.4.0