GEMv2: multilingual NLG benchmarking in a single line of code.
(2022)
Presentation / Conference Contribution
GEHRMANN, S., BHATTACHARJEE, A., MAHENDIRAN, A., WANG, A., PAPANGELIS, A., MADAAN, A., MCMILLAN-MAJOR, A., SHVETS, A., UPADHYAY, A. and BOHNET, B. 2022. GEMv2: multilingual NLG benchmarking in a single line of code. In Proceedings of the 2022 Conference on empirical methods in natural language processing: system demonstrations, 7-11 December 2022, Abu Dhabi, UAE. Stroudsburg: Association for Computational Linguistics [online], pages 266-281. Available from: https://aclanthology.org/2022.emnlp-demos.27/
Evaluations in machine learning rarely use the latest metrics, datasets, or human evaluation in favor of remaining compatible with prior work. The compatibility, often facilitated through leaderboards, thus leads to outdated but standardized evaluati...