Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More
The Arabic-Leaderboards Space unifies Arabic evaluations in one place. It houses AraGen-03-25 and the Arabic Instruction Following leaderboard, with plans to add more modalities. The AraGen-03-25 release expands the benchmark to 340 pairs covering QA, reasoning, and orthography, and uses blind testing for fair evaluation. Claude-3.5-Sonnet results are also shared to invite community review.