New ACL 2022 System Demo paper!
It used to take a lot of technical effort to set up custom AI tasks, evaluate models, and collect crowdworker data with models in the loop. We’ve added a new framework to @DynabenchAI that aims to help: Dynatask.
arxiv.org/abs/2204.01906
1/3
@samlightstone, Software Engineer at @facebookai, will introduce and demo #DynaTask, a new open-source paradigm for AI benchmarking that uses dynamic adversarial data collection to evaluate #AI models & assess how easily humans can fool them.
Learn more at #TMLS2021!
Today, we’re unlocking @DynabenchAI, a first-of-its-kind platform for dynamic AI benchmarking.
AI researchers can now create their own custom tasks, for free, to better evaluate the performance of #NLP models in more dynamic & realistic settings. ai.facebook.com/blog/dynatas…