Filter
Exclude
Time range
-
Near
New ACL 2022 System Demo paper! It used to take a lot of technical effort to set up custom AI tasks, evaluate models, and collect crowdworker data with models in-the-loop. We’ve added a new framework to @DynabenchAI that aims to help: Dynatask. arxiv.org/abs/2204.01906 1/3
2
12
45
7 Apr 2022
Replying to @sleepinyourhat
Yes, exactly - in the Dynatask validation setup: some right, some wrong, and some we expect will get flagged.
2
6 Apr 2022
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks abs: arxiv.org/abs/2204.01906
1
11
@samlightstone Software Engineer at @facebookai will introduce and demo a new open-source paradigm for AI benchmarking called #DynaTask that uses dynamic adversarial data collection to evaluate #AI models & assess how easily an AI can be fooled by humans Know more at #TMLS2021!
1
5
Facebook AI Unveils Dynatask, A New Paradigm For Benchmarking AI, Enabling Custom NLP Tasks For AI Community Quick Read: marktechpost.com/2021/09/24/… #AI #ArtificialIntelligence #NLP #BigData #DataScience @facebookai
3
7
5
24 Sep 2021
Super exciting news: You can now create your own Dynabench tasks using Dynatask!
24 Sep 2021
Today, we’re unlocking @DynabenchAI, a first-of-its-kind platform for dynamic AI benchmarking. AI researchers can now create their own custom tasks to better evaluate the performance of #NLP models in more dynamic, & realistic settings for free. ai.facebook.com/blog/dynatas…
1
5
16