Tristan Thrush

Tristan Thrush

Users
Tweets

8 Apr 2022

New ACL 2022 System Demo paper! It used to take a lot of technical effort to set up custom AI tasks, evaluate models, and collect crowdworker data with models in-the-loop. We’ve added a new framework to @DynabenchAI that aims to help: Dynatask. arxiv.org/abs/2204.01906 1/3

Max Bartolo

Max Bartolo

@max_nlp

7 Apr 2022

Replying to @sleepinyourhat

Yes, exactly - in the Dynatask validation setup: some right, some wrong, and some we expect will get flagged.

AK

@_akhaliq

6 Apr 2022

Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks abs: arxiv.org/abs/2204.01906

Toronto Machine Learning Society (TMLS)

Toronto Machine Learning Society (TMLS)@TMLS_TO

22 Oct 2021

@samlightstone Software Engineer at @facebookai will introduce and demo a new open-source paradigm for AI benchmarking called #DynaTask that uses dynamic adversarial data collection to evaluate #AI models & assess how easily an AI can be fooled by humans Know more at #TMLS2021!

Asif Razzaq

Asif Razzaq @asifrazzaq1988

24 Sep 2021

Facebook AI Unveils Dynatask, A New Paradigm For Benchmarking AI, Enabling Custom NLP Tasks For AI Community Quick Read: marktechpost.com/2021/09/24/… #AI #ArtificialIntelligence #NLP #BigData #DataScience @facebookai

Max Bartolo

Max Bartolo

@max_nlp

24 Sep 2021

Super exciting news: You can now create your own Dynabench tasks using Dynatask!

AI at Meta

@AIatMeta

24 Sep 2021

Today, we’re unlocking @DynabenchAI, a first-of-its-kind platform for dynamic AI benchmarking. AI researchers can now create their own custom tasks to better evaluate the performance of #NLP models in more dynamic, & realistic settings for free. ai.facebook.com/blog/dynatas…