This account is no longer maintained. We are now Argilla. Follow us @argilla_io

Joined April 2016
Pinned Tweet
20 Oct 2021
🎉 Excited to release Selectra (Spanish Electra), a new set of models on the @huggingface Hub that are 3-5x smaller than current SOTA Spanish models while achieving competitive results. 🧵 Overview below (1/4). Thanks to @GoogleAI TPU RC for their support! #python #opensource #nlproc
25 Oct 2022
🥳 We're extremely excited to announce we're now Argilla. Please don't forget to follow us @argilla_io. There are many more exciting things coming up! Read more at: argilla.io/blog/recognai-rub… #python #opensource #nlproc
Recognai retweeted
Find text classification label errors with @CleanlabAI and correct them with @rubrixml: rubrix.readthedocs.io/en/sta… #python #opensource #NLProc
Recognai retweeted
Get started with NLP with custom datasets. Create and label datasets for text classification, token classification, and text generation: rubrix.readthedocs.io/en/sta… #python #opensource #datascience
Recognai retweeted
Don't have a lot of time to annotate data? SetFit and Rubrix: few-shot classification with custom data 🤓 rubrix.readthedocs.io/en/doc… #nlproc #datascience #opensource
Recognai retweeted
6 Oct 2022
⚡ New release 0.18.0 > Better token classification validation > Delete records by id & query for better dataset management > New tutorials! Thanks to our community contributors @AnkushChander, Tom Aarsen, & others: github.com/recognai/rubrix/r… #python #nlproc #opensource
Recognai retweeted
SetFit: Efficient few-shot learning with Sentence Transformers. So exciting! Train robust models with very few examples, with fast training, fast inference, and results comparable to or better than LLMs and prompt-based methods. github.com/huggingface/setfi… #python #opensource #NLProc
Recognai retweeted
Active learning for text classification with @rubrixml and the wonderful small-text library by @webis_de. Learn how to build a custom active learning loop and teach a 🤗 transformers model: rubrix.readthedocs.io/en/mas… #python #opensource #NLProc
Recognai retweeted
Want to analyze prediction explanations from your Transformer models? At the dataset level? A new tutorial using SHAP and Transformers Interpret: rubrix.readthedocs.io/en/mas… #python #opensource #xai
Recognai retweeted
humap: Hierarchical Uniform Manifold Approximation and Projection. A very cool method and library by @EstecioJunior. Reduces visual burden when exploring clusters in large datasets and enables drill-down with hierarchical levels. github.com/wilsonjr/humap #python #opensource #umap
Recognai retweeted
Rubrix: the open-source framework for data-centric NLP. Build human-in-the-loop workflows for data annotation, monitoring, and review. github.com/recognai/rubrix Follow @rubrixml for updates. #python #nlp #opensource
Recognai retweeted
What can we learn from model predictions vs. training data labels? * Ambiguous examples * (Some) wrong labels * Model improvement patterns. A reproducible example using the @stanfordnlp sentiment treebank dataset & @rubrixml: huggingface.co/datasets/rubr… #python #opensource #NLProc
Recognai retweeted
Weak supervision for multilabel text classification. Get instant statistics about heuristics' coverage and precision with the @rubrixml UI. Define rules programmatically with Python. Tutorial: rubrix.readthedocs.io/en/sta… #opensource #datacentricai #python
Recognai retweeted
Every good model starts with good quality datasets. Iteration and collaboration are key ingredients to achieve this. Here's how you can iterate on data and models using the Hugging Face Hub. rubrix.readthedocs.io/en/sta… #nlproc #datascience #opensource
Recognai retweeted
Fine-tuning a sentiment classifier starting with no labeled data with @rubrixml: rubrix.readthedocs.io/en/mas… Follow @rubrixml for more resources like this one. If you love NLP & open-source, join our friendly community: join.slack.com/t/rubrixworks… #python #opensource #nlp #transformers
Recognai retweeted
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision. 1️⃣ Noisy labels using Wikidata and gazetteers (distant labels) 2️⃣ Fine-tune RoBERTa for NER with distant labels 3️⃣ Self-training. github.com/cliang1453/BOND arxiv.org/abs/2006.15509 #python #NLProc
Recognai retweeted
Stanza by @stanfordnlp is powerful for NER. Want to see how well it performs with your data? 👇 gist.github.com/dvsrepo/239f… New to @rubrixml? github.com/recognai/rubrix Join the community: join.slack.com/t/rubrixworks… #python #nlp #opensource #datascience