Sketch Engine is a corpus query system with text analysis tools for text corpora in 100 languages.

Joined September 2012
📚 Adam Kilgarriff helped many of us see language through data. The prize in his name recognises outstanding work in lexicography and language technology. Applications are open. #lexicography #NLP kilgarriff.co.uk/prize/
Want to understand your corpus at a glance? The Corpus info page now shows average word and sentence counts per document or paragraph, helping you inspect and compare corpora faster. sketchengine.eu/guide/corpus… #NLP #TextAnalysis
From the ELEXAI kick-off meeting in Ljubljana. Lexical Computing is now part of this EU Horizon project, alongside 24 other partners. #ELEXAI builds on the previous @ELEXIS_EU project to improve #LLMs with lexicographic data and apply LLMs in #lexicography.
How many languages can you meet at Polyglot Gathering 2026 in Brno 🇨🇿? We’ve met lots already! Come by our booth and find out how many languages you can explore in Sketch Engine. Happy to be here and chat with fellow language lovers. sketchengine.eu/ #PolyglotGathering
Our new Romanian Trends corpus is now available! With 370M words and daily updates, it lets you explore contemporary Romanian and follow language trends as they develop over time. ske.li/romanian_trends #Romanian #NLP #CorpusLinguistics
Our new Turkish Trends corpus is out! With 170M words and daily updates, it helps you study current Turkish and track language trends over time. sketchengine.eu/turkish-tren… #Turkishlanguage #CorpusLinguistics #NLP
A few moments from #LREC2026 in Mallorca ☀️ Ondra and Honza presenting posters, Ondřej and Vlasta welcoming visitors at our booth. Great to meet researchers, linguists, and NLP practitioners from around the world. #NLP #ComputationalLinguistics
🚨 Early-bird for Lexicom 2026 ends 14 May! Save 20% on this 5-day workshop covering digital #lexicography, corpus-based dictionary building & AI in lexicography. 📍 Palermo 🇮🇹 📅 14–18 Sept 2026 🔗 lexicom.courses/lexicom-2026… #CorpusLinguistics #NLP
📚 Have you worked on a dictionary, corpus, or language tool? The Adam Kilgarriff Prize recognises outstanding work in #Lexicography and #LanguageTechnology. Apply or share with someone who should 👇 kilgarriff.co.uk/prize/
Save 20% with early bird registration for Lexicom 2026. Join this 5-day workshop on digital #lexicography, corpus-based #dictionary building, and AI use in lexicography. Palermo 🇮🇹, 14–18 September. lexicom.courses/lexicom-2026…
At last – a corpus with a bigger ego than a Sandalwood hero. 🎬 The new Kannada Trends in Sketch Engine is now officially the largest Kannada corpus, with 30 million words. That is so much data that if you tried to read it all, you would probably grow old, master Bisi Bele Bath 🍲, and still have time to be crowned king of linguistics. 👑🌍 It is basically a digital ocean of Kannada – and finding more of the language in one place would be quite a challenge. sketchengine.eu/kannada-tren… #corpuslinguistics #NLP
Join the 25th edition of Lexicom 2026 in Palermo 🇮🇹, 14–18 September! Be part of a workshop in #lexicography, #corpuslinguistics, #dictionaries, and lexical computing, attended by 700 participants worldwide. 🔗 lexicom.courses/lexicom-2026…
Explore how Afrikaans words change through time with our latest corpus, ideal for discovering #neologisms and emerging language trends. ske.li/afrikaans_trends
🔹 A very small announcement: We’ve just published the Dot corpus, the tiniest text corpus in the world. You’ll read it in no time! 🌐 sketchengine.eu/dot-corpus/ 👉 app.sketchengine.eu/#dashboa…
Explore how Malay words change through time with our latest corpus, ideal for discovering #neologisms and emerging language trends. ske.li/malay_trends
460M words of 🇲🇹 Maltese language data now available in one corpus. A useful resource for research and #NLP on this unique Semitic language written in the Latin script. Special thanks to @UMmalta for making this possible. sketchengine.eu/maltese-refe… #corpuslinguistics