**Day 5/100** Today I went deep into **Stemming in NLP** – one of the most important text preprocessing techniques!

Here are **10 key points** you must know about Stemming:

- Stemming is the process of reducing words to their **root/stem** form (e.g., "running" → "run", "happily" → "happili" with Porter)
- Goal: Treat different forms of the same word as one (improves text similarity & reduces vocabulary size)
- Works by **chopping off affixes (mostly suffixes)** using heuristic rules (faster but less accurate than lemmatization)
- Most famous algorithm: **Porter Stemmer** (1980) – still widely used, 5 phases of rules
- Another popular one: **Snowball Stemmer** (improved Porter, supports 15 languages)
- Lancaster Stemmer = super aggressive (sometimes over-stems → "university" → "univers")
- Doesn’t always produce real words → "studies" → "studi", "flies" → "fli" (this is normal!)
- Language dependent – English stemmers won’t work well on Hindi/French without specific versions
- Use cases: Information Retrieval, Search Engines, Text Classification, Topic Modeling
- Trade-off: Speed ↑↑ | Accuracy ↓↓ (vs Lemmatization which uses POS tags & dictionary)

Master stemming = cleaner data = better NLP models! @clcoding

Which stemmer do you use the most? Drop it below 👇

#100DaysOfCode #Day5 #NLP #NaturalLanguageProcessing #DataScience #MachineLearning #Python #TextPreprocessing #Stemming #AI #DeepLearning #CodingJourney #TechTwitter #WomenWhoCode #DataScientists

Keep grinding, see you tomorrow for Day 6! 🔥
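To make the "heuristic rules" idea in the post above concrete, here is a minimal sketch of just the first rule group (step 1a) of the Porter algorithm, which handles plural endings. It is not a full stemmer: the remaining steps are omitted, and the function name `porter_step_1a` is made up for this illustration.

```python
# Step 1a of the Porter stemmer: plural-suffix rules, tried longest-first.
# Only the first matching rule is applied. All later Porter steps are omitted,
# so this is an illustration of rule-based suffix stripping, not a full stemmer.
STEP_1A_RULES = [
    ("sses", "ss"),  # caresses -> caress
    ("ies", "i"),    # flies    -> fli
    ("ss", "ss"),    # caress   -> caress (unchanged; blocks the "s" rule)
    ("s", ""),       # cats     -> cat
]


def porter_step_1a(word: str) -> str:
    """Apply Porter step 1a to a single word (hypothetical helper name)."""
    word = word.lower()
    for suffix, replacement in STEP_1A_RULES:
        if word.endswith(suffix):
            return word[: len(word) - len(suffix)] + replacement
    return word


if __name__ == "__main__":
    for w in ["caresses", "flies", "caress", "cats", "run"]:
        print(w, "->", porter_step_1a(w))
```

The rules only look at the spelling of the ending, which is why the output can be a non-word such as "fli".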
26 Jul 2025
Day 6: Completed text preprocessing. Covered > Stop words > Part-of-speech tagging using NLTK > Named Entity Recognition. Will start text --> vector conversion from tomorrow. Goooddniighhttttt people
25 Jul 2025
Day 5: Covered > Text preprocessing: stemming using NLTK > Porter Stemmer > Snowball Stemmer > WordNet Lemmatizer
Day 23 of Summer Internship Program on Machine Learning & Deep Learning #Stanley #SummerInternship2025 #MachineLearning #DeepLearning #NLP #Tokenization #WordEmbeddings #TextPreprocessing #Word2Vec #GloVe #TextAnalytics
🔥 Read our Paper 📚 Machine Learning-Based Text Classification Comparison: Turkish Language Context 🔗 mdpi.com/2076-3417/13/16/942… 👨‍🔬 by Yehia Ibrahim Alzoubi et al. #Turkishtexts #machinelearning #textpreprocessing @aum_kwtt
Text Preprocessing - Embark on a journey through the world of AI with our latest thread! We’re breaking down Text Preprocessing into bite-sized insights. Perfect for anyone curious about how AI turns chaos into clarity. 🧠✨ #AI #NLP #TextPreprocessing
On my new blog (lnkd.in/dUKryUcw), you can access all my NLP tutorials, including Python/Keras code. Enjoy! #python #nlp #textclassification #textgeneration #textpreprocessing #MuratKarakayaAkademi #muratkarakaya.net lnkd.in/dxFwSU2U

Interesting achievement by @GoogleAI with faster tokenisation time and shorter running time #NLP #Tokenization #TextPreprocessing
10 Dec 2021
Preparing text for processing by an #ML model often involves tokenization, in which text is split into smaller units (e.g., words or word segments). Today we present a new approach that speeds up the process by up to 8x compared to standard methods. goo.gle/3dKHhVi
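As a rough illustration of splitting text into "words or word segments", here is a sketch of the plain greedy longest-match-first approach to subword tokenization. It is not the faster method the post announces. The vocabulary, the `##` continuation marker, the `[UNK]` token, and all function names are assumptions chosen for this example.

```python
import re

# Hypothetical toy vocabulary. "##" marks a piece that continues a word.
VOCAB = {"token", "##ization", "##ize", "is", "fast", "[UNK]"}


def split_words(text: str) -> list[str]:
    """Lowercase the text and split it into words and punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text.lower())


def split_subwords(word: str, vocab: set[str] = VOCAB) -> list[str]:
    """Greedily take the longest vocabulary piece at each position."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate  # non-initial piece
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            return ["[UNK]"]  # no piece fits: the whole word is unknown
        pieces.append(match)
        start = end
    return pieces


def tokenize(text: str) -> list[str]:
    return [piece for word in split_words(text) for piece in split_subwords(word)]


if __name__ == "__main__":
    print(tokenize("Tokenization is fast"))
```

With this toy vocabulary, "tokenization" comes out as two segments ("token" plus "##ization"), while any word with no matching pieces collapses to `[UNK]`.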
4 Jul 2021
Join @amal_menzli in discovering the complexity and the importance of #Tokenization in #NLP tasks – there are some great examples and tools you can use to practice and perfect this way of #TextPreprocessing 👇 bit.ly/3jy7wCH

RT TF-IDF : A visual explainer and Python implementation on Presidential Inauguration Speeches dlvr.it/RryM9t #textpreprocessing #python #naturallanguageprocessi #education
Drop your answers in comments below!!! Join this free course & get started with #NaturalLanguageProcessing #NLP - learn how to extract information from unstructured text, basics of NLP, Regular Expressions and #TextPreprocessing >>> buff.ly/3f8InsJ
26 May 2020
#NLP is one of the hottest trends in #MachineLearning and #AI right now. Get a high-level view of what it is and how it works in under 5 minutes with this #blog post. | bit.ly/2TGCPgL #NLPbasics #naturallanguageprocessing #textpreprocessing #NLPusecases #MLbasics

7 May 2020
How does one prepare a text before processing it? Read our new article on #textpreprocessing for #NLP and #MachineLearning tasks. medium.com/sciforce/text-pre…

Word cloud of Pablo Neruda's 'Cien Sonetos de Amor' in Python...because data science just brings out the humanities graduate in you #pabloneruda #ciensonetosdeamor #textpreprocessing #datascience
Basic Text Preprocessing Steps in NLP. Check out our image gallery for more such infographics: ashutoshtripathi.com/gallery… #Textpreprocessing #Naturallanguageprocessing #MachineLearning #DataScience @ImAshutoshTr @machine_ml @AnalyticsVidhya @Analyticsindiam
14 Mar 2019
New post => How to Prepare Text Data for Machine Learning with scikit-learn Link => wp.me/p6WwF2-yc Techniques for Preparing and building a Machine Learning & Deep Learning model from text data #MachineLearning #SentimentAnalysis #NLP #textpreprocessing