NetMiner

Mar 25

Topics alone don’t reveal the full story. Topic network analysis shows how themes connect across documents. Learn how to build a BERTopic-based topic network and identify hub topics in NetMiner. 👉 netminer.medium.com/topic-ne… #TopicModeling #TextMining #NetworkAnalysis

Topic Network: Exploring Discourse and Knowledge Structures

No-Code Text Analysis

netminer.medium.com

505
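The idea in the post above (link topics that appear in the same documents, then look for hubs) can be sketched in a few lines. This is only an illustration under stated assumptions: it does not use NetMiner or the BERTopic API, the function names `build_topic_network` and `hub_topics` are hypothetical, and the per-document topic lists are made-up sample data. "Hub" here simply means the topic with the highest weighted degree.

```python
from collections import Counter
from itertools import combinations


def build_topic_network(doc_topics):
    """Count, for every pair of topics, how many documents contain both."""
    edges = Counter()
    for topics in doc_topics:
        # sorted(set(...)) gives each pair one canonical orientation
        for a, b in combinations(sorted(set(topics)), 2):
            edges[(a, b)] += 1
    return edges


def hub_topics(edges, top_n=1):
    """Rank topics by weighted degree (sum of incident edge weights)."""
    degree = Counter()
    for (a, b), weight in edges.items():
        degree[a] += weight
        degree[b] += weight
    return [topic for topic, _ in degree.most_common(top_n)]


# Hypothetical topic assignments for four documents (not real data).
doc_topics = [
    ["health", "policy"],
    ["health", "economy"],
    ["health", "policy", "education"],
    ["economy", "education"],
]
edges = build_topic_network(doc_topics)
hubs = hub_topics(edges, top_n=1)
print(hubs)
```

In practice the topic lists would come from a fitted topic model, for example by keeping every topic whose probability for a document exceeds some threshold; that thresholding step is also an assumption, not something the post specifies.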

BJANALYTICS @BJANALYTICS

Jan 19

How NLP Detects Public Health Trends from Text Data youtu.be/3iK-5lXoXuQ?si=ixsA… via @YouTube 🎧Listen to the audio version here: podcasts.apple.com/us/podcas… #naturallanguageprocessing #nlp #datascience #publichealth #textmining #topicmodeling #LDA #population

Amirhossein Abaskohi @AmirAbaskohi

5 Nov 2025

🔍 Deep Dive: Why CEMTM Redefines Multimodal Topic Modeling
At EMNLP 2025 in Suzhou, I’ll be presenting CEMTM (Contextual Embedding-based Multimodal Topic Modeling), a model that rethinks how we discover topics in multimodal documents by moving entirely into the contextual embedding space. Unlike classical or contextualized topic models such as CWTM, which rely on Dirichlet priors and discrete sampling, CEMTM operates with continuous variational inference, enabling both semantic precision and computational efficiency. Here’s what makes it stand out:
🚀 Key Contributions
1. Multimodal Topic Learning: CEMTM unifies text, image, and structural data under a shared embedding space. Topics are no longer word distributions; they are semantic clusters of contextual embeddings that span modalities.
2. Contextual Embedding Alignment: Each token (word, visual patch, or table element) is attracted to its topic vector in the embedding space, replacing Dirichlet sparsity with differentiable optimization. This enforces semantic cohesion within topics.
3. Cross-Modal Coherence Regularization: A novel coherence term maximizes cosine similarity among the top tokens of each topic, even across modalities, so that text and visual components that convey the same concept naturally align.
4. Variational Efficiency: Without Dirichlet sampling or vocabulary-wide softmax operations, CEMTM achieves up to 3× faster training and 5–10× faster inference, fully leveraging GPU-parallelizable vector operations.
5. State-of-the-Art Topic Quality: On multiple multimodal datasets, CEMTM outperforms prior models like CWTM, MMNTM, and ZeroShot-LDA in both coherence and diversity, demonstrating that contextualized multimodal alignment leads to more interpretable and scalable topic discovery.
🧠 The Takeaway
CEMTM shows that topic modeling can evolve beyond discrete words and priors. By clustering contextual embeddings directly and optimizing cross-modal coherence, it enables interpretable, efficient, and semantically rich topic discovery across heterogeneous documents.
📍 Presentation: Poster Session, Wednesday, Nov 5 · 16:30–18:00 · Hall C (EMNLP 2025, Suzhou)
📄 Paper: arxiv.org/abs/2509.11465
#EMNLP2025 #MultimodalAI #DeepResearch #TopicModeling #ChartUnderstanding #QuestionAnswering #LLMs #Research
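The quantity the post describes in point 3, cosine similarity among a topic's top tokens, can be illustrated with a small sketch. This is not the CEMTM implementation: the function names `cosine` and `topic_coherence` are hypothetical, the two-dimensional vectors are toy stand-ins for contextual embeddings, and the score below is just the mean pairwise cosine similarity, which a training objective could then try to increase.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def topic_coherence(embeddings):
    """Mean pairwise cosine similarity over a topic's top-token embeddings."""
    pairs = list(combinations(embeddings, 2))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


# Toy embeddings (assumed data): one tightly aligned topic, one scattered topic.
tight = [[1.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
loose = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]

print(topic_coherence(tight), topic_coherence(loose))
```

A topic whose top tokens point in the same direction scores near 1, while a scattered topic scores lower; the same comparison works whether a vector came from a word or from an image patch, as long as both live in one embedding space.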

824

Amirhossein Abaskohi @AmirAbaskohi

5 Nov 2025

✨ Excited to be presenting three papers at EMNLP 2025 in Suzhou this week! 🇨🇳 I'll be showcasing our recent work on multimodal reasoning, chart understanding, and few-shot data synthesis, exploring how language models can better connect vision, text, and structured information for deeper understanding.
📍 Poster Sessions, Hall C
🧩 CEMTM: Contextual Embedding-based Multimodal Topic Modeling
📅 Wednesday, Nov 5 · 16:30–18:00
> A framework for contextualized topic discovery across multimodal corpora by aligning visual and textual embeddings.
📊 ChartGaze: Enhancing Chart Understanding in LVLMs with Eye-Tracking Guided Attention Refinement
📅 Wednesday, Nov 5 · 16:30–18:00
> We integrate human gaze supervision to improve LVLM interpretability and reasoning over charts.
🔍 FM²DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering (Findings)
📅 Friday, Nov 7 · 12:30–13:30
> A pipeline for synthesizing multimodal QA data via cross-model knowledge distillation and multihop reasoning.
If you’re attending EMNLP, come by and chat! I’d love to connect and discuss multimodal deep research agents, attention interpretability, and data synthesis for reasoning tasks.
#EMNLP2025 #MultimodalAI #DeepResearch #TopicModeling #ChartUnderstanding #QuestionAnswering #LLMs #Research

759

Amirhossein Abaskohi @AmirAbaskohi

17 Sep 2025

Have you ever tried to make sense of thousands of documents (long documents with multiple figures in them) and wished for a way to automatically uncover their main themes? 📚🖼️ That’s where topic modeling comes in. We’re excited to introduce CEMTM, a new state-of-the-art multimodal topic model and the first to handle long multimodal documents at scale! 🚀 CEMTM is a framework for interpretable multimodal topic modeling. Unlike prior models, it: 🔹 Leverages fine-tuned large vision–language models (LVLMs) to unify text and image information into contextual embeddings. 🔹 Introduces a distributional importance network that learns which words and image regions truly matter for topic inference. 🔹 Aligns topics with document-level semantics through a reconstruction objective, ensuring coherence across modalities. 🔹 Produces explicit word–topic and document–topic distributions, preserving interpretability while scaling to long, multimodal documents. Across six benchmark datasets, CEMTM sets new state-of-the-art results in topic quality and diversity, while also proving useful for downstream tasks like few-shot retrieval and multimodal QA. In short, it shows how multimodal grounding and structured topic modeling can enable better corpus exploration, retrieval, and reasoning. 📄 Paper: arxiv.org/abs/2509.11465 💻 Code: github.com/AmirAbaskohi/CEMT… A huge thanks to my supervisors, @careninigiusepp and @JotyShafiq from @SFResearch, and to all of my collaborators: @liraymond96 and @ChuyuanLi. Looking forward to sharing this work at EMNLP 2025 in China 🇨🇳. Hope to see you there! #EMNLP2025 #NLP #MultimodalAI #TopicModeling #AIResearch #LLM #LargeLanguageModels

267

Amirhossein Abaskohi @AmirAbaskohi

17 Sep 2025

Ever tried answering a complex question that requires digging through multiple research papers, combining text, tables, and even figures to find the answer? That’s the essence of multimodal multihop question answering (MMQA), and it’s critical for real-world tasks like interpreting medical records, analyzing educational documents, or conducting deep research across long multimodal content. Today, most people turn to large APIs for this, but that’s costly and often impractical. A promising alternative is to build smaller expert models fine-tuned on the right data, models that can perform complex multimodal reasoning without requiring massive compute. Excited to share that our paper FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering has been accepted to EMNLP 2025 Findings! 🎉 FM2DS introduces the first scalable framework for synthesizing high-quality MMQA datasets from long multimodal documents. Our five-stage pipeline automatically generates and validates realistic QA pairs, enabling smaller models to match, and even surpass, those trained on expensive human-labeled data. We also release M2QA-Bench, the first benchmark for MMQA on long documents, to push research forward in this space. 📄 Paper: arxiv.org/abs/2412.07030 💻 Code: github.com/ServiceNow/FM2DS 📊 Benchmark: huggingface.co/datasets/Amir… Big thanks to @ServiceNowRSRCH and my mentors @gspandana, @careninigiusepp, and @ILaradji for this collaboration. Looking forward to seeing everyone in China 🇨🇳 for EMNLP 2025! #EMNLP2025 #NLP #MultimodalAI #TopicModeling #AIResearch #LLM #LargeLanguageModels

1,184

Salesforce AI Research

@SFResearch

27 Aug 2025

@emnlpmeeting / #EMNLP2025 Accepted Paper: CEMTM: Contextual Embedding-based Multimodal Topic Modeling 📝 Paper: bit.ly/3JsZFFy This work introduces CEMTM, a context-enhanced multimodal topic model that leverages fine-tuned large vision-language models to infer coherent topic structures from documents containing both text and images. The approach uses distributional attention mechanisms to weight token-level contributions and aligns topic representations through reconstruction objectives. Key contributions: ➡️ Holistic multimodal document encoding using pretrained LVLM embeddings without separate modality encoders ➡️ Distributional attention mechanism for learning token importance and improving semantic alignment ➡️ Reconstruction-based training objective that preserves cross-modal semantics in topic structures ➡️ Strong performance across six benchmarks with average LLM coherence score of 2.61 Results demonstrate significant improvements over unimodal and multimodal baselines, with effectiveness shown in downstream few-shot retrieval tasks and ability to capture visually grounded semantics. 👥 Authors: Amirhossein Abaskohi @AmirAbaskohi, Raymond Li, Chuyuan Li @ChuyuanLi, Shafiq Joty @JotyShafiq, and Giuseppe Carenini @careninigiusepp #FutureOfAI #EnterpriseAI #NLP #MachineLearning #MultimodalAI #TopicModeling

474

Blockchain HC Today @BHTYJournal

2 Aug 2025

New Research Published: Impact of COVID-19 on Primary Health Care Research Trends and Suggestions for Better Services Approaches Via… blockchainhealthcaretoday.co… #blockchain #blockchaininhealthcare #Covidresearch #blockchaintech #COVID19, #coronavirus, #primarycare #primaryhealthcare, #topicmodeling

135

Nick Byrd, Ph.D.@byrd_nick

11 Jul 2025

How do #bioethics and #PhilosophyOfMedicine relate? Enter Vilius Dranseika with cool #webScraping, #topicModeling, and #dataViz! #PhilMed was more than a branch of #PhilSci, involving #epistemology, #metaethics, and more. It wasn't clear whether Bioethics is part of PhilMed.

Prominence ratios of topics in Philosophy of Medicine and Bioethics journals.

Prominence ratio = Mean prominence (of a topic/cluster) in Philosophy of Medicine journals / Mean prominence (of a topic/cluster) in Bioethics journals
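As a worked illustration of the prominence ratio defined in the caption above, here is a minimal sketch. The function name `prominence_ratio` and the per-journal prominence values are hypothetical; only the formula (mean prominence in Philosophy of Medicine journals divided by mean prominence in Bioethics journals) comes from the caption.

```python
from statistics import mean


def prominence_ratio(philmed_prominence, bioethics_prominence):
    """Mean topic prominence in PhilMed journals over that in Bioethics journals."""
    return mean(philmed_prominence) / mean(bioethics_prominence)


# Made-up prominence values for one topic, one number per journal.
philmed = [0.30, 0.20, 0.10]    # mean is about 0.20
bioethics = [0.05, 0.10, 0.15]  # mean is about 0.10

ratio = prominence_ratio(philmed, bioethics)
print(round(ratio, 2))
```

A ratio above 1 means the topic is more prominent in Philosophy of Medicine journals; a ratio below 1 means it is more prominent in Bioethics journals.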

A network of keywords with connection strength visualized and color-coded to distinguish philosophy of medicine (red), institution-based applied ethics (green), individual-based applied ethics (purple), and "beginning and end of life" (blue).

A two-dimensional plot of philosophy of medicine and bioethics prominence ratios (x axis) as well as topic prominence and philosophy citations correlation (y axis) — there was a positive correlation.

1919 Google Scholar profiles with self-selected keywords containing 'bioethics' OR 'philosophy of medicine'

ratio = (proportion of profiles containing the keyword among profiles CONTAINING 'philosophy of medicine') / (proportion of profiles containing the keyword among profiles NOT CONTAINING 'philosophy of medicine')
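The keyword ratio in this caption can be computed as below. This is a sketch under assumptions: the function name `keyword_ratio` is hypothetical, profiles are modeled as sets of self-selected keywords, and the six sample profiles are invented for illustration (the study itself used 1919 Google Scholar profiles).

```python
def keyword_ratio(profiles, keyword, marker="philosophy of medicine"):
    """Share of marker profiles listing `keyword`, over the share among the rest."""
    with_marker = [p for p in profiles if marker in p]
    without_marker = [p for p in profiles if marker not in p]
    share_with = sum(keyword in p for p in with_marker) / len(with_marker)
    share_without = sum(keyword in p for p in without_marker) / len(without_marker)
    return share_with / share_without


# Invented sample profiles, each a set of self-selected keywords.
profiles = [
    {"philosophy of medicine", "epistemology"},
    {"philosophy of medicine", "bioethics", "epistemology"},
    {"bioethics", "epistemology"},
    {"bioethics", "clinical ethics"},
    {"bioethics", "public health"},
    {"bioethics", "law"},
]

ratio = keyword_ratio(profiles, "epistemology")
print(ratio)  # 2/2 of marker profiles vs 1/4 of the others -> 4.0
```

The sketch raises `ZeroDivisionError` if either group is empty or if no profile outside the marker group lists the keyword; a real analysis would need to decide how to handle those cases.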

297

DH_Potsdam @dh_potsdam@hcommons.social @DH_Potsdam

2 Apr 2025

When was the last time you saw someone teach #DigitalHumanities with a chalkboard? 🧑‍🏫 @cnDuKeli of DH Trier explains the machinery behind #TopicModeling during the 2nd day of the #DHSpringSchool

283

AIML.com

@OfficialAIML

25 Feb 2025

Machine Learning Interview Question 34: 𝐖𝐡𝐚𝐭 𝐢𝐬 𝐭𝐨𝐩𝐢𝐜 𝐦𝐨𝐝𝐞𝐥𝐢𝐧𝐠? 𝐃𝐢𝐬𝐜𝐮𝐬𝐬 𝐢𝐭𝐬 𝐰𝐨𝐫𝐤𝐢𝐧𝐠, 𝐚𝐩𝐩𝐥𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬, 𝐚𝐧𝐝 𝐭𝐡𝐞 𝐩𝐫𝐨𝐬 𝐚𝐧𝐝 𝐜𝐨𝐧𝐬 Answer Link: aiml.com/what-is-topic-model… Topic modeling has emerged as a highly useful technique in Natural Language Processing (NLP) for deriving meaningful insights from unstructured textual data. Examples of such data include articles, blog posts, customer reviews, emails, and social media posts. 👉 Learn how Topic Modeling works, where it's used, and its advantages and challenges in this article. The article is organized into the following topics ◾ About Topic Modeling ◾ Algorithms used for Topic Modeling ◾ How Topic Modeling works? ◾ Real world applications of Topic Modeling ◾ Advantages and disadvantages of using Topic Modeling -- 🚀 If you're preparing for Machine Learning interviews, go to AIML.com for top resources and insights 🔗 Link to Top 100 ML Interview Questions: aiml.com/top-100-machine-lea… 🌐 𝑨𝑰𝑴𝑳.𝒄𝒐𝒎 𝒊𝒔 𝒕𝒉𝒆 𝒘𝒐𝒓𝒍𝒅'𝒔 𝒍𝒂𝒓𝒈𝒆𝒔𝒕 𝒓𝒆𝒑𝒐𝒔𝒊𝒕𝒐𝒓𝒚 𝒐𝒇 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 𝒊𝒏𝒕𝒆𝒓𝒗𝒊𝒆𝒘 𝒒𝒖𝒆𝒔𝒕𝒊𝒐𝒏𝒔 𝒂𝒏𝒅 𝑸𝒖𝒊𝒛𝒛𝒆𝒔. (𝑨𝒍𝒍 𝑭𝑹𝑬𝑬) #aiml_com #machinelearning #topicmodeling #machinelearninginterview

548

Abimbola @bimbomuri

6 Jan 2025

I am looking for ways to identify emerging or implicit topics in feedback data, particularly sensitive issues, e.g., bullying or harassment. I am interested in methods that don’t rely on predefined labels or keywords. Has anyone tackled something similar? #NLP #AI #TopicModeling

443

Barbara S. Lancho Barrantes @BarbaraLancho

6 Jan 2025

I am excited to announce that our paper has been published online today. We analysed research publications on topic modeling using bibliometrics, as well as topic modeling itself! #bibliometrics #topicmodeling Artificial Intelligence Review link.springer.com/article/10…

Topic modelling through the bibliometrics lens and its technique

Artificial Intelligence Review - Topic modelling (TM) is a significant natural language processing (NLP) task and is becoming more popular, especially, in the context of literature synthesis and...

link.springer.com

156

🛸 𝑶𝒍𝒖𝒘𝒂𝒕𝒐𝒔𝒊𝒏 @ikev007

31 Dec 2024

Handling large volumes of user feedback can feel overwhelming. Surveys, reviews, support tickets, social media comments—where do you even start? 🤔 I wrote an article to help teams navigate this challenge with a structured approach. 🧵#TopicModeling #NMF #MachineLearning

546

André Bittermann @AndreBittermann

30 Oct 2024

Happy to announce that our #Rstats package ✨topiclabels✨ has been updated on #CRAN 🎉 🤖Using open #LLMs, our package automatically assigns a topic label to a bag of words. 🤝It works with all popular #TopicModeling packages! Find out more: 👉github.com/PetersFritz/topic…

155

AIML.com

@OfficialAIML

7 Oct 2024

Machine Learning Interview Question 34: 𝐖𝐡𝐚𝐭 𝐢𝐬 𝐭𝐨𝐩𝐢𝐜 𝐦𝐨𝐝𝐞𝐥𝐢𝐧𝐠? 𝐃𝐢𝐬𝐜𝐮𝐬𝐬 𝐢𝐭𝐬 𝐰𝐨𝐫𝐤𝐢𝐧𝐠, 𝐚𝐩𝐩𝐥𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬, 𝐚𝐧𝐝 𝐭𝐡𝐞 𝐩𝐫𝐨𝐬 𝐚𝐧𝐝 𝐜𝐨𝐧𝐬 Answer Link: aiml.com/what-is-topic-model… Topic modeling has emerged as a highly useful technique in Natural Language Processing (NLP) for deriving meaningful insights from unstructured textual data. Examples of such data include articles, blog posts, customer reviews, emails, and social media posts. 🌐 👉 Learn how Topic Modeling works, where it's used, and its advantages and challenges in this article. The article is organized into the following topics ◾ About Topic Modeling ◾ Algorithms used for Topic Modeling ◾ How Topic Modeling works? ◾ Real world applications of Topic Modeling ◾ Advantages and disadvantages of using Topic Modeling -- 🚀 If you're preparing for Machine Learning interviews, head to AIML.com for top resources and insights 🔗 Link to Top 100 ML Interview Questions: aiml.com/top-100-machine-lea… 🌐 𝑨𝑰𝑴𝑳.𝒄𝒐𝒎 𝒊𝒔 𝒕𝒉𝒆 𝒘𝒐𝒓𝒍𝒅'𝒔 𝒍𝒂𝒓𝒈𝒆𝒔𝒕 𝒓𝒆𝒑𝒐𝒔𝒊𝒕𝒐𝒓𝒚 𝒐𝒇 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 𝒊𝒏𝒕𝒆𝒓𝒗𝒊𝒆𝒘 𝒒𝒖𝒆𝒔𝒕𝒊𝒐𝒏𝒔 𝒂𝒏𝒅 𝑸𝒖𝒊𝒛𝒛𝒆𝒔. (𝑨𝒍𝒍 𝑭𝑹𝑬𝑬) #aiml_com #machinelearning #topicmodeling #machinelearninginterview

What is topic modeling? Discuss key algorithms, working, applications, and the pros and cons

Topic modeling is a machine learning technique used in text analysis to discover underlying topics or themes within a collection of documents. Read more..

aiml.com

359

SNOLA @snolaresearch

26 Sep 2024

Don't miss the webinar "The highs and lows of topic modelling ('I got my topics, what is next?')" with Dorin Stanciu (@utcluj)! 🗓️ October 8, 16:00-17:00. Register at forms.gle/nyHGCSdRJGVcYrvE8 Explore the applications of #TopicModeling in #LearningAnalytics and more.

Registration in the SNOLA webinar "The highs and lows of topic modeling ('I got my topics, what is...

This registration form collects only the email address of the person registering, so that the link for connecting to attend the... can be sent later.

docs.google.com

283

Gaurav Maharjan @ZERG220145

19 Jul 2024

#LSPPDay49 Continued my exploration of Topic Modeling today. I delved into its types: LSA (Latent Semantic Analysis) and LDA (Latent Dirichlet Allocation), and learned about their differences. #NLP #TopicModeling #60DaysOfLearning2024 #LearningWithLeapfrog @lftechnology

Clarivate for Academia & Government @ClarivateAG

3 Jul 2024

We are at @ICSSIConference! Ross Potter and Ann Beynon from @Clarivate spoke on using #TopicModeling to investigate the impact of academic research, while Anand Desai spoke on the impact of AI in #R&D evaluation with Frances Carter-Johnson from @NSF. icssi.org

508

Andres Karjus | also on 🟦bsky @AndresKarjus

29 May 2024

Attending 2 conferences this week: presenting a poster at #DHNB2024 in person in Iceland (where a volcano just erupted) and a talk at the #xPhi2024 virtually, both on using #LLMs in #DH humanities, zero-shot text classification & Lenin detection & why we can forget topicmodeling

1,267