Filter
Exclude
Time range
-
Near
Deleting for one user never breaks another user's search โ†’ Concurrent uploads are handled safely with no extra infra Vectors are shared. Access is isolated. That's the right model for multi-user RAG. #RAG #Qdrant #VectorDatabase #LLM #AIEngineering #Python
1
3
๐Ÿงฌ High-Quality Vector Embeddings & Vector Databases โ€” the critical foundation that powers accurate retrieval in all modern RAG, semantic search, and agentic systems. Just read this excellent technical white paper from @aasaitech on building reliable retrieval layers for industrial and manufacturing domains. Key highlights: โ€ข Embedding models (text-embedding-3-large/small, BGE, E5, Instructor/GTE) domain-specific fine-tuning โ€ข Smart chunking strategies rich metadata filtering (equipment, location, date, safety, document type) โ€ข Vector DB options (Pinecone, Weaviate, Milvus, Qdrant, Chroma, Redis) โ€ข 5-step semantic search pipeline with continuous feedback evaluation (Recall@K, Precision, Context Relevance) โ€ข Industrial impact: Maintenance manuals, SOPs, logs, drawings โ€” dramatically reducing hallucinations and improving reliability This is the retrieval backbone that makes everything else (CoT, agents, multimodal, long context) actually work in production edge orchestration and manufacturing AI. Full white paper infographic: x.com/aasaitech/status/20656โ€ฆ How are you handling embeddings and vector stores in your systems โ€” open-source fine-tuned models, managed DBs, or full custom industrial pipelines with heavy metadata? #VectorEmbeddings #VectorDatabase #RAG #SemanticSearch #IndustrialAI #AgenticAI #EdgeAI

20
Building AI-Powered Database Apps with @Hibernate #Vector and the Oracle AI Database 26ai โ€” Part 2 โ€” Using the Hibernate Vector module with Oracle AI Vector Search rb.gy/ab8dc2 #Java #AI #Hibernate #VectorDatabase #JavaOracleDB @OracleDatabase @JakartaEE @juarezjunior
22
Most people think LLMs remember everything you tell them. They don't. One of the biggest misconceptions in AI is confusing memory with context. When you interact with an LLM, every message is converted into tokens and placed inside a context window. Think of it as the model's working space. As long as information remains within that window, the model can use it. Once the window fills up, older information gets compressed, summarized, or dropped to make room for new context. This is why long conversations sometimes become less reliable. The model isn't actually forgetting. It simply no longer has access to some of the information it saw earlier. Memory works very differently. Suppose a user shares their preferences with an AI agent today and returns a month later. The model won't remember those preferences unless they were stored somewhere and retrieved when needed. That's why modern AI systems rely on memory layers, vector databases, retrieval pipelines, and RAG architectures. In many ways, building great AI products isn't about making models smarter. It's about ensuring the right information reaches the model at the right time. That's the difference between a good AI demo and a production-grade AI system. #AIAgents #RAG #GenerativeAI #LLM #AIEngineering #AgenticAI #VectorDatabase #SoftwareEngineering
1
8
One of the most expensive mistakes in a RAG project is choosing a vector database based on benchmarks instead of workload characteristics. The question isn't "Which vector database is best?" The question is "What failure mode can your system tolerate?" Most teams evaluate vector databases by looking at latency, recall, and throughput numbers. Those metrics matter, but they rarely determine whether your architecture succeeds in production. What actually matters is the shape of your retrieval workload. If you're building an internal knowledge assistant with a few hundred thousand documents, pgvector may be the most practical choice. Your vectors live beside your application data, backups stay simple, and your team doesn't need another distributed system to operate. If you're expecting billions of vectors, multi-tenant workloads, and aggressive latency targets, dedicated systems like Pinecone, Qdrant, or Milvus start making more sense. If metadata filtering is central to retrieval quality, database architecture often matters more than raw ANN search speed. Many production bottlenecks appear in filtering, ranking, and retrieval orchestrationโ€”not vector similarity itself. The deeper lesson is that vector databases are only one layer of retrieval quality. I've seen teams spend weeks comparing databases while ignoring chunking strategy, embedding selection, hybrid search, reranking, and evaluation pipelines. A mediocre vector database with excellent retrieval design usually outperforms a world-class vector database with poor retrieval design. Another observation: the industry is slowly moving away from treating vector databases as standalone infrastructure. More teams are asking whether vector search should be a dedicated service at all, or simply a capability inside their existing data platform. That's why the "best" database keeps changing. The real decision framework is: How much data will you store in 12 months? What latency can your users tolerate? How much operational complexity can your team support? How important are filtering, hybrid search, and governance requirements? Is vector search your core product capability or just one feature? Those answers usually narrow the field faster than any benchmark chart. For engineers running RAG systems in production: What was the first bottleneck you hitโ€”vector search itself, metadata filtering, retrieval quality, or operational complexity? #RAG #VectorDatabase #AIEngineering #GenerativeAI #SoftwareArchitecture #DataEngineering #DistributedSystems
1
58
We benchmarked 5 engines on what agents actually do with sustained writes concurrent reads: -> seekdb: 1,523 QPS, 21.7ms P99, 1.1ร— jitter. ->Next closest: 470 QPS, 53.6ms P99, 10.3ร— jitter. That gap is architecture, not config. Configs raw results: app.marketbeam.io/m/i2oThxs8โ€ฆ #seekdb #VectorDatabase #AIAgents
2
5
11
967
Vector Databases are the secret weapon behind AI that truly understands what you mean, not just what you type. bit.ly/4ungQtJ #AI #VectorDatabase #innovations #AIDevelopment
4
Getting a RAG demo to run is easy. Understanding what happens between ingestion, retrieval, and generation is harder. The Zilliz Cloud onboarding experience helps users go from sign-up to a working RAG workflow in minutes. In one guided flow, you can ingest data, run Hybrid Search, inspect retrieval results, and generate grounded answers. At the end, you get packaged RAG app code that you can adapt locally. It is built for developers who want to understand the retrieval layer by seeing it work, not by reading another setup guide. Try it here: try.zilliz.com/?utm_source=x --- ๐Ÿ‘‰ Follow @zilliz_universe for vector database and vector lakebase updates built for production AI. #Zilliz #RAG #VectorDatabase
111
Everyone is obsessed with LLMs. Almost nobody talks about the database that makes them useful. Because without retrieval, most AI applications would be surprisingly forgetful. Think about it: โŒ ChatGPT doesn't know your company's internal documents. โŒ Claude doesn't automatically know your product manuals. โŒ AI agents don't magically remember every customer interaction. They need a way to find the right information at the right time. That's exactly what Vector Databases do. They turn millions (or billions) of embeddings into searchable knowledge. When you ask a question, the model isn't searching documents. It's searching meaning. That's why Vector Databases have become one of the most important pieces of the modern AI stack. Some of the major players: โ†’ Pinecone โ†’ Weaviate โ†’ Qdrant โ†’ Milvus โ†’ Chroma โ†’ pgvector โ†’ Redis Vector โ†’ FAISS โ†’ MongoDB Atlas Vector Search โ†’ Elasticsearch โ†’ OpenSearch โ†’ LanceDB โ†’ Vespa โ†’ Azure AI Search The interesting part? The AI race isn't only about building smarter models anymore. It's about building better memory. Because the model generates the answer. But retrieval decides what the answer is based on. In many production AI systems, retrieval quality matters more than model quality. The companies winning with AI aren't always using the largest models. They're often using the best retrieval architecture. What's powering your RAG stack today? ๐Ÿ‘‡ #AI #GenAI #RAG #VectorDatabase #LLM #AIAgents #MachineLearning #DataEngineering #SoftwareEngineering #ArtificialIntelligence
8
14
81
1,794
Building AI Systems Isn't About Choosing the Best LLM Most teams spend weeks comparing GPT, Claude, Gemini, DeepSeek, Llama, and Qwen. In reality, that's usually the easiest part. The real challenge is everything around the model. A production-grade AI system typically looks like this: ๐Ÿง  LLMs The reasoning engine behind your application. Examples: โ€ข GPT โ€ข Claude โ€ข Gemini โ€ข DeepSeek โ€ข Llama โ€ข Qwen ๐Ÿ”— Frameworks Connect models to tools, workflows, and business logic. Examples: โ€ข LangChain โ€ข LlamaIndex โ€ข Haystack ๐Ÿ—„๏ธ Vector Databases Long-term memory for AI applications. Examples: โ€ข Pinecone โ€ข Qdrant โ€ข Weaviate โ€ข Chroma โ€ข Milvus ๐Ÿ“ฅ Data Extraction Pull information from websites, PDFs, documents, and knowledge bases. Examples: โ€ข FireCrawl โ€ข Crawl4AI โ€ข Docling โ€ข LlamaParse ๐Ÿค– Open Model Access Run models locally or through open providers. Examples: โ€ข Ollama โ€ข Hugging Face โ€ข Groq โ€ข Together AI ๐Ÿ”Ž Embeddings Convert text into vectors for semantic search. Examples: โ€ข OpenAI Embeddings โ€ข Voyage AI โ€ข SBERT โ€ข Nomic ๐Ÿ“Š Evaluation The most overlooked layer. Measure: โ€ข Accuracy โ€ข Relevance โ€ข Latency โ€ข Hallucinations Examples: โ€ข Ragas โ€ข TruLens โ€ข Giskard The Workflow Is Surprisingly Simple 1๏ธโƒฃ Collect data 2๏ธโƒฃ Extract and clean it 3๏ธโƒฃ Generate embeddings 4๏ธโƒฃ Store them in a vector database 5๏ธโƒฃ Retrieve relevant context 6๏ธโƒฃ Send context to the LLM 7๏ธโƒฃ Evaluate and improve continuously The Biggest Lesson I've Learned The model rarely determines whether an AI project succeeds. Data quality, retrieval quality, evaluation, and system architecture usually matter far more than whether you're using GPT, Claude, Gemini, or DeepSeek. Most AI failures are actually: โŒ Data failures โŒ Retrieval failures โŒ Architecture failures Not model failures. ๐Ÿ’ญ Which layer of the AI stack do you think is currently the most underrated? #AI #GenAI #LLM #RAG #ArtificialIntelligence #MachineLearning #DataEngineering #VectorDatabase #LangChain #LlamaIndex #AgenticAI #SoftwareArchitecture #SystemDesign #Python #Java #AWS #CloudComputing #Developers
17
18
51
2,132
๐Ÿ” Frustrated with intranet search that only finds documents with exact keywords? Traditional search fails when employees use different terms than those in your documents. RAG with vector databases enables semantic search that understands meaning, not just word matches. Learn how to integrate Milvus with Open Intranet for intelligent knowledge base search. ๐Ÿ‘‰ Read the guide: eu1.hubs.ly/H0vSYHG0 #Drupal #OpenIntranet #AI #RAG #SemanticSearch #VectorDatabase #Intranet
1
17
Vector databases vs traditional databases ๐Ÿ‘‡ ๐Ÿ“– Traditional DB: searches for words ๐Ÿง  Vector DB: searches for meaning Search for "pet" and you might get: ๐Ÿถ dog ๐Ÿ• puppy ๐Ÿฑ cat That's why vector databases power RAG and semantic search. #AI #VectorDatabase #RAG #NoSQL
5
142
๐Ÿ“ฃ ๐—ข๐˜‚๐—ฟ ๐˜„๐—ฒ๐—ฏ๐—ถ๐—ป๐—ฎ๐—ฟ ๐—ฎ๐—ฏ๐—ผ๐˜‚๐˜ ๐— ๐—ถ๐—น๐˜ƒ๐˜‚๐˜€ ๐Ÿฏ.๐Ÿฌ ๐—ถ๐˜€ ๐—ต๐—ฎ๐—ฝ๐—ฝ๐—ฒ๐—ป๐—ถ๐—ป๐—ด ๐˜๐—ผ๐—ฑ๐—ฎ๐˜†! Join us with core maintainers Li Liu and Jiang Chen for: Milvus 3.0 architecture โ†’ A clear picture of the Milvus 3.0 roadmap โ†’ How Milvus 3.0 powers Zilliz Vector Lakebase โ†’ 15โ€“20 min AMA with maintainers Time: June 8, 2026, 4:00 PM PDT ๐Ÿ”— Link: zilliz.com/event/whats-new-iโ€ฆ #Milvus #VectorDatabase #Zilliz #VectorSearch #AIInfrastructure
May 30
๐ŸŒ ๐—ช๐—ฒ๐—ฏ๐—ถ๐—ป๐—ฎ๐—ฟ | ๐—ช๐—ต๐—ฎ๐˜โ€™๐˜€ ๐—ก๐—ฒ๐˜„ ๐—ถ๐—ป ๐— ๐—ถ๐—น๐˜ƒ๐˜‚๐˜€ ๐Ÿฏ.๐Ÿฌ: ๐—Ÿ๐—ถ๐˜ƒ๐—ฒ ๐—ช๐—ฎ๐—น๐—ธ๐˜๐—ต๐—ฟ๐—ผ๐˜‚๐—ด๐—ต & ๐—”๐— ๐—”, ๐—๐˜‚๐—ป๐—ฒ ๐Ÿด ๐—ผ๐—ป๐—น๐—ถ๐—ป๐—ฒ Milvus 3.0 beta is the biggest architectural upgrade since the project began. It brings native support for indexing and querying vectors directly on your data lake, along with a query engine that goes beyond top-K search. In this live session, core maintainers Li Liu and Jiang Chen will walk through what shipped, the design choices behind it, and how Milvus 3.0 powers ๐—ญ๐—ถ๐—น๐—น๐—ถ๐˜‡ ๐—ฉ๐—ฒ๐—ฐ๐˜๐—ผ๐—ฟ ๐—Ÿ๐—ฎ๐—ธ๐—ฒ๐—ฏ๐—ฎ๐˜€๐—ฒ. ๐ŸŽค ๐—ง๐—ผ๐—ฝ๐—ถ๐—ฐ๐˜€ โ€ข What changed in Milvus 3.0 beta โ€ข Why data lake-native vector search matters โ€ข Whatโ€™s in beta now and whatโ€™s coming in GA โ€ข How Milvus 3.0 powers Zilliz Vector Lakebase โ€ข 15โ€“20 min Live AMA with core maintainers ๐Ÿ“… ๐—๐˜‚๐—ป๐—ฒ ๐Ÿด, ๐Ÿฎ๐Ÿฌ๐Ÿฎ๐Ÿฒ, ๐Ÿฐ:๐Ÿฌ๐Ÿฌ ๐—ฝ๐—บ ๐—ฃ๐——๐—ง ๐Ÿ”— Register here: zilliz.com/event/whats-new-iโ€ฆ Bring your questions about migration, performance, architecture, or anything else youโ€™d like to ask the maintainers. Looking forward to the conversation! #Milvus #VectorDatabase #Zilliz
210
SQL is no longer enough for the AI era. Enter Vector Databases. ๐Ÿ“Šโšก Learn how high-dimensional vectors are changing how apps store data and understand human meaning. Full article ๐Ÿ‘‡ globaltech112.blogspot.com/2โ€ฆ #VectorDatabase #DataScience #LLM #TechExplained #Developers

1
1
1
11
10/ AI is becoming a commodity. Reliable AI systems are not. That's where the real advantage is built. #RAG #LangChain #VectorDatabase #AIEngineering #ProductionAI
7
One more in ๐Ÿ‡บ๐Ÿ‡ฒ! As you may have observed, I don't do things only once; I keep repeating them (except a few ๐Ÿ˜œ)! I am thrilled to announce my 2nd speaking engagement in the USA ๐Ÿ‡บ๐Ÿ‡ฒ. I will be presenting at the Michigan .NET Users Group (MI.NET) on September 16, 2026, in Southfield, MI! A huge thank you to Dustin Kingen for organizing and to EPITEC for being the Platinum Sponsor! And of course, none of this is worth anything without an amazing audience. Can't wait to learn and build together with the MI.NET community! If you are in the Detroit area, come join us! RSVP here ๐Ÿ‘‰ meetup.com/midotnet/events/3โ€ฆ #RAG #GenerativeAI #AzureAI #dotNET #CosimeSimilarity #MINet #Embeddings #VectorDatabase #TechSpeaker
1
77
Day 25 ๐Ÿš€ Today was all about AI foundations ๐Ÿค– โ€ข LLMs โ€ข Vector Databases โ€ข Embeddings โ€ข High-dimensional vectors โ€ข Gemini Embedding models โ€ข Similarity search & retrieval Understanding how AI finds the right information is fascinating ๐Ÿ’ปโœจ #AI #LLM #VectorDatabase #RAG
1
16
Traditional databases store rows & tables. Vector databases store: ๐Ÿง  Meaning. Thatโ€™s how AI systems can perform: โœ… Semantic search โœ… AI retrieval โœ… Recommendation systems โœ… RAG pipelines Modern AI search increasingly depends on vector databases. #AI #VectorDatabase #LLM #MachineLearning
9
๐Ÿคฏ Google just found a way to squeeze 31GB of AI search data into only 4GB of memory. That's nearly 8x less memory usage without sacrificing search quality. The project is called TurboVec. And it's completely open source. Why does this matter? Because every AI app needs to search through data: โ†’ RAG systems โ†’ AI agents โ†’ Chatbots โ†’ Knowledge bases โ†’ Vector search engines The bigger your data gets, the more memory you need. TurboVec attacks that problem directly. According to the project: โœ… Stores vector data using dramatically less memory โœ… Faster search than FAISS in some workloads โœ… Works on Mac and Linux servers โœ… Supports LangChain & LlamaIndex โœ… Runs fully offline โœ… Python ready out of the box The interesting trend here isn't TurboVec itself. It's that open-source AI infrastructure keeps getting better, faster, and cheaper every month. What required expensive hardware a year ago can now run on a normal machine. That's a huge deal for developers building AI products. ๐Ÿ”– Bookmark this one. #AI #OpenSource #GitHub #MachineLearning #RAG #VectorDatabase #LangChain #LlamaIndex #AIAgents #Developer
1
4
391