6d ago · 15 min read · TLDR: Lucene is a search library. Its core innovation is the inverted index — a reverse map from words to documents, like the index at the back of a textbook. Documents are stored in immutable segments that Lucene merges in the background to keep que...
Feb 3 · 8 min read · Ever used the search bar in Discord to find an old message? Maybe you searched "meeting link" in your work server. Or tried to find that meme someone shared 2 years ago. Or looked for a conversation with a friend from months back. That simple search ...
Jan 19 · 4 min read · Lucene is what makes full-text search fast, scalable, and production-ready in AEM. If you’ve ever searched for content by keywords, text fragments, or metadata - you’ve used Lucene, whether you realized it or not. In this post, we’ll go one level dee...
Dec 9, 2025 · 4 min read · # Follow-up: Hybrid Search in Apache Solr is NOW Production-Ready (with 1024D vectors!) Hey everyone, A few days back I shared my experiments with hybrid search (combining traditional lexical search with vector/semantic search). Well, I've been busy,...
Dec 6, 2025 · 6 min read · Today, I want to talk about something really cool: how Apache Lucene stores and retrieves data so efficiently. We're not diving into Elasticsearch (it's built on top of Apache Lucene) but into the magic that makes Lucene so powerful for full-text se...
Oct 25, 2025 · 5 min read · In MongoDB Atlas Search (which uses Lucene under the hood), autocomplete search is built for text prediction and partial word matching, whereas a traditional filter ($regex, $text, $match) relies on basic scanning or limited index usage. 🚀 Why Atla...
Aug 21, 2025 · 4 min read · When deploying search infrastructure at scale, especially with technologies like Apache Lucene or Elasticsearch, performance tuning often goes beyond application-level optimizations. One of the most critical yet frequently overlooked system parameter...
Jan 5, 2025 · 3 min read · Elasticsearch, a powerful distributed search engine, enables users to efficiently store, search, and analyze large volumes of data in near real-time. Whether you're a developer building a search application or a data analyst working on business insig...
Oct 13, 2024 · 3 min read · Community Over Code NA 2024 (Ex ApacheCon) happened from the 7th to the 11th of October in Denver (United States of America). The Search Track was well attended, with the designated room being at standing room only capacity quite a few times. As has ...