Lucene in Action

Erik Hatcher and Otis Gospodnetic
  • November 2004
  • ISBN 9781932394283
  • 456 pages
  • printed in black & white
  • includes free previous edition eBook

pro $24.99 per month

  • access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
  • choose one free eBook per month to keep
  • exclusive 50% discount on all purchases

lite $19.99 per month

  • access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more


Lucene is a gem in the open-source world—a highly scalable, fast search engine. It delivers performance and is disarmingly easy to use. Lucene in Action is the authoritative guide to Lucene. It describes how to index your data, including types you definitely need to know such as MS Word, PDF, HTML, and XML. It introduces you to searching, sorting, filtering, and highlighting search results.

about the technology

Lucene powers search in surprising places—in discussion groups at Fortune 100 companies, in commercial issue trackers, in email search from Microsoft, in the Nutch web search engine (that scales to billions of pages). It is used by diverse companies including Akamai, Overture, Technorati, HotJobs, Epiphany, FedEx, Mayo Clinic, MIT, New Scientist Magazine, and many others.

about the book

Adding search to your application can be easy. With many reusable examples and good advice on best practices, Lucene in Action shows you how. And if you would like to search through Lucene in Action over the Web, you can do so using Lucene itself as the search engine--take a look at the authors' awesome Search Inside solution. Its results page resembles Google's and provides a novel yet familiar interface to the entire book and book blog.

what's inside

  • How to integrate Lucene into your applications
  • Ready-to-use framework for rich document handling
  • Case studies including Nutch, TheServerSide, jGuru, etc.
  • Lucene ports to Perl, Python, C#/.Net, and C++
  • Sorting, filtering, term vectors, multiple, and remote index searching
  • The new SpanQuery family, extending query parser, hit collecting
  • Performance testing and tuning
  • Lucene add-ons (hit highlighting, synonym lookup, and others)
  • Foreword by Doug Cutting, the inventor of Lucene

about the authors

A committer on the Ant, Lucene, and Tapestry open-source projects, Erik Hatcher is coauthor of Manning's award-winning Java Development with Ant. Otis Gospodnetic is a Lucene committer, a member of Apache Jakarta Project Management Committee, and maintainer of the jGuru's Lucene FAQ. Both authors have published numerous technical articles including several on Lucene.

...packed with examples and advice on how to effectively use this incredibly powerful tool.

Brian Goetz, Quiotix Corporation

...it unlocked for me the amazing power of Lucene.

Reece Wilton, Walt Disney Internet Group

...the code examples are useful and reusable.

Scott Ganyo, Jakarta Lucene Committer

...code samples as JUnit test cases are incredibly helpful.

Norman Richards, co-author XDoclet in Action