About the Book
Lucene is a rare gem in the Apache Jakarta world?a highly scalable, fast, and pure Java search engine. Its initial attractions are its performance, its simplicity, and its disarming ease-of-use. But there are many caveats, best practices, and examples that users are looking for that are not yet well documented. The lucene-user e-mail list is very active and helpful, but many users seek more guidance and examples. Lucene in Action describes what Lucene is and how it works and most importantly how it can be used in a variety of real-world use cases, such at Nutch. Nutch?an open-source project designed to index the internet very much like Google?is built upon Lucene.
Lucene in Action provides readers with best practices, tried and true from the field, including:
- Understanding and solving? analysis paralysis?
- Advanced searching techniques, including filtering and custom query parsing techniques.
- Handling document types such as Word, PDF, HTML, XML, and others.
About the Author
Erik Hatcher Co-author of Manning's award-winning 'Java Devlopment with Ant' Frequent speaker at industry events including JavaOne (upcoming; 2004), OSCON, JUG meetings, and the No Fluff, Just Stuff symposiums; co-author of many articles published at java.net, JavaPro, and developerWorks; Apache Software Foundation member and committer on several projects including Ant, Lucene, and Tapestry.
Otis Gospodnetiæ Active Apache Jakarta committer, a member of Apache Jakarta Project Management Committee, a developer of Lucene and maintainer of the jGuru's Lucene FAQ, and author of several articles published on O'Reilly Network, IBM developerWorks, and ?interChange?, the International SGML/XML User Group publication.