No matter how exciting a search engine might be, it's worthless unless it has data to index. ManifoldCF is an open source framework for pulling content out of a repository and sending it on to targets such as Solr via a plug-in style, connector-based architecture. ManifoldCF includes connectors for numerous commercial and open source data sources, including Documentum, SharePoint, JDBC, and RSS.
ManifoldCF in Action is a comprehensive tutorial and reference that shows you how to integrate search with enterprise-level document repositories using ManifoldCF. The book begins with an architectural overview of ManifoldCF and how it fits into your application infrastructure. After covering the basics, it dives into examples showing typical integration tasks, such as setting up connections, using ManifoldCF as an engine under the control of another enterprise system, and integrating ManifoldCF's user-based security model with a search engine.
Although ManifoldCF provides connectors for a large number of repositories and search technologies, including Solr, FileNet, Windows shares, JDBC, Documentum, Meridio, and SharePoint, there are many for which no ManifoldCF connector yet exists. As you explore the ManifoldCF architecture, you'll learn how ManifoldCF interacts with individual connectors so that you can design your own custom connectors.
This book requires a working knowledge of Java, but no prior experience with search-based applications or ManifoldCF is needed.
Karl Wright has been developing ManifoldCF since 2006, from its roots at MetaCarta well before it became an Apache project. He has extensive experience in speech recognition and compiler development, and he is the author of Borland's Turbo Assembler. Karl holds Computer Science degrees from M.I.T. and Stanford.
geekle is based on a wordle clone.