Happy 20 + 17: Get 37% off select new releases! Browse now >>

Apache Solr

A Practical Approach to Enterprise Search

Authors: Shahi, Dikshant

  • 1. Focuses on search concepts and challenges and explains different ways to achieve it in Solr, making it timeless and not specific to any particular version.
  • 2. Offers information retrieval concepts that lie at the core of any modern search engine. This is a must-know concept for effective search engine development, something that is generally ignored by other books.
  • 3. Explains customizable components of Solr along with practical examples from industry. This will help the reader extend Solr for their unique search requirements.
  • 4. Provides detailed explanations of integrating Solr with other products that are important and often used together, Tika, Zookeeper, and OpenNLP—again something other books don’t cover.
  • 5. Foundation of Apache Solr will be the only book covering latest version 5.0 of Apache Solr.
see more benefits

Buy this book

eBook $34.99
price for USA
  • ISBN 978-1-4842-1070-3
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $44.99
price for USA
  • ISBN 978-1-4842-1071-0
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this book

Build an enterprise search engine using Apache Solr: index and search documents; ingest data from varied sources; apply various text processing techniques; utilize different search capabilities; and customize Solr to retrieve the desired results. Apache Solr: A Practical Approach to Enterprise Search explains each essential concept-backed by practical and industry examples--to help you attain expert-level knowledge.

The book, which assumes a basic knowledge of Java, starts with an introduction to Solr, followed by steps to setting it up, indexing your first set of documents, and searching them. It then introduces you to information retrieval and its implementation in Apache Solr; this will help you understand your search problem, decide the approach to build an effective solution, and use various metrics to evaluate the results.

The book next covers the schema design and techniques to build a text analysis chain for cleansing, normalizing and enriching your documents and addressing different types of search queries. It describes various popular matching techniques which are generally applied to improve the precision and recall of searches.

You will learn the end-to-end process of data ingestion from varied sources, metadata extraction, pre-processing and transformation of content, various search components, query parsers and other advanced search capabilities.

After covering out-of-the-box features, Solr expert Dikshant Shahi dives into ways you can customize Solr for your business and its specific requirements, along with ways to plug in your own components. Most important, you will learn about implementations for Solr scoring, factors affecting the document score, and tuning the score for the application at hand. The book explains why textual scoring is not sufficient for practical ranking of documents and ways to integrate real-world factors for contributing to the document ranking.

You'll see how to influence user experience by providing suggestions and recommendations. You'll also see integration of Solr with important related technologies such as OpenNLP and Tika. Additionally, you will learn about scaling Solr using SolrCloud.

This book concludes with coverage of semantic search capabilities, which is crucial for taking the search experience to the next level. By the end of Apache Solr, you will be proficient in designing and developing your search engine. 

About the authors

Dikshant Shahi manages the search and platforms team at OnMobile Global Limited. He has been responsible for developing several vertical search engines for categories including music metadata, voice, audio fingerprinting, channel intelligence, log file processing, building analytics, finding deals like Groupon etc. He has also been responsible for handling multi-lingual contents, natural language processing and recommendation. Shahi specializes in Search Engine, Information Retrieval, Data Extraction and Analysis, Application Development, Web Services, and Mobile Applications.    

Table of contents (11 chapters)

Buy this book

eBook $34.99
price for USA
  • ISBN 978-1-4842-1070-3
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $44.99
price for USA
  • ISBN 978-1-4842-1071-0
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.

Services for this book

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Apache Solr
Book Subtitle
A Practical Approach to Enterprise Search
Authors
Copyright
2015
Publisher
Apress
Copyright Holder
Dikshant Shahi
eBook ISBN
978-1-4842-1070-3
DOI
10.1007/978-1-4842-1070-3
Softcover ISBN
978-1-4842-1071-0
Edition Number
1
Number of Pages
XXVI, 299
Number of Illustrations and Tables
56 b/w illustrations
Topics