Apache Solr for indexing data : enhance your Solr indexing experience with advanced techniques and the built-in functionalities available in Apache Solr / Sachin Handiekar, Anshul Johri
| Author | Handiekar Sachin |
| Edition | [1st edition] |
| Publication/distribution/printing | Birmingham : Packt Publishing, 2015 |
| Physical description | 1 online resource (160 p.) |
| Series | Community experience distilled |
| Topical subject | Electronic information resource searching; Indexing - Computer programs |
| ISBN | 1-78355-324-3 |
| Format | Printed material |
| Bibliographic level | Monograph |
| Language of publication | eng |
| Contents note | Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started; Overview and installation of Solr; Installing Solr in OS X (Mac); Running Solr; Installing Solr in Windows; Installing Solr on Linux; The Solr architecture and directory structure; Solr directory structure; Cores in Solr (Multicore Solr); Summary; Chapter 2: Understanding Analyzers, Tokenizers, and Filters; Introducing analyzers; Analysis phases; Tokenizers; Standard tokenizer; Keyword tokenizer; Lowercase tokenizer; N-gram tokenizer; Filters; Lowercase filter; Synonym filter; Porter stem filter; Running your analyzer; Summary; Chapter 3: Indexing Data; Indexing data in Solr; Introducing field types; Defining fields; Defining an unique key; Copy fields and dynamic fields; Building our musicCatalogue example; Using the Solr Admin UI; Facet searching; Summary; Chapter 4: Index Data - Basic Technique and Using Index Handlers; Inserting data into Solr; Configuring UpdateRequestHandler; Indexing documents using XML; Adding and updating documents; Deleting a document; Indexing documents using JSON; Adding a single document; Adding multiple JSON documents; Sequential JSON update commands; Indexing updates using CSV; Summary; Chapter 5: Index Data Using Structured Data Source Using DIH; Indexing data from MySQL; Configuring datasource; DIH commands; Indexing data using XPath; Summary; Chapter 6: Indexing Data Using Apache Tika; Introducing Apache Tika; Configuring Apache Tika in Solr; Indexing PDF and Word documents; Summary; Chapter 7: Apache Nutch; Introducing Apache Nutch; Installing Apache Nutch; Configuring Solr with Nutch; Summary; Chapter 8: Commits, Real-Time Index Optimizations, and Atomic Updates; Understanding soft commit, optimize, and hard commit; Using atomic updates in Solr; Using RealTime Get; Summary; Chapter 9: Advanced Topics - Multilanguage, Deduplication, and Others; Multilanguage indexing; Removing duplicate documents (deduplication); Content streaming; UIMA integration with Solr; Summary; Chapter 10: Distributed Indexing; Setting up SolrCloud; The collections API; Updating configuration files; Distributed indexing and searching; Summary; Chapter 11: Case Study of Using Solr in E-Commerce; Creating an AutoSuggest feature; Facet navigation; Search filtering and sorting; Relevancy boosting; Summary; Index |
| Record Nr. | UNINA-9910798064403321 |
| Find it here: Univ. Federico II |

Apache Solr for indexing data : enhance your Solr indexing experience with advanced techniques and the built-in functionalities available in Apache Solr / Sachin Handiekar, Anshul Johri
| Author | Handiekar Sachin |
| Edition | [1st edition] |
| Publication/distribution/printing | Birmingham : Packt Publishing, 2015 |
| Physical description | 1 online resource (160 p.) |
| Series | Community experience distilled |
| Topical subject | Electronic information resource searching; Indexing - Computer programs |
| ISBN | 1-78355-324-3 |
| Format | Printed material |
| Bibliographic level | Monograph |
| Language of publication | eng |
| Contents note | Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started; Overview and installation of Solr; Installing Solr in OS X (Mac); Running Solr; Installing Solr in Windows; Installing Solr on Linux; The Solr architecture and directory structure; Solr directory structure; Cores in Solr (Multicore Solr); Summary; Chapter 2: Understanding Analyzers, Tokenizers, and Filters; Introducing analyzers; Analysis phases; Tokenizers; Standard tokenizer; Keyword tokenizer; Lowercase tokenizer; N-gram tokenizer; Filters; Lowercase filter; Synonym filter; Porter stem filter; Running your analyzer; Summary; Chapter 3: Indexing Data; Indexing data in Solr; Introducing field types; Defining fields; Defining an unique key; Copy fields and dynamic fields; Building our musicCatalogue example; Using the Solr Admin UI; Facet searching; Summary; Chapter 4: Index Data - Basic Technique and Using Index Handlers; Inserting data into Solr; Configuring UpdateRequestHandler; Indexing documents using XML; Adding and updating documents; Deleting a document; Indexing documents using JSON; Adding a single document; Adding multiple JSON documents; Sequential JSON update commands; Indexing updates using CSV; Summary; Chapter 5: Index Data Using Structured Data Source Using DIH; Indexing data from MySQL; Configuring datasource; DIH commands; Indexing data using XPath; Summary; Chapter 6: Indexing Data Using Apache Tika; Introducing Apache Tika; Configuring Apache Tika in Solr; Indexing PDF and Word documents; Summary; Chapter 7: Apache Nutch; Introducing Apache Nutch; Installing Apache Nutch; Configuring Solr with Nutch; Summary; Chapter 8: Commits, Real-Time Index Optimizations, and Atomic Updates; Understanding soft commit, optimize, and hard commit; Using atomic updates in Solr; Using RealTime Get; Summary; Chapter 9: Advanced Topics - Multilanguage, Deduplication, and Others; Multilanguage indexing; Removing duplicate documents (deduplication); Content streaming; UIMA integration with Solr; Summary; Chapter 10: Distributed Indexing; Setting up SolrCloud; The collections API; Updating configuration files; Distributed indexing and searching; Summary; Chapter 11: Case Study of Using Solr in E-Commerce; Creating an AutoSuggest feature; Facet navigation; Search filtering and sorting; Relevancy boosting; Summary; Index |
| Record Nr. | UNINA-9910815187903321 |
| Find it here: Univ. Federico II |