Apache Solr 4 cookbook [electronic resource] : over 100 recipes to make Apache Solr faster, more reliable, and return better results / Rafał Kuć |
Author | Kuć Rafał |
Edition | [2nd ed.] |
Published/distributed/printed | Birmingham : Packt Pub., 2013 |
Physical description | 1 online resource (328 p.) |
Classification | 025.04 |
Series | Community experience distilled |
Topical subject | Search engines - Programming; Open source software; Web search engines |
Genre/form subject | Electronic books. |
ISBN | 1-62198-883-X; 1-78216-133-3; 1-299-19858-9 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Author; Acknowledgement; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Apache Solr Configuration; Introduction; Running Solr on Jetty; Running Solr on Apache Tomcat; Installing a standalone ZooKeeper; Clustering your data; Choosing the right directory implementation; Configuring spellchecker to not use its own index; Solr cache configuration; How to fetch and index web pages; How to set up the extracting request handler; Changing the default similarity implementation; Chapter 2: Indexing Your Data; Introduction; Indexing PDF files; Generating unique fields automatically; Extracting metadata from binary files; How to properly configure Data Import Handler with JDBC; Indexing data from a database using Data Import Handler; How to import data using Data Import Handler and delta query; How to use Data Import Handler with the URL data source; How to modify data while importing with Data Import Handler; Updating a single field of your document; Handling multiple currencies; Detecting the document's language; Optimizing your primary key field indexing; Chapter 3: Analyzing Your Text Data; Introduction; Storing additional information using payloads; Eliminating XML and HTML tags from text; Copying the contents of one field to another; Changing words to other words; Splitting text by CamelCase; Splitting text by whitespace only; Making plural words singular without stemming; Lowercasing the whole string; Storing geographical points in the index; Stemming your data; Preparing text to perform an efficient trailing wildcard search; Splitting text by numbers and non-whitespace characters; Using Hunspell as a stemmer; Using your own stemming dictionary; Protecting words from being stemmed; Chapter 4: Querying Solr; Introduction; Asking for a particular field value; Sorting results by a field value; How to search for a phrase, not a single word; Boosting phrases over words; Positioning some documents over others on a query; Positioning documents with words closer to each other first; Sorting results by a distance from a point; Getting documents with only a partial match; Affecting scoring with functions; Nesting queries; Modifying returned documents; Using parent-child relationships; Ignoring typos in terms of performance; Detecting and omitting duplicate documents; Using field aliases; Returning a value of a function in the results; Chapter 5: Using the Faceting Mechanism; Introduction; Getting the number of documents with the same field value; Getting the number of documents with the same value range; Getting the number of documents matching the query and subquery; Removing filters from faceting results; Sorting faceting results in alphabetical order; Implementing the autosuggest feature using faceting; Getting the number of documents that don't have a value in the field; Having two different facet limits for two different fields in the same query; Using decision tree faceting |
Record Nr. | UNINA-9910452727703321 |
Available at: Univ. Federico II |
|
Elasticsearch server : leverage Elasticsearch to create a robust, fast, and flexible search solution with ease / Rafał Kuć, Marek Rogoziński |
Author | Kuć Rafał |
Edition | [3rd ed.] |
Published/distributed/printed | Birmingham : Packt Publishing, 2016 |
Physical description | 1 online resource (556 p.) |
Series | Community experience distilled |
Topical subject | Application software; Application software - Development; Client/server computing; Web site - Design; Electronic information resource searching |
ISBN | 1-78588-362-3 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Authors; About the Reviewer; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with Elasticsearch Cluster; Chapter 2: Indexing Your Data; Chapter 3: Searching Your Data; Chapter 4: Extending Your Querying Knowledge; Chapter 5: Extending Your Index Structure; Chapter 6: Make Your Search Better; Chapter 7: Aggregations for Data Analysis; Chapter 8: Beyond Full-text Searching; Chapter 9: Elasticsearch Cluster in Detail; Chapter 10: Administrating Your Cluster; Chapter 11: Scaling by Example; Index; Full text searching; The basics of Elasticsearch; Installing and configuring your cluster; Manipulating data with the REST API; Searching with the URI request query; Summary; Elasticsearch indexing; Mappings configuration; Batch indexing to speed up your indexing process; Introduction to segment merging; Introduction to routing; Summary; Querying Elasticsearch; Understanding the querying process; Basic queries; Compound queries; Using span queries; Choosing the right query; Summary; Filtering your results; Highlighting; Validating your queries; Sorting data; Query rewrite; Summary; Indexing tree-like structures; Indexing data that is not flat; Using nested objects; Using the parent-child relationship; Modifying your index structure with the update API; Summary; Introduction to Apache Lucene scoring; Scripting capabilities of Elasticsearch; Searching content in different languages; Influencing scores with query boosts; When does index-time boosting make sense?; Words with the same meaning; Understanding the explain information; Summary; Aggregations; Aggregation types; Pipeline aggregations; Summary; Percolator; Elasticsearch spatial capabilities; Using suggesters; The Scroll API; Summary; Understanding node discovery; The gateway and recovery modules; Templates and dynamic templates; Elasticsearch plugins; Elasticsearch caches; The update settings API; Summary; Elasticsearch time machine; Monitoring your cluster's state and health; Controlling the shard and replica allocation; Controlling cluster rebalancing; The Cat API; Warming up; Index aliasing and using it to simplify your everyday work; Summary; Hardware; Preparing a single Elasticsearch node; Horizontal expansion; Preparing the cluster for high indexing and querying throughput; Monitoring; Summary; The Lucene glossary and architecture; Input data analysis; Indexing and querying; Scoring and query relevance; Key concepts of Elasticsearch; Key concepts of the Elasticsearch infrastructure; Indexing and searching; Installing Java; Installing Elasticsearch; Running Elasticsearch; Shutting down Elasticsearch; The directory layout; Configuring Elasticsearch; The system-specific installation and configuration; Understanding the REST API; Storing data in Elasticsearch; Retrieving documents; Updating documents; Deleting documents; Versioning; Sample data; URI search; Query analysis; URI query string parameters |
Record Nr. | UNINA-9910798273203321 |
|
Elasticsearch server : leverage Elasticsearch to create a robust, fast, and flexible search solution with ease / Rafał Kuć, Marek Rogoziński |
Author | Kuć Rafał |
Edition | [3rd ed.] |
Published/distributed/printed | Birmingham : Packt Publishing, 2016 |
Physical description | 1 online resource (556 p.) |
Series | Community experience distilled |
Topical subject | Application software; Application software - Development; Client/server computing; Web site - Design; Electronic information resource searching |
ISBN | 1-78588-362-3 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Authors; About the Reviewer; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with Elasticsearch Cluster; Chapter 2: Indexing Your Data; Chapter 3: Searching Your Data; Chapter 4: Extending Your Querying Knowledge; Chapter 5: Extending Your Index Structure; Chapter 6: Make Your Search Better; Chapter 7: Aggregations for Data Analysis; Chapter 8: Beyond Full-text Searching; Chapter 9: Elasticsearch Cluster in Detail; Chapter 10: Administrating Your Cluster; Chapter 11: Scaling by Example; Index; Full text searching; The basics of Elasticsearch; Installing and configuring your cluster; Manipulating data with the REST API; Searching with the URI request query; Summary; Elasticsearch indexing; Mappings configuration; Batch indexing to speed up your indexing process; Introduction to segment merging; Introduction to routing; Summary; Querying Elasticsearch; Understanding the querying process; Basic queries; Compound queries; Using span queries; Choosing the right query; Summary; Filtering your results; Highlighting; Validating your queries; Sorting data; Query rewrite; Summary; Indexing tree-like structures; Indexing data that is not flat; Using nested objects; Using the parent-child relationship; Modifying your index structure with the update API; Summary; Introduction to Apache Lucene scoring; Scripting capabilities of Elasticsearch; Searching content in different languages; Influencing scores with query boosts; When does index-time boosting make sense?; Words with the same meaning; Understanding the explain information; Summary; Aggregations; Aggregation types; Pipeline aggregations; Summary; Percolator; Elasticsearch spatial capabilities; Using suggesters; The Scroll API; Summary; Understanding node discovery; The gateway and recovery modules; Templates and dynamic templates; Elasticsearch plugins; Elasticsearch caches; The update settings API; Summary; Elasticsearch time machine; Monitoring your cluster's state and health; Controlling the shard and replica allocation; Controlling cluster rebalancing; The Cat API; Warming up; Index aliasing and using it to simplify your everyday work; Summary; Hardware; Preparing a single Elasticsearch node; Horizontal expansion; Preparing the cluster for high indexing and querying throughput; Monitoring; Summary; The Lucene glossary and architecture; Input data analysis; Indexing and querying; Scoring and query relevance; Key concepts of Elasticsearch; Key concepts of the Elasticsearch infrastructure; Indexing and searching; Installing Java; Installing Elasticsearch; Running Elasticsearch; Shutting down Elasticsearch; The directory layout; Configuring Elasticsearch; The system-specific installation and configuration; Understanding the REST API; Storing data in Elasticsearch; Retrieving documents; Updating documents; Deleting documents; Versioning; Sample data; URI search; Query analysis; URI query string parameters |
Record Nr. | UNINA-9910821726103321 |
|
Elasticsearch server [electronic resource] : create a fast, scalable, and flexible search solution with the emerging open source search server, ElasticSearch / Rafał Kuć, Marek Rogoziński |
Author | Kuć Rafał |
Edition | [1st edition] |
Published/distributed/printed | Birmingham : Packt Pub., 2013 |
Physical description | 1 online resource (318 p.) |
Classification | 006.7 |
Other authors (persons) | Rogoziński Marek |
Series | Community experience distilled |
Topical subject | Application software |
Genre/form subject | Electronic books. |
ISBN | 1-62198-908-9; 1-299-26139-6; 1-84951-845-9 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Authors; Acknowledgement; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with ElasticSearch Cluster; What is ElasticSearch?; Index; Document; Document type; Node and cluster; Shard; Replica; Installing and configuring your cluster; Directory structure; Configuring ElasticSearch; Running ElasticSearch; Shutting down ElasticSearch; Running ElasticSearch as a system service; Data manipulation with REST API; What is REST?; Storing data in ElasticSearch; Creating a new document; Retrieving documents; Updating documents; Deleting documents; Manual index creation and mappings configuration; Index; Types; Index manipulation; Schema mapping; Type definition; Fields; Core types; Multi fields; Using analyzers; Storing a document source; All field; Dynamic mappings and templates; Type determining mechanism; Dynamic mappings; Templates; Storing templates in files; When routing does matter; How does indexing work?; How does searching work?; Routing; Routing parameters; Routing fields; Index aliasing and simplifying your everyday work using it; An alias; Creating an alias; Modifying aliases; Combining commands; Retrieving all aliases; Filtering aliases; Aliases and routing; Summary; Chapter 2: Searching Your Data; Understanding the querying and indexing process; Mappings; Data; Querying ElasticSearch; Simple query; Paging and results size; Returning the version; Limiting the score; Choosing fields we want to return; Partial fields; Using script fields; Passing parameters to script fields; Choosing the right search type (advanced); Search execution preference (advanced); Basic queries; The term query; The terms query; The match query; The Boolean match query; The phrase match query; The match phrase prefix query; The multi match query; The query string query; Lucene query syntax; Explaining the query string; Running query string query against multiple fields; The field query; The identifiers query; The prefix query; The fuzzy like this query; The fuzzy like this field query; The fuzzy query; The match all query; The wildcard query; The more like this query; The more like this field query; The range query; Query rewrite; Filtering your results; Using filters; Range filters; Exists; Missing; Script; Type; Limit; IDs; If this is not enough; bool, and, or, not filters; Named filters; Caching filters; Compound queries; The bool query; The boosting query; The constant score query; The indices query; The custom filters score query; The custom boost factor query; The custom score query; Sorting data; Default sorting; Selecting fields used for sorting; Specifying behavior for missing fields; Dynamic criteria; Collation and national characters; Using scripts; Available objects; MVEL; Other languages; Script library; Native code; Summary; Chapter 3: Extending Your Structure and Search; Indexing data that is not flat; Data; Objects; Arrays; Mappings; Final mappings; To be or not to be dynamic |
Record Nr. | UNINA-9910465386203321 |
|
Elasticsearch server [electronic resource] : create a fast, scalable, and flexible search solution with the emerging open source search server, ElasticSearch / Rafał Kuć, Marek Rogoziński |
Author | Kuć Rafał |
Edition | [1st edition] |
Published/distributed/printed | Birmingham : Packt Pub., 2013 |
Physical description | 1 online resource (318 p.) |
Classification | 006.7 |
Other authors (persons) | Rogoziński Marek |
Series | Community experience distilled |
Topical subject | Application software |
ISBN | 1-62198-908-9; 1-299-26139-6; 1-84951-845-9 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Authors; Acknowledgement; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with ElasticSearch Cluster; What is ElasticSearch?; Index; Document; Document type; Node and cluster; Shard; Replica; Installing and configuring your cluster; Directory structure; Configuring ElasticSearch; Running ElasticSearch; Shutting down ElasticSearch; Running ElasticSearch as a system service; Data manipulation with REST API; What is REST?; Storing data in ElasticSearch; Creating a new document; Retrieving documents; Updating documents; Deleting documents; Manual index creation and mappings configuration; Index; Types; Index manipulation; Schema mapping; Type definition; Fields; Core types; Multi fields; Using analyzers; Storing a document source; All field; Dynamic mappings and templates; Type determining mechanism; Dynamic mappings; Templates; Storing templates in files; When routing does matter; How does indexing work?; How does searching work?; Routing; Routing parameters; Routing fields; Index aliasing and simplifying your everyday work using it; An alias; Creating an alias; Modifying aliases; Combining commands; Retrieving all aliases; Filtering aliases; Aliases and routing; Summary; Chapter 2: Searching Your Data; Understanding the querying and indexing process; Mappings; Data; Querying ElasticSearch; Simple query; Paging and results size; Returning the version; Limiting the score; Choosing fields we want to return; Partial fields; Using script fields; Passing parameters to script fields; Choosing the right search type (advanced); Search execution preference (advanced); Basic queries; The term query; The terms query; The match query; The Boolean match query; The phrase match query; The match phrase prefix query; The multi match query; The query string query; Lucene query syntax; Explaining the query string; Running query string query against multiple fields; The field query; The identifiers query; The prefix query; The fuzzy like this query; The fuzzy like this field query; The fuzzy query; The match all query; The wildcard query; The more like this query; The more like this field query; The range query; Query rewrite; Filtering your results; Using filters; Range filters; Exists; Missing; Script; Type; Limit; IDs; If this is not enough; bool, and, or, not filters; Named filters; Caching filters; Compound queries; The bool query; The boosting query; The constant score query; The indices query; The custom filters score query; The custom boost factor query; The custom score query; Sorting data; Default sorting; Selecting fields used for sorting; Specifying behavior for missing fields; Dynamic criteria; Collation and national characters; Using scripts; Available objects; MVEL; Other languages; Script library; Native code; Summary; Chapter 3: Extending Your Structure and Search; Indexing data that is not flat; Data; Objects; Arrays; Mappings; Final mappings; To be or not to be dynamic |
Record Nr. | UNINA-9910792068203321 |
|
Elasticsearch server : create a fast, scalable, and flexible search solution with the emerging open source search server, ElasticSearch / Rafał Kuć, Marek Rogoziński |
Author | Kuć Rafał |
Edition | [1st edition] |
Published/distributed/printed | Birmingham : Packt Pub., 2013 |
Physical description | 1 online resource (318 p.) |
Classification | 006.7 |
Other authors (persons) | Rogoziński Marek |
Series | Community experience distilled |
Topical subject | Application software |
ISBN | 1-62198-908-9; 1-299-26139-6; 1-84951-845-9 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Authors; Acknowledgement; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with ElasticSearch Cluster; What is ElasticSearch?; Index; Document; Document type; Node and cluster; Shard; Replica; Installing and configuring your cluster; Directory structure; Configuring ElasticSearch; Running ElasticSearch; Shutting down ElasticSearch; Running ElasticSearch as a system service; Data manipulation with REST API; What is REST?; Storing data in ElasticSearch; Creating a new document; Retrieving documents; Updating documents; Deleting documents; Manual index creation and mappings configuration; Index; Types; Index manipulation; Schema mapping; Type definition; Fields; Core types; Multi fields; Using analyzers; Storing a document source; All field; Dynamic mappings and templates; Type determining mechanism; Dynamic mappings; Templates; Storing templates in files; When routing does matter; How does indexing work?; How does searching work?; Routing; Routing parameters; Routing fields; Index aliasing and simplifying your everyday work using it; An alias; Creating an alias; Modifying aliases; Combining commands; Retrieving all aliases; Filtering aliases; Aliases and routing; Summary; Chapter 2: Searching Your Data; Understanding the querying and indexing process; Mappings; Data; Querying ElasticSearch; Simple query; Paging and results size; Returning the version; Limiting the score; Choosing fields we want to return; Partial fields; Using script fields; Passing parameters to script fields; Choosing the right search type (advanced); Search execution preference (advanced); Basic queries; The term query; The terms query; The match query; The Boolean match query; The phrase match query; The match phrase prefix query; The multi match query; The query string query; Lucene query syntax; Explaining the query string; Running query string query against multiple fields; The field query; The identifiers query; The prefix query; The fuzzy like this query; The fuzzy like this field query; The fuzzy query; The match all query; The wildcard query; The more like this query; The more like this field query; The range query; Query rewrite; Filtering your results; Using filters; Range filters; Exists; Missing; Script; Type; Limit; IDs; If this is not enough; bool, and, or, not filters; Named filters; Caching filters; Compound queries; The bool query; The boosting query; The constant score query; The indices query; The custom filters score query; The custom boost factor query; The custom score query; Sorting data; Default sorting; Selecting fields used for sorting; Specifying behavior for missing fields; Dynamic criteria; Collation and national characters; Using scripts; Available objects; MVEL; Other languages; Script library; Native code; Summary; Chapter 3: Extending Your Structure and Search; Indexing data that is not flat; Data; Objects; Arrays; Mappings; Final mappings; To be or not to be dynamic |
Record Nr. | UNINA-9910815540103321 |
|
Solr cookbook : solve real-time problems related to Apache Solr 4.x and 5.0 effectively with the help of over 100 easy-to-follow recipes / Rafał Kuć |
Author | Kuć Rafał |
Edition | [Third edition.] |
Published/distributed/printed | Birmingham, England : Packt Publishing, 2015 |
Physical description | 1 online resource (356 p.) |
Classification | 005.758 |
Topical subject | Search engines - Programming; Web search engines |
Genre/form subject | Electronic books. |
ISBN | 1-78355-316-2 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Apache Solr Configuration; Introduction; Running Solr on a standalone Jetty; Installing ZooKeeper for SolrCloud; Migrating configuration from master-slave to SolrCloud; Choosing the proper directory configuration; Configuring the Solr spellchecker; Using Solr in a schemaless mode; Limiting I/O usage; Using core discovery; Configuring SolrCloud for NRT use cases; Configuring SolrCloud for high-indexing use cases; Configuring SolrCloud for high-querying use cases; Configuring the Solr heartbeat mechanism; Changing similarity; Chapter 2: Indexing Your Data; Introduction; Indexing PDF files; Counting the number of fields; Using parsing update processors to parse data; Using scripting update processors to modify documents; Indexing data from a database using Data Import Handler; Incremental imports with DIH; Transforming data when using DIH; Indexing multiple geographical points; Updating document fields; Detecting the document language during indexation; Optimizing the primary key indexation; Handling multiple currencies; Chapter 3: Analyzing Your Text Data; Introduction; Using the enumeration type; Removing HTML tags during indexing; Storing data outside of Solr index; Using synonyms; Stemming different languages; Using nonaggressive stemmers; Using the n-gram approach to do performant trailing wildcard searches; Using position increment to divide sentences; Using patterns to replace tokens; Chapter 4: Querying Solr; Introduction; Understanding and using the Lucene query language; Using position aware queries; Using boosting with autocomplete; Phrase queries with shingles; Handling user queries without errors; Handling hierarchies with nested documents; Sorting data on the basis of function value; Controlling the number of terms needed to match; Affecting document score using function queries; Using simple nested queries; Using the Solr document query join functionality; Handling typos with n-grams; Rescoring query results; Chapter 5: Faceting; Introduction; Getting the number of documents with the same field value; Getting the number of documents with the same value range; Getting the number of documents matching the query and subquery; Removing filters from faceting results; Using decision tree faceting; Calculating faceting for relevant documents in groups; Improving faceting performance for low cardinality fields; Chapter 6: Improving Solr Performance; Introduction; Handling deep paging efficiently; Configuring the document cache; Configuring the query result cache; Configuring the filter cache; Improving Solr query performance after the start and commit operations; Lowering the memory consumption of faceting and sorting; Speeding up indexing with Solr segment merge tuning; Avoiding caching of rare filters to improve the performance |
Record Nr. | UNINA-9910464134903321 |
|
Solr cookbook : solve real-time problems related to Apache Solr 4.x and 5.0 effectively with the help of over 100 easy-to-follow recipes / Rafał Kuć |
Author | Kuć Rafał |
Edition | [Third edition.] |
Published/distributed/printed | Birmingham, England : Packt Publishing, 2015 |
Physical description | 1 online resource (356 p.) |
Classification | 005.758 |
Topical subject | Search engines - Programming; Web search engines |
ISBN | 1-78355-316-2 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Apache Solr Configuration; Introduction; Running Solr on a standalone Jetty; Installing ZooKeeper for SolrCloud; Migrating configuration from master-slave to SolrCloud; Choosing the proper directory configuration; Configuring the Solr spellchecker; Using Solr in a schemaless mode; Limiting I/O usage; Using core discovery; Configuring SolrCloud for NRT use cases; Configuring SolrCloud for high-indexing use cases; Configuring SolrCloud for high-querying use cases; Configuring the Solr heartbeat mechanism; Changing similarity; Chapter 2: Indexing Your Data; Introduction; Indexing PDF files; Counting the number of fields; Using parsing update processors to parse data; Using scripting update processors to modify documents; Indexing data from a database using Data Import Handler; Incremental imports with DIH; Transforming data when using DIH; Indexing multiple geographical points; Updating document fields; Detecting the document language during indexation; Optimizing the primary key indexation; Handling multiple currencies; Chapter 3: Analyzing Your Text Data; Introduction; Using the enumeration type; Removing HTML tags during indexing; Storing data outside of Solr index; Using synonyms; Stemming different languages; Using nonaggressive stemmers; Using the n-gram approach to do performant trailing wildcard searches; Using position increment to divide sentences; Using patterns to replace tokens; Chapter 4: Querying Solr; Introduction; Understanding and using the Lucene query language; Using position aware queries; Using boosting with autocomplete; Phrase queries with shingles; Handling user queries without errors; Handling hierarchies with nested documents; Sorting data on the basis of function value; Controlling the number of terms needed to match; Affecting document score using function queries; Using simple nested queries; Using the Solr document query join functionality; Handling typos with n-grams; Rescoring query results; Chapter 5: Faceting; Introduction; Getting the number of documents with the same field value; Getting the number of documents with the same value range; Getting the number of documents matching the query and subquery; Removing filters from faceting results; Using decision tree faceting; Calculating faceting for relevant documents in groups; Improving faceting performance for low cardinality fields; Chapter 6: Improving Solr Performance; Introduction; Handling deep paging efficiently; Configuring the document cache; Configuring the query result cache; Configuring the filter cache; Improving Solr query performance after the start and commit operations; Lowering the memory consumption of faceting and sorting; Speeding up indexing with Solr segment merge tuning; Avoiding caching of rare filters to improve the performance |
Record Nr. | UNINA-9910788047903321 |
|
Solr cookbook : solve real-time problems related to Apache Solr 4.x and 5.0 effectively with the help of over 100 easy-to-follow recipes / Rafał Kuć |
Author | Kuć Rafał |
Edition | [Third edition.] |
Published/distributed/printed | Birmingham, England : Packt Publishing, 2015 |
Physical description | 1 online resource (356 p.) |
Classification | 005.758 |
Topical subject | Search engines - Programming; Web search engines |
ISBN | 1-78355-316-2 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Apache Solr Configuration; Introduction; Running Solr on a standalone Jetty; Installing ZooKeeper for SolrCloud; Migrating configuration from master-slave to SolrCloud; Choosing the proper directory configuration; Configuring the Solr spellchecker; Using Solr in a schemaless mode; Limiting I/O usage; Using core discovery; Configuring SolrCloud for NRT use cases; Configuring SolrCloud for high-indexing use cases; Configuring SolrCloud for high-querying use cases; Configuring the Solr heartbeat mechanism; Changing similarity; Chapter 2: Indexing Your Data; Introduction; Indexing PDF files; Counting the number of fields; Using parsing update processors to parse data; Using scripting update processors to modify documents; Indexing data from a database using Data Import Handler; Incremental imports with DIH; Transforming data when using DIH; Indexing multiple geographical points; Updating document fields; Detecting the document language during indexation; Optimizing the primary key indexation; Handling multiple currencies; Chapter 3: Analyzing Your Text Data; Introduction; Using the enumeration type; Removing HTML tags during indexing; Storing data outside of Solr index; Using synonyms; Stemming different languages; Using nonaggressive stemmers; Using the n-gram approach to do performant trailing wildcard searches; Using position increment to divide sentences; Using patterns to replace tokens; Chapter 4: Querying Solr; Introduction; Understanding and using the Lucene query language; Using position aware queries; Using boosting with autocomplete; Phrase queries with shingles; Handling user queries without errors; Handling hierarchies with nested documents; Sorting data on the basis of function value; Controlling the number of terms needed to match; Affecting document score using function queries; Using simple nested queries; Using the Solr document query join functionality; Handling typos with n-grams; Rescoring query results; Chapter 5: Faceting; Introduction; Getting the number of documents with the same field value; Getting the number of documents with the same value range; Getting the number of documents matching the query and subquery; Removing filters from faceting results; Using decision tree faceting; Calculating faceting for relevant documents in groups; Improving faceting performance for low cardinality fields; Chapter 6: Improving Solr Performance; Introduction; Handling deep paging efficiently; Configuring the document cache; Configuring the query result cache; Configuring the filter cache; Improving Solr query performance after the start and commit operations; Lowering the memory consumption of faceting and sorting; Speeding up indexing with Solr segment merge tuning; Avoiding caching of rare filters to improve the performance |
Record Nr. | UNINA-9910818575903321 |
|