Apache Hadoop 3 quick start guide : learn about big data processing and analytics / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [First edition] |
Pubbl/distr/stampa | London, England : , : Packt Publishing, Limited, , [2018] |
Descrizione fisica | 1 online resource (220 pages) |
Disciplina | 004.36 |
Soggetto topico |
Cloud computing
Electronic data processing - Distributed processing - Management |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910795325303321 |
Karambelkar Hrishikesh Vijay | ||
London, England : , : Packt Publishing, Limited, , [2018] | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Apache Hadoop 3 quick start guide : learn about big data processing and analytics / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [First edition] |
Pubbl/distr/stampa | London, England : , : Packt Publishing, Limited, , [2018] |
Descrizione fisica | 1 online resource (220 pages) |
Disciplina | 004.36 |
Soggetto topico |
Cloud computing
Electronic data processing - Distributed processing - Management |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910814241203321 |
Karambelkar Hrishikesh Vijay | ||
London, England : , : Packt Publishing, Limited, , [2018] | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [1st edition] |
Pubbl/distr/stampa | Bradford, England : , : Packt Publishing, , 2014 |
Descrizione fisica | 1 online resource (298 p.) |
Disciplina | 005.758 |
Collana | Community experience distilled |
Soggetto topico | Search engines - Programming |
Soggetto genere / forma | Electronic books. |
ISBN | 1-78398-175-X |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding Apache Solr; Challenges in enterprise search; Understanding Apache Solr; Features of Apache Solr; Solr for end users; Powerful full text search; Search through rich information; Results ranking, pagination, and sorting; Facets for better browsing experience; Advanced search capabilities; Administration; Apache Solr architecture; Storage; Solr application; Integration; Client APIs and SolrJ client; Other interfaces; Practical use cases for Apache Solr
Enterprise search for a job search agencyProblem statement; Approach; Enterprise search for energy industry; Problem statement; Approach; Summary; Chapter 2: Getting Started with Apache Solr; Setting up Apache Solr; Prerequisites; Running Solr on Jetty; Running Solr on Tomcat; Solr administration; What's next?; Common problems and solution; Understanding the Solr structure; The Solr home directory structure; Solr navigation; Configuring the Apache Solr for enterprise; Defining a Solr schema; Solr fields; Dynamic Fields in Solr; Copying the fields; Field types Other important elements in the Solr schemaConfiguring Solr parameters; solr.xml and Solr core; solrconfig.xml; The Solr plugin; Other configurations; Understanding SolrJ; Summary; Chapter 3: Analyzing Data with Apache Solr; Understanding enterprise data; Categorizing by characteristics; Categorizing by access pattern; Categorizing by data formats; Loading data using native handlers; Quick and simple data loading - post tool; Working with JSON, XML, and CSV; Handling JSON data; Working with CSV data; Working with XML data; Working with rich documents; Understanding Apache Tika Using Solr Cell (ExtractingRequestHandler)Adding metadata to your rich documents; Importing structured data from the database; Configuring the data source; Importing data in Solr; Full import; Delta import; Loading RDBMS tables in Solr; Advanced topics with Solr; Deduplication; Extracting information from scanned documents; Searching through images using LIRE; Summary; Chapter 4: Designing Enterprise Search; Designing aspects for enterprise search; Identifying requirements; Matching user expectations through relevance; Access to searched entities and user interface Improving search performance and ensuring instance scalabilityWorking with applications through federated search; Other differentiators - mobiles, linguistic search, and security; Enterprise search data-processing patterns; Standalone search engine server; Distributed enterprise search pattern; The replicated enterprise search pattern; Distributed and replicated; Data integrating pattern for search; Data import by enterprise search; Applications pushing data; Middleware-based integration; Case study - designing an enterprise knowledge repository search for software IT services Gathering requirements |
Record Nr. | UNINA-9910464624703321 |
Karambelkar Hrishikesh Vijay | ||
Bradford, England : , : Packt Publishing, , 2014 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [1st edition] |
Pubbl/distr/stampa | Bradford, England : , : Packt Publishing, , 2014 |
Descrizione fisica | 1 online resource (298 p.) |
Disciplina | 005.758 |
Collana | Community experience distilled |
Soggetto topico | Search engines - Programming |
ISBN | 1-78398-175-X |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding Apache Solr; Challenges in enterprise search; Understanding Apache Solr; Features of Apache Solr; Solr for end users; Powerful full text search; Search through rich information; Results ranking, pagination, and sorting; Facets for better browsing experience; Advanced search capabilities; Administration; Apache Solr architecture; Storage; Solr application; Integration; Client APIs and SolrJ client; Other interfaces; Practical use cases for Apache Solr
Enterprise search for a job search agencyProblem statement; Approach; Enterprise search for energy industry; Problem statement; Approach; Summary; Chapter 2: Getting Started with Apache Solr; Setting up Apache Solr; Prerequisites; Running Solr on Jetty; Running Solr on Tomcat; Solr administration; What's next?; Common problems and solution; Understanding the Solr structure; The Solr home directory structure; Solr navigation; Configuring the Apache Solr for enterprise; Defining a Solr schema; Solr fields; Dynamic Fields in Solr; Copying the fields; Field types Other important elements in the Solr schemaConfiguring Solr parameters; solr.xml and Solr core; solrconfig.xml; The Solr plugin; Other configurations; Understanding SolrJ; Summary; Chapter 3: Analyzing Data with Apache Solr; Understanding enterprise data; Categorizing by characteristics; Categorizing by access pattern; Categorizing by data formats; Loading data using native handlers; Quick and simple data loading - post tool; Working with JSON, XML, and CSV; Handling JSON data; Working with CSV data; Working with XML data; Working with rich documents; Understanding Apache Tika Using Solr Cell (ExtractingRequestHandler)Adding metadata to your rich documents; Importing structured data from the database; Configuring the data source; Importing data in Solr; Full import; Delta import; Loading RDBMS tables in Solr; Advanced topics with Solr; Deduplication; Extracting information from scanned documents; Searching through images using LIRE; Summary; Chapter 4: Designing Enterprise Search; Designing aspects for enterprise search; Identifying requirements; Matching user expectations through relevance; Access to searched entities and user interface Improving search performance and ensuring instance scalabilityWorking with applications through federated search; Other differentiators - mobiles, linguistic search, and security; Enterprise search data-processing patterns; Standalone search engine server; Distributed enterprise search pattern; The replicated enterprise search pattern; Distributed and replicated; Data integrating pattern for search; Data import by enterprise search; Applications pushing data; Middleware-based integration; Case study - designing an enterprise knowledge repository search for software IT services Gathering requirements |
Record Nr. | UNINA-9910786778203321 |
Karambelkar Hrishikesh Vijay | ||
Bradford, England : , : Packt Publishing, , 2014 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [1st edition] |
Pubbl/distr/stampa | Bradford, England : , : Packt Publishing, , 2014 |
Descrizione fisica | 1 online resource (298 p.) |
Disciplina | 005.758 |
Collana | Community experience distilled |
Soggetto topico | Search engines - Programming |
ISBN | 1-78398-175-X |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding Apache Solr; Challenges in enterprise search; Understanding Apache Solr; Features of Apache Solr; Solr for end users; Powerful full text search; Search through rich information; Results ranking, pagination, and sorting; Facets for better browsing experience; Advanced search capabilities; Administration; Apache Solr architecture; Storage; Solr application; Integration; Client APIs and SolrJ client; Other interfaces; Practical use cases for Apache Solr
Enterprise search for a job search agencyProblem statement; Approach; Enterprise search for energy industry; Problem statement; Approach; Summary; Chapter 2: Getting Started with Apache Solr; Setting up Apache Solr; Prerequisites; Running Solr on Jetty; Running Solr on Tomcat; Solr administration; What's next?; Common problems and solution; Understanding the Solr structure; The Solr home directory structure; Solr navigation; Configuring the Apache Solr for enterprise; Defining a Solr schema; Solr fields; Dynamic Fields in Solr; Copying the fields; Field types Other important elements in the Solr schemaConfiguring Solr parameters; solr.xml and Solr core; solrconfig.xml; The Solr plugin; Other configurations; Understanding SolrJ; Summary; Chapter 3: Analyzing Data with Apache Solr; Understanding enterprise data; Categorizing by characteristics; Categorizing by access pattern; Categorizing by data formats; Loading data using native handlers; Quick and simple data loading - post tool; Working with JSON, XML, and CSV; Handling JSON data; Working with CSV data; Working with XML data; Working with rich documents; Understanding Apache Tika Using Solr Cell (ExtractingRequestHandler)Adding metadata to your rich documents; Importing structured data from the database; Configuring the data source; Importing data in Solr; Full import; Delta import; Loading RDBMS tables in Solr; Advanced topics with Solr; Deduplication; Extracting information from scanned documents; Searching through images using LIRE; Summary; Chapter 4: Designing Enterprise Search; Designing aspects for enterprise search; Identifying requirements; Matching user expectations through relevance; Access to searched entities and user interface Improving search performance and ensuring instance scalabilityWorking with applications through federated search; Other differentiators - mobiles, linguistic search, and security; Enterprise search data-processing patterns; Standalone search engine server; Distributed enterprise search pattern; The replicated enterprise search pattern; Distributed and replicated; Data integrating pattern for search; Data import by enterprise search; Applications pushing data; Middleware-based integration; Case study - designing an enterprise knowledge repository search for software IT services Gathering requirements |
Record Nr. | UNINA-9910820321103321 |
Karambelkar Hrishikesh Vijay | ||
Bradford, England : , : Packt Publishing, , 2014 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [Second edition.] |
Pubbl/distr/stampa | Birmingham, England : , : Packt Publishing, , 2015 |
Descrizione fisica | 1 online resource (166 p.) |
Disciplina | 004 |
Collana | Community Experience Distilled |
Soggetto topico |
Electronic data processing
Data mining Big data |
Soggetto genere / forma | Electronic books. |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Processing Big Data Using Hadoop and MapReduce; Apache Hadoop's ecosystem; Core components; Understanding Hadoop's ecosystem; Configuring Apache Hadoop; Prerequisites; Setting up ssh without passphrase; Configuring Hadoop; Running Hadoop; Setting up a Hadoop cluster; Common problems and their solutions; Summary; Chapter 2: Understanding Apache Solr; Setting up Apache Solr; Prerequisites for setting up Apache Solr; Running Apache Solr on jetty
Running Solr on other J2EE containersHello World with Apache Solr!; Understanding Solr administration; Solr navigation; Common problems and solutions; The Apache Solr architecture; Configuring Solr; Understanding the Solr structure; Defining the Solr schema; Solr fields; Dynamic fields in Solr; Copying the fields; Dealing with field types; Additional metadata configuration; Other important elements of the Solr schema; Configuration files of Apache Solr; Working with solr.xml and Solr core; Instance configuration with solrconfig.xml; Understanding the Solr plugin; Other configuration Loading data in Apache SolrExtracting request handler - Solr Cell; Understanding data import handlers; Interacting with Solr through SolrJ; Working with rich documents (Apache Tika); Querying for information in Solr; Summary; Chapter 3: Enabling Distributed Search using Apache Solr; Understanding a distributed search; Distributed search patterns; Apache Solr and distributed search; Working with SolrCloud; Why ZooKeeper?; The SolrCloud architecture; Building an enterprise distributed search using SolrCloud; Setting up SolrCloud for development; Setting up SolrCloud for production Adding a document to SolrCloudCreating shards, collections, and replicas in SolrCloud; Common problems and resolutions; Sharding algorithm and fault tolerance; Document Routing and Sharding; Shard splitting; Load balancing and fault tolerance in SolrCloud; Apache Solr and Big Data - integration with MongoDB; What is NoSQL and how is it related to Big Data?; MongoDB at glance; Installing MongoDB; Creating Solr indexes from MongoDB; Summary; Chapter 4: Big Data Search Using Hadoop and Its Ecosystem; Understanding NoSQL; Working with the Solr HDFS connector; Big data search using Katta How Katta works?Setting up the Katta cluster; Creating Katta indexes; Using Solr 1045 Patch - map-side indexing; Using Solr 1301 Patch - reduce-side indexing; Distributed search using Apache Blur; Setting up Apache Blur with Hadoop; Apache Solr and Cassandra; Working with Cassandra and Solr; Single node configuration; Integrating with multinode Cassandra; Scaling Solr through Storm; Getting along with Apache Storm; Advanced analytics with Solr; Integrating Solr and R; Summary; Chapter 5: Scaling Search Performance; Understanding the limits; Optimizing search schema Specifying default search field |
Record Nr. | UNINA-9910463842403321 |
Karambelkar Hrishikesh Vijay | ||
Birmingham, England : , : Packt Publishing, , 2015 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [Second edition.] |
Pubbl/distr/stampa | Birmingham, England : , : Packt Publishing, , 2015 |
Descrizione fisica | 1 online resource (166 p.) |
Disciplina | 004 |
Collana | Community Experience Distilled |
Soggetto topico |
Electronic data processing
Data mining Big data |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Processing Big Data Using Hadoop and MapReduce; Apache Hadoop's ecosystem; Core components; Understanding Hadoop's ecosystem; Configuring Apache Hadoop; Prerequisites; Setting up ssh without passphrase; Configuring Hadoop; Running Hadoop; Setting up a Hadoop cluster; Common problems and their solutions; Summary; Chapter 2: Understanding Apache Solr; Setting up Apache Solr; Prerequisites for setting up Apache Solr; Running Apache Solr on jetty
Running Solr on other J2EE containersHello World with Apache Solr!; Understanding Solr administration; Solr navigation; Common problems and solutions; The Apache Solr architecture; Configuring Solr; Understanding the Solr structure; Defining the Solr schema; Solr fields; Dynamic fields in Solr; Copying the fields; Dealing with field types; Additional metadata configuration; Other important elements of the Solr schema; Configuration files of Apache Solr; Working with solr.xml and Solr core; Instance configuration with solrconfig.xml; Understanding the Solr plugin; Other configuration Loading data in Apache SolrExtracting request handler - Solr Cell; Understanding data import handlers; Interacting with Solr through SolrJ; Working with rich documents (Apache Tika); Querying for information in Solr; Summary; Chapter 3: Enabling Distributed Search using Apache Solr; Understanding a distributed search; Distributed search patterns; Apache Solr and distributed search; Working with SolrCloud; Why ZooKeeper?; The SolrCloud architecture; Building an enterprise distributed search using SolrCloud; Setting up SolrCloud for development; Setting up SolrCloud for production Adding a document to SolrCloudCreating shards, collections, and replicas in SolrCloud; Common problems and resolutions; Sharding algorithm and fault tolerance; Document Routing and Sharding; Shard splitting; Load balancing and fault tolerance in SolrCloud; Apache Solr and Big Data - integration with MongoDB; What is NoSQL and how is it related to Big Data?; MongoDB at glance; Installing MongoDB; Creating Solr indexes from MongoDB; Summary; Chapter 4: Big Data Search Using Hadoop and Its Ecosystem; Understanding NoSQL; Working with the Solr HDFS connector; Big data search using Katta How Katta works?Setting up the Katta cluster; Creating Katta indexes; Using Solr 1045 Patch - map-side indexing; Using Solr 1301 Patch - reduce-side indexing; Distributed search using Apache Blur; Setting up Apache Blur with Hadoop; Apache Solr and Cassandra; Working with Cassandra and Solr; Single node configuration; Integrating with multinode Cassandra; Scaling Solr through Storm; Getting along with Apache Storm; Advanced analytics with Solr; Integrating Solr and R; Summary; Chapter 5: Scaling Search Performance; Understanding the limits; Optimizing search schema Specifying default search field |
Record Nr. | UNINA-9910788111503321 |
Karambelkar Hrishikesh Vijay | ||
Birmingham, England : , : Packt Publishing, , 2015 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar |
Autore | Karambelkar Hrishikesh Vijay |
Edizione | [Second edition.] |
Pubbl/distr/stampa | Birmingham, England : , : Packt Publishing, , 2015 |
Descrizione fisica | 1 online resource (166 p.) |
Disciplina | 004 |
Collana | Community Experience Distilled |
Soggetto topico |
Electronic data processing
Data mining Big data |
Formato | Materiale a stampa |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Processing Big Data Using Hadoop and MapReduce; Apache Hadoop's ecosystem; Core components; Understanding Hadoop's ecosystem; Configuring Apache Hadoop; Prerequisites; Setting up ssh without passphrase; Configuring Hadoop; Running Hadoop; Setting up a Hadoop cluster; Common problems and their solutions; Summary; Chapter 2: Understanding Apache Solr; Setting up Apache Solr; Prerequisites for setting up Apache Solr; Running Apache Solr on jetty
Running Solr on other J2EE containersHello World with Apache Solr!; Understanding Solr administration; Solr navigation; Common problems and solutions; The Apache Solr architecture; Configuring Solr; Understanding the Solr structure; Defining the Solr schema; Solr fields; Dynamic fields in Solr; Copying the fields; Dealing with field types; Additional metadata configuration; Other important elements of the Solr schema; Configuration files of Apache Solr; Working with solr.xml and Solr core; Instance configuration with solrconfig.xml; Understanding the Solr plugin; Other configuration Loading data in Apache SolrExtracting request handler - Solr Cell; Understanding data import handlers; Interacting with Solr through SolrJ; Working with rich documents (Apache Tika); Querying for information in Solr; Summary; Chapter 3: Enabling Distributed Search using Apache Solr; Understanding a distributed search; Distributed search patterns; Apache Solr and distributed search; Working with SolrCloud; Why ZooKeeper?; The SolrCloud architecture; Building an enterprise distributed search using SolrCloud; Setting up SolrCloud for development; Setting up SolrCloud for production Adding a document to SolrCloudCreating shards, collections, and replicas in SolrCloud; Common problems and resolutions; Sharding algorithm and fault tolerance; Document Routing and Sharding; Shard splitting; Load balancing and fault tolerance in SolrCloud; Apache Solr and Big Data - integration with MongoDB; What is NoSQL and how is it related to Big Data?; MongoDB at glance; Installing MongoDB; Creating Solr indexes from MongoDB; Summary; Chapter 4: Big Data Search Using Hadoop and Its Ecosystem; Understanding NoSQL; Working with the Solr HDFS connector; Big data search using Katta How Katta works?Setting up the Katta cluster; Creating Katta indexes; Using Solr 1045 Patch - map-side indexing; Using Solr 1301 Patch - reduce-side indexing; Distributed search using Apache Blur; Setting up Apache Blur with Hadoop; Apache Solr and Cassandra; Working with Cassandra and Solr; Single node configuration; Integrating with multinode Cassandra; Scaling Solr through Storm; Getting along with Apache Storm; Advanced analytics with Solr; Integrating Solr and R; Summary; Chapter 5: Scaling Search Performance; Understanding the limits; Optimizing search schema Specifying default search field |
Record Nr. | UNINA-9910828488603321 |
Karambelkar Hrishikesh Vijay | ||
Birmingham, England : , : Packt Publishing, , 2015 | ||
Materiale a stampa | ||
Lo trovi qui: Univ. Federico II | ||
|