top

  Info

  • Utilizzare la checkbox di selezione a fianco di ciascun documento per attivare le funzionalità di stampa, invio email, download nei formati disponibili del (i) record.

  Info

  • Utilizzare questo link per rimuovere la selezione effettuata.
Apache Hadoop 3 quick start guide : learn about big data processing and analytics / / Hrishikesh Vijay Karambelkar
Apache Hadoop 3 quick start guide : learn about big data processing and analytics / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [First edition]
Pubbl/distr/stampa London, England : , : Packt Publishing, Limited, , [2018]
Descrizione fisica 1 online resource (220 pages)
Disciplina 004.36
Soggetto topico Cloud computing
Electronic data processing - Distributed processing - Management
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNINA-9910795325303321
Karambelkar Hrishikesh Vijay  
London, England : , : Packt Publishing, Limited, , [2018]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Apache Hadoop 3 quick start guide : learn about big data processing and analytics / / Hrishikesh Vijay Karambelkar
Apache Hadoop 3 quick start guide : learn about big data processing and analytics / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [First edition]
Pubbl/distr/stampa London, England : , : Packt Publishing, Limited, , [2018]
Descrizione fisica 1 online resource (220 pages)
Disciplina 004.36
Soggetto topico Cloud computing
Electronic data processing - Distributed processing - Management
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNINA-9910814241203321
Karambelkar Hrishikesh Vijay  
London, England : , : Packt Publishing, Limited, , [2018]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [1st edition]
Pubbl/distr/stampa Bradford, England : , : Packt Publishing, , 2014
Descrizione fisica 1 online resource (298 p.)
Disciplina 005.758
Collana Community experience distilled
Soggetto topico Search engines - Programming
Soggetto genere / forma Electronic books.
ISBN 1-78398-175-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding Apache Solr; Challenges in enterprise search; Understanding Apache Solr; Features of Apache Solr; Solr for end users; Powerful full text search; Search through rich information; Results ranking, pagination, and sorting; Facets for better browsing experience; Advanced search capabilities; Administration; Apache Solr architecture; Storage; Solr application; Integration; Client APIs and SolrJ client; Other interfaces; Practical use cases for Apache Solr
Enterprise search for a job search agencyProblem statement; Approach; Enterprise search for energy industry; Problem statement; Approach; Summary; Chapter 2: Getting Started with Apache Solr; Setting up Apache Solr; Prerequisites; Running Solr on Jetty; Running Solr on Tomcat; Solr administration; What's next?; Common problems and solution; Understanding the Solr structure; The Solr home directory structure; Solr navigation; Configuring the Apache Solr for enterprise; Defining a Solr schema; Solr fields; Dynamic Fields in Solr; Copying the fields; Field types
Other important elements in the Solr schemaConfiguring Solr parameters; solr.xml and Solr core; solrconfig.xml; The Solr plugin; Other configurations; Understanding SolrJ; Summary; Chapter 3: Analyzing Data with Apache Solr; Understanding enterprise data; Categorizing by characteristics; Categorizing by access pattern; Categorizing by data formats; Loading data using native handlers; Quick and simple data loading - post tool; Working with JSON, XML, and CSV; Handling JSON data; Working with CSV data; Working with XML data; Working with rich documents; Understanding Apache Tika
Using Solr Cell (ExtractingRequestHandler)Adding metadata to your rich documents; Importing structured data from the database; Configuring the data source; Importing data in Solr; Full import; Delta import; Loading RDBMS tables in Solr; Advanced topics with Solr; Deduplication; Extracting information from scanned documents; Searching through images using LIRE; Summary; Chapter 4: Designing Enterprise Search; Designing aspects for enterprise search; Identifying requirements; Matching user expectations through relevance; Access to searched entities and user interface
Improving search performance and ensuring instance scalabilityWorking with applications through federated search; Other differentiators - mobiles, linguistic search, and security; Enterprise search data-processing patterns; Standalone search engine server; Distributed enterprise search pattern; The replicated enterprise search pattern; Distributed and replicated; Data integrating pattern for search; Data import by enterprise search; Applications pushing data; Middleware-based integration; Case study - designing an enterprise knowledge repository search for software IT services
Gathering requirements
Record Nr. UNINA-9910464624703321
Karambelkar Hrishikesh Vijay  
Bradford, England : , : Packt Publishing, , 2014
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [1st edition]
Pubbl/distr/stampa Bradford, England : , : Packt Publishing, , 2014
Descrizione fisica 1 online resource (298 p.)
Disciplina 005.758
Collana Community experience distilled
Soggetto topico Search engines - Programming
ISBN 1-78398-175-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding Apache Solr; Challenges in enterprise search; Understanding Apache Solr; Features of Apache Solr; Solr for end users; Powerful full text search; Search through rich information; Results ranking, pagination, and sorting; Facets for better browsing experience; Advanced search capabilities; Administration; Apache Solr architecture; Storage; Solr application; Integration; Client APIs and SolrJ client; Other interfaces; Practical use cases for Apache Solr
Enterprise search for a job search agencyProblem statement; Approach; Enterprise search for energy industry; Problem statement; Approach; Summary; Chapter 2: Getting Started with Apache Solr; Setting up Apache Solr; Prerequisites; Running Solr on Jetty; Running Solr on Tomcat; Solr administration; What's next?; Common problems and solution; Understanding the Solr structure; The Solr home directory structure; Solr navigation; Configuring the Apache Solr for enterprise; Defining a Solr schema; Solr fields; Dynamic Fields in Solr; Copying the fields; Field types
Other important elements in the Solr schemaConfiguring Solr parameters; solr.xml and Solr core; solrconfig.xml; The Solr plugin; Other configurations; Understanding SolrJ; Summary; Chapter 3: Analyzing Data with Apache Solr; Understanding enterprise data; Categorizing by characteristics; Categorizing by access pattern; Categorizing by data formats; Loading data using native handlers; Quick and simple data loading - post tool; Working with JSON, XML, and CSV; Handling JSON data; Working with CSV data; Working with XML data; Working with rich documents; Understanding Apache Tika
Using Solr Cell (ExtractingRequestHandler)Adding metadata to your rich documents; Importing structured data from the database; Configuring the data source; Importing data in Solr; Full import; Delta import; Loading RDBMS tables in Solr; Advanced topics with Solr; Deduplication; Extracting information from scanned documents; Searching through images using LIRE; Summary; Chapter 4: Designing Enterprise Search; Designing aspects for enterprise search; Identifying requirements; Matching user expectations through relevance; Access to searched entities and user interface
Improving search performance and ensuring instance scalabilityWorking with applications through federated search; Other differentiators - mobiles, linguistic search, and security; Enterprise search data-processing patterns; Standalone search engine server; Distributed enterprise search pattern; The replicated enterprise search pattern; Distributed and replicated; Data integrating pattern for search; Data import by enterprise search; Applications pushing data; Middleware-based integration; Case study - designing an enterprise knowledge repository search for software IT services
Gathering requirements
Record Nr. UNINA-9910786778203321
Karambelkar Hrishikesh Vijay  
Bradford, England : , : Packt Publishing, , 2014
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar
Scaling apache solr : optimize your searches using high-performance enterprise search repositories with apache solr / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [1st edition]
Pubbl/distr/stampa Bradford, England : , : Packt Publishing, , 2014
Descrizione fisica 1 online resource (298 p.)
Disciplina 005.758
Collana Community experience distilled
Soggetto topico Search engines - Programming
ISBN 1-78398-175-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding Apache Solr; Challenges in enterprise search; Understanding Apache Solr; Features of Apache Solr; Solr for end users; Powerful full text search; Search through rich information; Results ranking, pagination, and sorting; Facets for better browsing experience; Advanced search capabilities; Administration; Apache Solr architecture; Storage; Solr application; Integration; Client APIs and SolrJ client; Other interfaces; Practical use cases for Apache Solr
Enterprise search for a job search agencyProblem statement; Approach; Enterprise search for energy industry; Problem statement; Approach; Summary; Chapter 2: Getting Started with Apache Solr; Setting up Apache Solr; Prerequisites; Running Solr on Jetty; Running Solr on Tomcat; Solr administration; What's next?; Common problems and solution; Understanding the Solr structure; The Solr home directory structure; Solr navigation; Configuring the Apache Solr for enterprise; Defining a Solr schema; Solr fields; Dynamic Fields in Solr; Copying the fields; Field types
Other important elements in the Solr schemaConfiguring Solr parameters; solr.xml and Solr core; solrconfig.xml; The Solr plugin; Other configurations; Understanding SolrJ; Summary; Chapter 3: Analyzing Data with Apache Solr; Understanding enterprise data; Categorizing by characteristics; Categorizing by access pattern; Categorizing by data formats; Loading data using native handlers; Quick and simple data loading - post tool; Working with JSON, XML, and CSV; Handling JSON data; Working with CSV data; Working with XML data; Working with rich documents; Understanding Apache Tika
Using Solr Cell (ExtractingRequestHandler)Adding metadata to your rich documents; Importing structured data from the database; Configuring the data source; Importing data in Solr; Full import; Delta import; Loading RDBMS tables in Solr; Advanced topics with Solr; Deduplication; Extracting information from scanned documents; Searching through images using LIRE; Summary; Chapter 4: Designing Enterprise Search; Designing aspects for enterprise search; Identifying requirements; Matching user expectations through relevance; Access to searched entities and user interface
Improving search performance and ensuring instance scalabilityWorking with applications through federated search; Other differentiators - mobiles, linguistic search, and security; Enterprise search data-processing patterns; Standalone search engine server; Distributed enterprise search pattern; The replicated enterprise search pattern; Distributed and replicated; Data integrating pattern for search; Data import by enterprise search; Applications pushing data; Middleware-based integration; Case study - designing an enterprise knowledge repository search for software IT services
Gathering requirements
Record Nr. UNINA-9910820321103321
Karambelkar Hrishikesh Vijay  
Bradford, England : , : Packt Publishing, , 2014
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [Second edition.]
Pubbl/distr/stampa Birmingham, England : , : Packt Publishing, , 2015
Descrizione fisica 1 online resource (166 p.)
Disciplina 004
Collana Community Experience Distilled
Soggetto topico Electronic data processing
Data mining
Big data
Soggetto genere / forma Electronic books.
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Processing Big Data Using Hadoop and MapReduce; Apache Hadoop's ecosystem; Core components; Understanding Hadoop's ecosystem; Configuring Apache Hadoop; Prerequisites; Setting up ssh without passphrase; Configuring Hadoop; Running Hadoop; Setting up a Hadoop cluster; Common problems and their solutions; Summary; Chapter 2: Understanding Apache Solr; Setting up Apache Solr; Prerequisites for setting up Apache Solr; Running Apache Solr on jetty
Running Solr on other J2EE containersHello World with Apache Solr!; Understanding Solr administration; Solr navigation; Common problems and solutions; The Apache Solr architecture; Configuring Solr; Understanding the Solr structure; Defining the Solr schema; Solr fields; Dynamic fields in Solr; Copying the fields; Dealing with field types; Additional metadata configuration; Other important elements of the Solr schema; Configuration files of Apache Solr; Working with solr.xml and Solr core; Instance configuration with solrconfig.xml; Understanding the Solr plugin; Other configuration
Loading data in Apache SolrExtracting request handler - Solr Cell; Understanding data import handlers; Interacting with Solr through SolrJ; Working with rich documents (Apache Tika); Querying for information in Solr; Summary; Chapter 3: Enabling Distributed Search using Apache Solr; Understanding a distributed search; Distributed search patterns; Apache Solr and distributed search; Working with SolrCloud; Why ZooKeeper?; The SolrCloud architecture; Building an enterprise distributed search using SolrCloud; Setting up SolrCloud for development; Setting up SolrCloud for production
Adding a document to SolrCloudCreating shards, collections, and replicas in SolrCloud; Common problems and resolutions; Sharding algorithm and fault tolerance; Document Routing and Sharding; Shard splitting; Load balancing and fault tolerance in SolrCloud; Apache Solr and Big Data - integration with MongoDB; What is NoSQL and how is it related to Big Data?; MongoDB at glance; Installing MongoDB; Creating Solr indexes from MongoDB; Summary; Chapter 4: Big Data Search Using Hadoop and Its Ecosystem; Understanding NoSQL; Working with the Solr HDFS connector; Big data search using Katta
How Katta works?Setting up the Katta cluster; Creating Katta indexes; Using Solr 1045 Patch - map-side indexing; Using Solr 1301 Patch - reduce-side indexing; Distributed search using Apache Blur; Setting up Apache Blur with Hadoop; Apache Solr and Cassandra; Working with Cassandra and Solr; Single node configuration; Integrating with multinode Cassandra; Scaling Solr through Storm; Getting along with Apache Storm; Advanced analytics with Solr; Integrating Solr and R; Summary; Chapter 5: Scaling Search Performance; Understanding the limits; Optimizing search schema
Specifying default search field
Record Nr. UNINA-9910463842403321
Karambelkar Hrishikesh Vijay  
Birmingham, England : , : Packt Publishing, , 2015
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [Second edition.]
Pubbl/distr/stampa Birmingham, England : , : Packt Publishing, , 2015
Descrizione fisica 1 online resource (166 p.)
Disciplina 004
Collana Community Experience Distilled
Soggetto topico Electronic data processing
Data mining
Big data
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Processing Big Data Using Hadoop and MapReduce; Apache Hadoop's ecosystem; Core components; Understanding Hadoop's ecosystem; Configuring Apache Hadoop; Prerequisites; Setting up ssh without passphrase; Configuring Hadoop; Running Hadoop; Setting up a Hadoop cluster; Common problems and their solutions; Summary; Chapter 2: Understanding Apache Solr; Setting up Apache Solr; Prerequisites for setting up Apache Solr; Running Apache Solr on jetty
Running Solr on other J2EE containersHello World with Apache Solr!; Understanding Solr administration; Solr navigation; Common problems and solutions; The Apache Solr architecture; Configuring Solr; Understanding the Solr structure; Defining the Solr schema; Solr fields; Dynamic fields in Solr; Copying the fields; Dealing with field types; Additional metadata configuration; Other important elements of the Solr schema; Configuration files of Apache Solr; Working with solr.xml and Solr core; Instance configuration with solrconfig.xml; Understanding the Solr plugin; Other configuration
Loading data in Apache SolrExtracting request handler - Solr Cell; Understanding data import handlers; Interacting with Solr through SolrJ; Working with rich documents (Apache Tika); Querying for information in Solr; Summary; Chapter 3: Enabling Distributed Search using Apache Solr; Understanding a distributed search; Distributed search patterns; Apache Solr and distributed search; Working with SolrCloud; Why ZooKeeper?; The SolrCloud architecture; Building an enterprise distributed search using SolrCloud; Setting up SolrCloud for development; Setting up SolrCloud for production
Adding a document to SolrCloudCreating shards, collections, and replicas in SolrCloud; Common problems and resolutions; Sharding algorithm and fault tolerance; Document Routing and Sharding; Shard splitting; Load balancing and fault tolerance in SolrCloud; Apache Solr and Big Data - integration with MongoDB; What is NoSQL and how is it related to Big Data?; MongoDB at glance; Installing MongoDB; Creating Solr indexes from MongoDB; Summary; Chapter 4: Big Data Search Using Hadoop and Its Ecosystem; Understanding NoSQL; Working with the Solr HDFS connector; Big data search using Katta
How Katta works?Setting up the Katta cluster; Creating Katta indexes; Using Solr 1045 Patch - map-side indexing; Using Solr 1301 Patch - reduce-side indexing; Distributed search using Apache Blur; Setting up Apache Blur with Hadoop; Apache Solr and Cassandra; Working with Cassandra and Solr; Single node configuration; Integrating with multinode Cassandra; Scaling Solr through Storm; Getting along with Apache Storm; Advanced analytics with Solr; Integrating Solr and R; Summary; Chapter 5: Scaling Search Performance; Understanding the limits; Optimizing search schema
Specifying default search field
Record Nr. UNINA-9910788111503321
Karambelkar Hrishikesh Vijay  
Birmingham, England : , : Packt Publishing, , 2015
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar
Scaling big data with Hadoop and Solr : understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr / / Hrishikesh Vijay Karambelkar
Autore Karambelkar Hrishikesh Vijay
Edizione [Second edition.]
Pubbl/distr/stampa Birmingham, England : , : Packt Publishing, , 2015
Descrizione fisica 1 online resource (166 p.)
Disciplina 004
Collana Community Experience Distilled
Soggetto topico Electronic data processing
Data mining
Big data
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Processing Big Data Using Hadoop and MapReduce; Apache Hadoop's ecosystem; Core components; Understanding Hadoop's ecosystem; Configuring Apache Hadoop; Prerequisites; Setting up ssh without passphrase; Configuring Hadoop; Running Hadoop; Setting up a Hadoop cluster; Common problems and their solutions; Summary; Chapter 2: Understanding Apache Solr; Setting up Apache Solr; Prerequisites for setting up Apache Solr; Running Apache Solr on jetty
Running Solr on other J2EE containersHello World with Apache Solr!; Understanding Solr administration; Solr navigation; Common problems and solutions; The Apache Solr architecture; Configuring Solr; Understanding the Solr structure; Defining the Solr schema; Solr fields; Dynamic fields in Solr; Copying the fields; Dealing with field types; Additional metadata configuration; Other important elements of the Solr schema; Configuration files of Apache Solr; Working with solr.xml and Solr core; Instance configuration with solrconfig.xml; Understanding the Solr plugin; Other configuration
Loading data in Apache SolrExtracting request handler - Solr Cell; Understanding data import handlers; Interacting with Solr through SolrJ; Working with rich documents (Apache Tika); Querying for information in Solr; Summary; Chapter 3: Enabling Distributed Search using Apache Solr; Understanding a distributed search; Distributed search patterns; Apache Solr and distributed search; Working with SolrCloud; Why ZooKeeper?; The SolrCloud architecture; Building an enterprise distributed search using SolrCloud; Setting up SolrCloud for development; Setting up SolrCloud for production
Adding a document to SolrCloudCreating shards, collections, and replicas in SolrCloud; Common problems and resolutions; Sharding algorithm and fault tolerance; Document Routing and Sharding; Shard splitting; Load balancing and fault tolerance in SolrCloud; Apache Solr and Big Data - integration with MongoDB; What is NoSQL and how is it related to Big Data?; MongoDB at glance; Installing MongoDB; Creating Solr indexes from MongoDB; Summary; Chapter 4: Big Data Search Using Hadoop and Its Ecosystem; Understanding NoSQL; Working with the Solr HDFS connector; Big data search using Katta
How Katta works?Setting up the Katta cluster; Creating Katta indexes; Using Solr 1045 Patch - map-side indexing; Using Solr 1301 Patch - reduce-side indexing; Distributed search using Apache Blur; Setting up Apache Blur with Hadoop; Apache Solr and Cassandra; Working with Cassandra and Solr; Single node configuration; Integrating with multinode Cassandra; Scaling Solr through Storm; Getting along with Apache Storm; Advanced analytics with Solr; Integrating Solr and R; Summary; Chapter 5: Scaling Search Performance; Understanding the limits; Optimizing search schema
Specifying default search field
Record Nr. UNINA-9910828488603321
Karambelkar Hrishikesh Vijay  
Birmingham, England : , : Packt Publishing, , 2015
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui