Text mining in practice with R / / Ted Kwartler |
Autore | Kwartler Ted <1978-> |
Pubbl/distr/stampa | Chichester, England : , : Wiley, , 2017 |
Descrizione fisica | 1 online resource (309 pages) : illustrations |
Disciplina | 006.3/12 |
Collana | THEi Wiley ebooks |
Soggetto topico |
R (Computer program language)
Data mining Text processing (Computer science) |
ISBN |
1-119-28208-X
1-119-28209-8 1-119-28210-1 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | What is text mining? -- Basics of text mining -- Common text mining visualizations -- Sentiment scoring -- Hidden structures : clustering, string distance, text vectors & topic modeling -- Document classification : finding clickbait from headlines -- Predictive modeling : using text for classifying & predicting outcomes -- The OpenNLP Project -- Text sources. |
Record Nr. | UNINA-9910812215203321 |
Kwartler Ted <1978->
![]() |
||
Chichester, England : , : Wiley, , 2017 | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
Third International Conference on Knowledge Discovery and Data Mining : proceedings : 9-10 January 2010, Phuket, Thailand |
Pubbl/distr/stampa | [Place of publication not identified], : IEEE Computer Society, 2010 |
Disciplina | 006.3/12 |
Soggetto topico |
Machine learning
Data mining Engineering & Applied Sciences Computer Science |
ISBN | 0-7695-3923-8 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Altri titoli varianti | WKDD 2010 |
Record Nr. | UNISA-996209730503316 |
[Place of publication not identified], : IEEE Computer Society, 2010 | ||
![]() | ||
Lo trovi qui: Univ. di Salerno | ||
|
Too big to ignore [[electronic resource] ] : the business case for big data / / Phil Simon |
Autore | Simon Phil |
Pubbl/distr/stampa | Hoboken, N.J., : John Wiley & Sons, c2013 |
Descrizione fisica | 257p |
Disciplina | 006.3/12 |
Collana | Wiley & SAS business series |
Soggetto topico |
Business - Data processing
Data mining Database management Big data |
ISBN |
1-119-20403-8
1-118-64186-8 1-299-31571-2 1-118-64210-4 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910139239603321 |
Simon Phil
![]() |
||
Hoboken, N.J., : John Wiley & Sons, c2013 | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
Too big to ignore [[electronic resource] ] : the business case for big data / / Phil Simon |
Autore | Simon Phil |
Edizione | [1st ed.] |
Pubbl/distr/stampa | Hoboken, N.J., : John Wiley & Sons, c2013 |
Descrizione fisica | 257p |
Disciplina | 006.3/12 |
Collana | Wiley & SAS business series |
Soggetto topico |
Business - Data processing
Data mining Database management Big data |
ISBN |
1-119-20403-8
1-118-64186-8 1-299-31571-2 1-118-64210-4 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto |
Intro -- Too Big to Ignore -- Contents -- List of Tables and Figures -- Preface -- Acknowledgments -- Introduction: This Ain't Your Father's Data -- Better Car Insurance through Data -- Potholes and General Road Hazards -- Recruiting and Retention -- How Big Is Big? The Size of Big Data -- Why Now? Explaining the Big Data Revolution -- The Always-On Consumer -- The Plummeting of Technology Costs -- The Rise of Data Science -- Google and Infonomics -- The Platform Economy -- The 11/12 Watershed: Sandy and Politics -- Social Media and Other Factors -- Central Thesis of Book -- Plan of Attack -- Who Should Read This Book? -- Summary -- Notes -- Chapter 1 Data 101 and the Data Deluge -- The Beginnings: Structured Data -- Structure This! Web 2.0 and the Arrival of Big Data -- Unstructured Data -- Semi-Structured Data -- Metadata -- The Composition of Data: Then and Now -- The Current State of the Data Union -- The Enterprise and the Brave New Big Data World -- The Data Disconnect -- Big Tools and Big Opportunities -- Summary -- Notes -- Chapter 2 Demystifying Big Data -- Characteristics of Big Data -- Big Data Is Already Here -- Big Data Is Extremely Fragmented -- Big Data Is Not an Elixir -- Small Data Extends Big Data -- Big Data Is a Complement, Not a Substitute -- Big Data Can Yield Better Predictions -- Big Data Giveth-and Big Data Taketh Away -- Big Data Is Neither Omniscient Nor Precise -- Big Data Is Generally Wide, Not Long -- Big Data Is Dynamic and Largely Unpredictable -- Big Data Is Largely Consumer Driven -- Big Data Is External and "Unmanageable" in the Traditional Sense -- Big Data Is Inherently Incomplete -- Big Overlap: Big Data, Business Intelligence, and Data Mining -- Big Data Is Democratic -- The Anti-Definition: What Big Data Is Not -- Summary -- Notes -- Chapter 3 The Elements of Persuasion: Big Data Techniques.
The Big Overview -- Statistical Techniques and Methods -- Regression -- A/B Testing -- Data Visualization -- Heat Maps -- Time Series Analysis -- Automation -- Machine Learning and Intelligence -- Sensors and Nanotechnology -- RFID and NFC -- Semantics -- Natural Language Processing -- Text Analytics -- Sentiment Analysis -- Big Data and the Gang of Four -- Predictive Analytics -- Two Key Laws of Big Data -- Collaborative Filtering -- Limitations of Big Data -- Summary -- Notes -- Chapter 4 Big Data Solutions -- Projects, Applications, and Platforms -- Hadoop -- Other Data Storage Solutions -- NoSQL Databases -- NewSQL -- Columnar Databases -- Google: Following the Amazon Model? -- Websites, Start-Ups, and Web Services -- Kaggle -- Other Start-Ups -- Hardware Considerations -- The Art and Science of Predictive Analytics -- Summary -- Notes -- Chapter 5 Case Studies: The Big Rewards of Big Data -- Quantcast: A Small Big Data Company -- Steps: A Big Evolution -- Buy Your Audience -- Results -- Lessons -- Explorys: The Human Case for Big Data -- Better Healthcare through Hadoop -- Steps -- Results -- Lessons -- NASA: How Contests, Gamification, and OpenInnovation Enable Big Data -- Background -- Examples -- A Sample Challenge -- Lessons -- Summary -- Notes -- Chapter 6 Taking the Big Plunge -- Before Starting -- Infonomics Revisited -- Big Data Tools Don't Cleanse Bad Data -- The Big Question: Is the Organization Ready? -- Think Free Speech, Not Free Beer -- Starting the Journey -- Start Relatively Small and Organically -- First Aim for Little Victories -- New Employees and New Skills -- Experiment with Big Data Solutions -- Gradually Gain Acceptance throughout the Organization -- Open Your Mind -- Let the Data Model Evolve -- Tap into Existing Communities -- Realize That Big Data Is Iterative -- Avoiding the Big Pitfalls -- Big Data Is a Binary. Big Data Is an Initiative -- Big Data Is a Side Project -- There Is a Big Data Checklist -- IT Owns Big Data -- Remember the Goal -- Summary -- Notes -- Chapter 7 Big Data: Big Issues and Big Problems -- Privacy: Big Data = Big Brother? -- Big Security Concerns -- Big, Pragmatic Issues -- Big Consumer Fatigue -- Rise of the Machines: Big Employee Resistance -- Employee Revolt and the Big Paradox -- Summary -- Notes -- Chapter 8 Looking Forward: The Future of Big Data -- Predicting Pregnancy -- Big Data Is Here to Stay -- Big Data Will Evolve -- Projects and Movements -- The Vibrant Data Project -- The Data Liberation Front -- Open Data Foundation -- Big Data Will Only Get Bigger . . . and Smarter -- The Internet of Things: The Move from Active toPassive Data Generation -- Hi-Tech Oreos -- Hi-Tech Thermostats -- Smart Food and Smart Music -- Big Data: No Longer a Big Luxury -- Stasis Is Not an Option -- Summary -- Notes -- Final Thoughts -- Spreading the Big Data Gospel -- Notes -- Selected Bibliography -- About the Author -- index. |
Record Nr. | UNINA-9910812838403321 |
Simon Phil
![]() |
||
Hoboken, N.J., : John Wiley & Sons, c2013 | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
Web Information Systems and Mining [[electronic resource] ] : International Conference, WISM 2012, Chengdu, China, October 26-28, 2012, Proceedings / / edited by Wu Lee Wang, Jingsheng Lei, Gong Zhiguo, Xiangfeng Luo |
Edizione | [1st ed. 2012.] |
Pubbl/distr/stampa | Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2012 |
Descrizione fisica | 1 online resource (XVIII, 718 p. 271 illus.) |
Disciplina | 006.3/12 |
Collana | Information Systems and Applications, incl. Internet/Web, and HCI |
Soggetto topico |
Application software
Information storage and retrieval Data mining E-commerce Computer security Management information systems Computer science Information Systems Applications (incl. Internet) Information Storage and Retrieval Data Mining and Knowledge Discovery e-Commerce/e-business Systems and Data Security Management of Computing and Information Systems |
ISBN | 3-642-33469-5 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Study on the Design of Automatic Cotton Bale Inspecting Management System -- The Smallest Randi´c Index for Trees -- Design of Meridian and Acupoints Compatibility Analysis System -- Invariant Subspaces for Operators with Thick Spectra -- Voronoi Feature Selection Model Considering Variable-Scale Map’s Balance and Legibility -- A Code Dissemination Protocol of Low Energy Consumption -- Dynamic Spectrum Analysis of High-Speed Train Passenger Compartment Luggage Rack Noise -- Port-Based Composable Modeling and Simulation for Safety Critical System Testbed -- Risk Assessment Method of Radio Block Center in Fuzzy Uncertain Environment -- Learning Research in Knowledge Transfer -- A Post-filtering Technique for Enhancing Acoustic Echo Cancelation System -- Packet Dropping Schemes and Quality Evaluation for H.264 Videos at High Packet Loss Rates -- A Blocked Statistics Method Based on Directional Derivative -- Study on Cooperative Game Model of Talent Training through School-Enterprise Coalition -- Research on Internet Public Opinion Detection System Based on Domain Ontology -- A Security Analysis Model Based on Artificial Neural Network -- Social Network Analyses on Knowledge Diffusion of China’s Management Science -- A Parallel Association-Rule Mining Algorithm -- An Algorithm of Parallel Programming Design Based on Problem Domain Model -- Metadata-Aware Small Files Storage Architecture on Hadoop -- Research on the Isomorphism of the Electronic-Government and Electronic-Commerce in Support System -- On the Deployment of Wireless Sensor Networks with Regular Topology Patterns -- An Empirical Study on the Relationship among Trust and the Risky and Non-Risky Components of E-Commerce -- Process Modeling and Reengineering in the Integration Stage of Electronic Government -- Development of Vertical Industrial B2B in China: Based on Cases Study -- Personalized Recommendation System on Massive Content Processing Using Improved MFNN -- A Speaker Recognition Based Approach for Identifying Voice Spammer -- Security Access Authentication System for IPv4/IPv6 Dual-Stack Campus Network Based on IpoE -- Information Encryption Based on Virtual Optical Imaging System and Chen’s Chaos -- A New Scheme with Secure Cookie against SSLStrip Attack -- ID-Based Signatures from Lattices in the Random Oracle Model -- Strongly Secure Attribute-Based Authenticated Key Exchange with Traceability -- A New Public Key Signature Scheme Based on Multivariate Polynomials -- Comments on an Advanced Dynamic ID-Based Authentication Scheme for Cloud Computing -- Research on Security Management in Active Network Node Operating Systems -- An Integrity Verification Scheme for Multiple Replicas in Clouds -- Multi-stage Attack Detection Algorithm Based on Hidden Markov Model -- Security Analysis of a Secure and Practical Dynamic Identity-Based Remote User Authentication Scheme -- Formal Construction of Secure Information Transmission in Office Automation System -- A Novel CBCD Scheme Based on Local Features Category -- Research and Improvement on Association Rule Algorithm Based on FP-Growth -- Encrypted Remote User Authentication Scheme by Using Smart Card -- A Web Visualization System of Cyberinfrastructure Resources -- A Novel Clustering Mechanism Based on Image-Oriented Correlation Coefficient for Wireless Multimedia Sensor Networks -- Design of Underground Miner Positioning System Based on ZigBee Technology -- Metadata Management of Context Resources in Context-Aware Middleware System -- Research on Scientific Data Sharing Platform of Hydrology and Water Resources Based on Service Composition -- Study on the Scheme of Tianjin Area E-commerce Platform Construction -- Research on Chinese Hydrological Data Quality Management -- Using IoT Technologies to Resolve the Food Safety Problem – An Analysis Based on Chinese Food Standards -- Towards Better Cross-Cloud Data Integration: Using P2P and ETL Together -- Design of Intelligent Maintenance Decision-Making System for Fixed Equipment in Petrochemical Plants -- Dimensional Modeling for Landslide Monitoring Data Warehouse -- A New Fuzzy Risk Evaluation Method for Uncertain Network Public Sentiment Emergency -- Service Lifecycle Management in Distributed JBI Environment -- Graded BDI Models for Agent Architectures Based on _Lukasiewicz Logic and Propositional Dynamic Logic -- Energy Model of SARA and Its Performance Analysis -- Data Profiling for Semantic Web Data -- Checking and Handling Inconsistency of DBpedia -- Conceptual Representing of Documents and Query Expansion Based on Ontology -- Robust Web Data Extraction: A Novel Approach Based on Minimum Cost Script Edit Model -- Rule-Based Text Mining of Chinese Herbal Medicines with Patterns in Traditional Chinese Medicine for Chronic Obstructive Pulmonary Disease -- Fault Forecast of Electronic Equipment Based on ε – SVR -- Analysis and Design of Internet Monitoring System on Public Opinion Based on Cloud Computing and NLP -- Using Similes to Extract Basic Sentiments across Languages -- Automatic Summarization for Chinese Text Using Affinity Propagation Clustering and Latent Semantic Analysis -- A Webpage Deletion Algorithm Based on Hierarchical Filtering -- Research in Keyword Extraction -- Tuple Refinement Method Based on Relationship Keyword Extension -- An Agent Based Intelligent Meta Search Engine -- A Novel Image Annotation Feedback Model Based on Internet-Search -- The Design and Application of an Ancient Porcelain Online Identification Analysis System -- Web Crawler in In-Site Search -- A Novel Shark-Search Algorithm for Theme Crawler -- A Framework of Online Proxy-Based Web Prefetching -- Mapping the Intellectual Structure by Co-word: A Case of International Management Science -- Study on Multi-sensor Information Fusion Technology in the Dynamic Monitoring of Coal Mine Roof -- Detecting Hot Topics in Chinese Microblog Streams Based on Frequent Patterns Mining -- KACTL: Knowware Based Automated Construction of a Treelike Library from Web Documents -- Associating Labels and Elements of Deep Web Query Interface Based on DOM -- Design and Implementation of the Online Shopping System -- System Development of Residence Property Management Based on WEB -- An Intelligent Metadata Extraction Approach Based on Programming by Demonstration -- OF-NEDL: An OpenFlow Networking Experiment Description Language Based on XML -- Structural Similarity Evaluation of XML Documents Based on Basic Statistics -- An XML Data Query Method Based on Structure-Encoded. |
Record Nr. | UNISA-996465481603316 |
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2012 | ||
![]() | ||
Lo trovi qui: Univ. di Salerno | ||
|