06173nam 2200817 450 991045337690332120200903223051.0981-4368-71-7(CKB)2550000001191454(EBL)1611949(SSID)ssj0000982824(PQKBManifestationID)12388972(PQKBTitleCode)TC0000982824(PQKBWorkID)10987314(PQKB)11305523(MiAaPQ)EBC1611949(WSP)00008280(Au-PeEL)EBL1611949(CaPaEBR)ebr10832738(CaONFJC)MIL570877(OCoLC)869905558(EXLCZ)99255000000119145420130913h20142014 uy| 0engur|n|---|||||txtccrAdvances in digital document processing and retrieval /editors, Bidyut Baran Chaudhuri (Indian Statistical Institute, India) & Swapan Kumar Parui (Indian Statistical Institute, India)New Jersey :World Scientific,[2014]©20141 online resource (334 p.)Statistical science and interdisciplinary research,1793-6195 ;volume 13Platinum jubilee seriesDescription based upon print version of record.981-4368-70-9 1-306-39626-3 Includes bibliographical references and index.Foreword; Preface; Contents; 1. Document Image Analysis using Markovian Models: Application to Historical Documents; 1.1. Introduction; 1.2. Hidden Markov Random Field Models; 1.2.1. Theoretical foundations; 1.2.1.1. Simulated annealing; 1.2.1.2. Iterated Conditional Modes (ICM); 1.2.1.3. Highest Confidence First (HCF) algorithm; 1.2.1.4. 2D Dynamic Programming; 1.2.2. Application of MRF labelling to handwritten document segmentation; 1.2.2.1. Probability densities; 1.2.2.2. Clique potential functions; 1.2.2.3. Observations; 1.2.2.4. Decoding strategy; 1.2.3. Results; 1.2.3.1. Zone labelling1.2.3.2. Text line labelling1.2.3.3. Conclusion; 1.3. Conditional Random Field Models; 1.3.1. Proposed model; 1.3.1.1. Feature functions; 1.3.1.2. Model inference; 1.3.1.3. Parameter learning; 1.3.2. A two level CRF model; 1.3.2.1. Observation features; 1.3.2.2. Label features; 1.3.2.3. Learning; 1.3.3. Integrating more contextual information; 1.3.3.1. Global feature function; 1.3.3.2. Combination of the information sources; 1.3.3.3. Linear combination of the information sources (impl.2); 1.3.3.4. Combination of the information sources using an MLP (impl.3); 1.3.4. Experiments and results1.4. Conclusions and OutlookAcknowledgments; References; 2. Information Just-in-Time: Going Beyond the Myth of Paperlessness; 2.1. Introduction; 2.2. Information Just-in-Time; 2.2.1. Personal Information Environment; 2.2.2. Hot/Warm/Cold Documents; 2.2.3. Proposed Approach; 2.3. Digital Pen Solution; 2.3.1. Anoto Functionality; 2.3.2. Data Entry Applications; 2.4. iJIT Collaboration Platform; 2.4.1. On-Demand Printing; 2.4.2. Hybrid Document Management System; 2.4.3. Research Notebook Application - iJITNote; 2.4.4. Future Directions; 2.5. Conclusions; Acknowledgments; References3. The Role of Document Image Analysis in Trustworthy Elections3.1. Introduction; 3.2. History; 3.3. Problems with Current Voting Technologies; 3.4. Experimental Approaches to Reliable Processing of Voting Records; 3.4.1. Statistical distribution of mark sense errors; 3.4.2. Unbiased context-free visual auditing based on ballot images; 3.4.3. Homogenous class display; 3.4.4. Unique identification of ballots; 3.4.5. Error characteristics of DRE with VVPAT; 3.4.6. Development of testing procedures for voting systems; 3.4.7. Affordances for voters with disabilities; 3.5. Some Related Efforts3.6. Concluding RemarksReferences; 4. Information Retrieval from Document Image Databases; 4.1. Introduction; 4.2. Related Work; 4.3. Word Shape Coding; 4.3.1. Word Shape Coding by Character Stroke Categorization; 4.3.2. Word Shape Coding by Character Boundary Extrema; 4.3.3. Word Shape Coding by Character Holes and Reservoirs; 4.4. Document Image Retrieval; 4.4.1. Document Vector Construction; 4.4.2. Document Similarity Measurement; 4.5. Discussions; 4.5.1. Coding Ambiguity; 4.5.2. Coding Robustness; 4.5.3. Document Similarity Measurements; 4.5.4. Coding Scheme Selection; 4.6. ConclusionReferencesFrom the participation of researchers in most important international conferences in the field, it is noted that activities in automatic document processing have been continuously growing. This book is an edited volume in Digital Document Processing where the chapters are written by several internationally renowned researchers in the domain. It will be useful for both students and researchers working on various aspects of document image analysis and recognition problems. It contains chapters on topics that are not covered by any textbook, but are more futuristic like ""Going beyond the Myth ofStatistical science and interdisciplinary research ;v. 13.Information storage and retrieval systemsDocumentationData processingDigital preservationDocument imaging systemsElectronic publishingMultimedia systemsElectronic recordsElectronic books.Information storage and retrieval systems.DocumentationData processing.Digital preservation.Document imaging systems.Electronic publishing.Multimedia systems.Electronic records.025.04Chaudhuri B. B(Bidyut Baran)948327Parui Swapan Kumar948328MiAaPQMiAaPQMiAaPQBOOK9910453376903321Advances in digital document processing and retrieval2143405UNINA