Record Nr.

UNINA9910483333003321

Titolo

Advances in Multimedia Information Processing -- PCM 2015 : 16th Pacific-Rim Conference on Multimedia, Gwangju, South Korea, September 16-18, 2015, Proceedings, Part I / / edited by Yo-Sung Ho, Jitao Sang, Yong Man Ro, Junmo Kim, Fei Wu

Pubbl/distr/stampa


Cham : , : Springer International Publishing : , : Imprint : Springer, , 2015

ISBN

3-319-24075-7

Edizione

[1st ed. 2015.]

Descrizione fisica

1 online resource (XXIV, 735 p. 358 illus. in color.)

Collana

Information Systems and Applications, incl. Internet/Web, and HCI, , 2946-1642 ; ; 9314

Disciplina

006.7

Soggetti

Multimedia systems

Application software

Computer vision

Pattern recognition systems

Data mining

User interfaces (Computer systems)

Human-computer interaction

Multimedia Information Systems

Computer and Information Systems Applications

Computer Vision

Automated Pattern Recognition

Data Mining and Knowledge Discovery

User Interfaces and Human Computer Interaction

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Note generali

Bibliographic Level Mode of Issuance: Monograph

Nota di contenuto

Intro -- Preface -- Organization -- Contents - Part I -- Contents - Part II -- Image and Audio Processing -- Internal Generative Mechanism Based Otsu Multilevel Thresholding Segmentation for Medical Brain Images -- Abstract -- 1 Introduction -- 2 Otsu Thresholding -- 3 The Proposed Segmentation Algorithm -- 3.1 Segmentation Scheme -- 3.2 Internal Generative Mechanism -- 3.3 Regrouping the Controversial

Pixels -- 4 Experimental Results and Analysis -- 4.1 Experimental Settings -- 4.2 Experimental Results -- 5 Conclusion -- Acknowledgements -- References -- Efficient Face Image Deblurring via Robust Face Salient Landmark Detection -- 1 Introduction -- 2 The Proposed Method -- 2.1 Motivation -- 2.2 Robust Face Landmark Detector Training -- 2.3 Salient Contour Detection -- 2.4 Blind Image Deblurring -- 3 Experimental Results -- 3.1 Experiments on Synthesised Dataset and Real Images -- 3.2 Computation Cost Comparison -- 3.3 Adaptation to Complex Face Poses -- 3.4 Rolling Guidance Face Deblurring -- 4 Conclusions -- References -- Non-uniform Deblur Using Gyro Sensor and Long/Short Exposure Image Pair -- Abstract -- 1 Introduction -- 2 The Proposed Algorithm -- 2.1 Non-uniform Blur Model -- 2.2 IMU Sensor and Camera Motion -- 2.3 The Initial Kernel Estimation Using Gyro Data -- 2.4 Kernel Refinement -- 2.5 Deconvolution -- 3 Experimental Results -- 4 Conclusion -- References -- Object Searching with Combination of Template Matching -- Abstract -- 1 Introduction -- 2 Conventional Methods -- 3 Proposed Method -- 3.1 Partition Search Area -- 3.2 Object Identification -- 3.3 Adaptive Combination Template Matching -- 4 Experiment Results -- 5 Conclusion -- Acknowledgement -- References -- Multimedia Content Analysis -- Two-Step Greedy Subspace Clustering -- 1 Introduction -- 1.1 Related Work on Subspace Clustering -- 1.2 Paper Contributions.

2 Two-Step Greedy Subspace Clustering -- 2.1 First Step: Initial Subspace Construction -- 2.2 Second Step: Greedy Subspace Clustering -- 3 Experiments -- 3.1 Motion Segmentation -- 3.2 Face Clustering -- 4 Conclusion -- References -- Iterative Collection Annotation for Sketch Recognition -- 1 Introduction -- 2 Overview of Proposed Method -- 3 Sketch Representation and Similarity Measuring Model -- 4 Semi-Supervised Clustering -- 5 Supervision Information Establishment -- 6 Experiments and Results -- 7 Conclusion -- References -- Supervised Dictionary Learning Based on Relationship Between Edges and Levels -- 1 Introduction -- 2 Our Approach -- 2.1 Classical Dictionary Learning -- 2.2 Our Supervised Dictionary Learning -- 2.3 Optimization Algorithm -- 3 Experimental -- 3.1 Data Set -- 3.2 Comparison Methods and Evaluation Criteria -- 3.3 Experimental Results and Analysis -- 4 Conclusions -- References -- Adaptive Margin Nearest Neighbor for Person Re-Identification -- 1 Introduction -- 2 Large Margin Nearest Neighbor -- 3 Adaptive Margin Nearest Neighbor -- 4 Experiment -- 4.1 Experiment Setting -- 4.2 Parameter Selection -- 4.3 Evaluation on VIPeR and CUHK -- 5 Conclusion -- References -- Compressed-Domain Based Camera Motion Estimation for Realtime Action Recognition -- 1 Introduction -- 2 Proposed Method -- 2.1 Camera Model -- 2.2 Estimation of T -- 2.3 Estimation of a -- 2.4 Camera Motion Compensation -- 2.5 Feature Descriptor Extraction -- 3 Experimental Results -- 3.1 GME Evaluation -- 3.2 Feature Descriptor Evaluation -- 4 Conclusion -- References -- Image and Audio Processing -- On the Security of Image Manipulation Forensics -- Abstract -- 1 Introduction -- 2 Understanding and Evaluation of Forensics Security -- 2.1 Image Manipulation Forensics Model -- 2.2 Security Evaluation and Attacks -- 3 A Case Study with Resampling Forging Attack.

4 Experimental Results -- 5 Conclusion -- Acknowledgements -- References -- A Sparse Representation-Based Label Pruning for Image Inpainting Using Global Optimization -- Abstract -- 1 Introduction -- 2 Proposed Label Pruning -- 2.1 Dictionary Construction for Two Target Region Cases -- 2.2 Active Label Selection by Label Pruning -- 3 Experimental Results -- 4 Conclusion -- References -- Interactive

RGB-D Image Segmentation Using Hierarchical Graph Cut and Geodesic Distance -- 1 Introduction -- 2 Related Work -- 3 Interactive RGB-D Image Segmentation -- 3.1 Preliminary of Hierarchical Graph Cut -- 3.2 Scale Space Construction -- 3.3 Integration of Color Cue and Depth Cue -- 3.4 Upscaling Boundary Refinement -- 4 Experiments -- 4.1 Datasets and Experimental Settings -- 4.2 Segmentation Accuracy Evaluation -- 4.3 Running Time Evaluation -- 5 Conclusions -- References -- Face Alignment with Two-Layer Shape Regression -- Abstract -- 1 Introduction -- 2 Overview -- 3 Main Work -- 3.1 Key Feature Points of a Component -- 3.2 Two-Layer Geometric Constraint -- 3.3 Sub-shape Selection -- 4 Experimental Results -- 4.1 Comparison with Previous Works -- 5 Conclusion and Future Work -- Acknowledgments -- References -- 3D Panning Based Sound Field Enhancement Method for Ambisonics -- 1 Introduction -- 2 Ambisonics Method -- 3 3D Panning Method with Sound Pressure Constraint at Two Ears -- 4 New Signal Distribution Method -- 4.1 Extension of Loudspeakers Structure -- 4.2 Calculation of the Input Signal -- 4.3 Signal Redistribution -- 4.4 Final Signals -- 5 Experiments -- 5.1 Objective Tests -- 5.2 Subjective Tests -- 6 Conclusion -- References -- Multimedia Applications and Services -- Multi-target Tracking via Max-Entropy Target Selection and Heterogeneous Camera Fusion -- 1 Introduction -- 2 Our Method -- 2.1 Online Multi-target Tracking.

2.2 Active Camera Scheduling -- 2.3 Static and Active Camera Tracklet Association -- 2.4 Final Trajectory Generation -- 3 Experiments -- 3.1 Experiment Setting -- 3.2 Results -- 4 Conclusion -- References -- Adaptive Multiple Appearances Model Framework for Long-Term Robust Tracking -- 1 Introduction -- 2 Related Works -- 3 The Framework of Adaptive Multiple Appearances Model Tracking -- 3.1 Dirichlet Process Mixture Model -- 3.2 Model Inference -- 3.3 AMAM Tracking -- 4 Experiments -- 4.1 The AMAM Modeling -- 4.2 Tracking System -- 5 Conclusion -- References -- On-line Sample Generation for In-air Written Chinese Character Recognition Based on Leap Motion Cont ... -- Abstract -- 1 Introduction -- 2 Writing Trajectory Capturing -- 3 Proposed Method -- 3.1 Off-line Sample Generation -- 3.2 On-line Sample Generation -- 4 Experimental Results -- 5 Conclusion -- References -- Progressive Image Segmentation Using Online Learning -- 1 Introduction -- 2 Overview of Progressive Segmentation Method -- 3 Multi-level Image Representation -- 4 Online Segmentation -- 5 Experimental Result -- 6 Conclusion -- References -- A Study of Interactive Digital Multimedia Applications -- Abstract -- 1 Background -- 2 Unlimited Channel for Communication -- 3 Conclusion -- Acknowledgments -- References -- Video Coding and Processing -- Particle Filter with Ball Size Adaptive Tracking Window and Ball Feature Likelihood Model for Ball's 3D Position Tracking in Volleyball Analysis -- Abstract -- 1 Introduction -- 2 Proposal -- 2.1 Ball Size Adaptive Tracking Window -- 2.2 Volleyball Feature Likelihood Model -- 2.3 Anti-occlusion Likelihood Measurement Method -- 3 Experiment -- 3.1 Tracking Example and Evaluation Method -- 3.2 Result and Comparison Analysis -- 4 Conclusion -- Acknowledgment -- References.

Block-Based Global and Multiple-Reference Scheme for Surveillance Video Coding -- Abstract -- 1 Introduction -- 2 Analysis -- 3 The Proposed Scheme -- 3.1 Block-Based Reference Scheme -- 3.2 Multiple-Reference Scheme -- 3.3 Global Reference Scheme -- 3.4 Costs -- 4 Experimental Results -- 5 Conclusion -- Acknowledgements -- References -- Global Object Representation of Scene Surveillance Video Based on Model and Feature Parameters -- Abstract -- 1

Introduction -- 2 Global Coding Scheme of Scene Surveillance Video -- 2.1 The Generation Mechanism and Features of Global Redundancy -- 2.2 Scene Surveillance Video Global Coding Scheme -- 3 Global Object Representation Based on Model and Feature Parameters -- 3.1 Model and Shape Representation -- 3.2 Location and Pose Representation -- 3.3 Texture Parameters Representation -- 3.4 Illumination Parameters Representation -- 4 Experiments and Results -- 4.1 Experiment 1 -- 4.2 Experiment 2 -- 5 Conclusions -- References -- A Sparse Error Compensation Based Incremental Principal Component Analysis Method for Foreground Detection -- 1 Introduction -- 2 Foreground Detection via Sparse Error Compensation Based Incremental PCA -- 2.1 The Proposed Subspace Based Foreground Detection Model -- 2.2 Two-Step Optimization Algorithm -- 3 Experiments -- 4 Conclusion -- References -- Multimedia Representation Learning -- Convolutional Neural Networks Features: Principal Pyramidal Convolution -- Abstract -- 1 Introduction -- 2 Principal Pyramidal Convolution -- 3 Experiment -- 3.1 Datasets -- 3.2 Comparisons on Different Networks -- 3.3 Comparisons on Different Dimensions -- 4 Conclusion -- References -- Gaze Shifting Kernel: Engineering Perceptually- Aware Features for Scene Categorization -- 1 Introduction -- 2 Related Work -- 3 The Proposed Gaze Shifting Kernel -- 3.1 Low-Level and High-Level Descriptions of Graphlets.

3.2 Sparsity-Constrained Graphlets Ranking.

Sommario/riassunto

The two-volume proceedings LNCS 9314 and 9315, constitute the proceedings of the 16th Pacific-Rim Conference on Multimedia, PCM 2015, held in Gwangju, South Korea, in September 2015. The total of 138 full and 32 short papers presented in these proceedings was carefully reviewed and selected from 224 submissions. The papers were organized in topical sections named: image and audio processing; multimedia content analysis; multimedia applications and services; video coding and processing; multimedia representation learning; visual understanding and recognition on big data; coding and reconstruction of multimedia data with spatial-temporal information; 3D image/video processing and applications; video/image quality assessment and processing; social media computing; human action recognition in social robotics and video surveillance; recent advances in image/video processing; new media representation and transmission technologies for emerging UHD services. .