Pubbl/distr/stampa	Cham, Switzerland : , : Springer, , [2022]
Descrizione fisica	1 online resource (466 pages)
Disciplina	929.605
Collana	Lecture Notes in Computer Science
Soggetto topico	Computer vision
ISBN	3-031-20716-5
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Intro -- Preface -- Organization -- Keynote Talks -- Towards Scaling Up GANs -- Sensible Machine Learning for Geometry -- Designing Augmented Reality for the Future of Work -- The Future of Visual Computing via Foundation Models (Banquet Keynote Talk) -- 3D Reconstruction: Leveraging Synthetic Data for Lightweight Reconstruction -- Human-AI Interaction in Visual Analytics: Designing for the "Two Black Boxes" Problem -- Contents - Part II -- Contents - Part I -- ST: Neuro-inspired Artificial Intelligence -- Brain Shape Correspondence Analysis Using Functional Maps -- 1 Introduction -- 2 Materials and Methods -- 2.1 Database -- 2.2 Methodology -- 3 Results -- 3.1 First Experiment -- 3.2 Second Experiment -- 3.3 Third Experiment -- 4 Conclusions -- References -- Biomimetic Oculomotor Control with Spiking Neural Networks -- 1 Introduction -- 2 Related Work -- 3 Eye Model and Neuromuscular Oculomotor Controller -- 4 Spiking Neurons -- 4.1 Encoding the Input Signals -- 4.2 Outputs -- 5 The SLiNet Model -- 5.1 Architecture -- 5.2 Training -- 6 Experiments -- 6.1 Eye Movements -- 6.2 Comparison to Human Eye Movements -- 7 Conclusions -- References -- Border Ownership, Category Selectivity and Beyond -- 1 Introduction -- 2 Implementation -- 2.1 Border-Ownership Coding Method -- 2.2 Category-Selective Coding Method -- 2.3 TcNet -- 3 Results -- 3.1 Datasets -- 3.2 Statistic Evaluation Criteria -- 4 Discussion -- 4.1 T-Junctions and Other 'KEY' Points -- 4.2 Global Context Awareness -- 4.3 Early Object Representation, 'PRoto-Object' -- 4.4 Relation to Biological Vision Systems -- 5 Summary -- References -- Sparse Kernel Transfer Learning -- 1 Introduction -- 2 Background -- 2.1 Background in Convolutional Neural Networks -- 2.2 Background in Sparse Coding -- 3 Methodology -- 3.1 Dictionary Learning -- 3.2 Initialization Techniques -- 3.3 Datasets. 3.4 Kernel Transfer Learning -- 4 Experiments and Results -- 4.1 Comparison with Other Initialization Methods -- 4.2 Learning with Less Labels -- 4.3 Breast Cancer Detection -- 4.4 Intepretability and Complexity -- 5 Conclusion -- References -- Applications -- Photobombing Removal Benchmarking -- 1 Introduction -- 2 Related Work -- 2.1 Traditional Methods -- 2.2 Deep Learning-based Methods -- 3 Photobombing Removal Benchmark -- 3.1 Benchmarking Dataset -- 3.2 Benchmarking Methods -- 4 Experiments -- 4.1 Performance Metrics -- 4.2 Experimental Results -- 5 Conclusion and Future Works -- References -- Automatic Detection and Recognition of Products and Planogram Conformity Analysis in Real Time on Store Shelves -- 1 Introduction -- 1.1 Features for Detection of Retails Products -- 1.2 Detection of Single Product -- 2 Clustering by Products Famillies -- 2.1 Multi-object Detection with ASIFT -- 2.2 Distance Normalisation -- 2.3 DBSCAN: Products Famillies -- 2.4 Shelf Planogram Conformity Rate -- 3 Experiments -- 3.1 Database -- 3.2 Evaluation Metrics -- 4 Conclusion -- References -- Enhancing Privacy in Computer Vision Applications: An Emotion Preserving Approach to Obfuscate Faces -- 1 Introduction -- 2 Related Work -- 3 Approach -- 3.1 Face Detection -- 3.2 Face Selection -- 3.3 Face Reconstruction -- 3.4 Color Adaptation -- 3.5 Cloning -- 4 Validation -- 4.1 Experiment -- 4.2 Results -- 5 Conclusion and Future Work -- References -- House Price Prediction via Visual Cues and Estate Attributes -- 1 Introduction -- 2 Related Work -- 3 Proposed Work -- 3.1 Data Collection -- 3.2 Computational Model -- 4 Experiments -- 4.1 Evaluation Metrics -- 4.2 Experimental Results -- 4.3 Ablation Studies -- 5 Conclusion and Future Works -- References -- DRB-Net: Dilated Residual Block Network for Infrared Image Restoration -- 1 Introduction -- 2 Related Work. 2.1 Non-learning Denoising Methods -- 2.2 Discriminative Learning Denoising Methods -- 2.3 Deep Learning for IR Imaging -- 3 Proposed Architecture -- 3.1 Why Dilated Convolution? -- 3.2 Residual Blocks -- 3.3 Architecture and Compared Methods -- 4 Dataset -- 4.1 Sample Preparation and Image Acquisition -- 4.2 Dataset Creation -- 4.3 Implementation -- 5 Experiments -- 5.1 DRB-Net Specification -- 5.2 Denoising of Synthetic Noisy Data -- 5.3 Generalization and Robustness Test -- 6 Conclusion and Future Work -- References -- Segmentation and Tracking -- Saliency Can Be All You Need in Contrastive Self-supervised Learning -- 1 Introduction -- 2 Motivation and Background -- 2.1 Related Work -- 2.2 Concrete Background -- 3 Implementation, Setup and Results -- 3.1 Setup and Datasets -- 3.2 Preliminary: Running SGD on NORCE-PV and MultiRes-PV Datasets -- 3.3 An Efficient Implementation -- 3.4 Using SGD as an Augmentation Policy in Contrastive SSL Algorithms -- 4 Discussion -- 5 Conclusions -- References -- GCEENet: A Global Context Enhancement and Exploitation for Medical Image Segmentation -- 1 Introduction -- 2 Related Work -- 2.1 Convolutional Neural Networks for Semantic Segmentation -- 2.2 Contextual Information Modeling -- 3 Proposed Architecture -- 3.1 Overview -- 3.2 Global Context Encoder Module -- 3.3 Local Distribution -- 3.4 Aggregator Module -- 3.5 Loss Function -- 4 Experiments and Discussion -- 4.1 Benchmark Datasets -- 4.2 Experiment Settings -- 5 Results and Discussion -- 5.1 Ablation Study -- 5.2 Comparison to Baseline Models -- 6 Conclusion -- References -- V2F: Real Time Video Segmentation with Apache Flink -- 1 Introduction -- 2 Related Work -- 3 Video2Flink Architecture -- 3.1 V2F Operators -- 4 Experiments -- 5 Conclusions and Future Work -- References -- Joint Discriminative and Metric Embedding Learning for Person Re-identification. 1 Introduction -- 2 Related Work -- 3 Proposed Approach -- 3.1 Classification Losses -- 3.2 Metric Learning Loss -- 3.3 Joint Classification and Metric Loss -- 3.4 Network Architecture -- 4 Experiments -- 4.1 Implementation Details -- 4.2 Comparison with State-of-the-Art Methods -- 4.3 Ablation Study -- 5 Conclusions -- References -- Transformer Networks for Future Person Localization in First-Person Videos -- 1 Introduction -- 2 Related Work -- 3 Proposed Method -- 3.1 Problem Overview -- 3.2 Input Overview -- 3.3 Implementation Details -- 4 Experiments -- 4.1 Evaluation Metrics and Baselines -- 4.2 Quantitative Results -- 4.3 Additional Analysis -- 4.4 Inference Time Analysis -- 5 Conclusion -- References -- Virtual Reality -- VR-SFT: Reproducing Swinging Flashlight Test in Virtual Reality to Detect Relative Afferent Pupillary Defect -- 1 Introduction -- 2 Literature Review -- 3 Methodology -- 3.1 Swinging Flashlight Test in Virtual Reality -- 3.2 VR Implementation and Experimental Software -- 3.3 RAPD Scoring -- 4 Dataset -- 5 Data Analysis and Results -- 6 Discussion and Future Work -- References -- A Quantitative Analysis of Redirected Walking in Virtual Reality Using Saccadic Eye Movements -- 1 Introduction -- 2 Methodology -- 2.1 Simulation and Hardware -- 2.2 Simulation Tasks and Data Collection -- 2.3 Eye Tracking -- 2.4 Questionnaire -- 2.5 Demographics -- 3 Results -- 4 Conclusion and Future Work -- References -- A DirectX-Based DICOM Viewer for Multi-user Surgical Planning in Augmented Reality -- 1 Introduction -- 2 Related Work -- 2.1 Holographic DICOM Viewer Prototypes -- 2.2 Interaction with 3D Objects -- 3 System Design Overview -- 4 Direct3D-Based DICOM Viewer Implementation -- 4.1 Smartphones as User Input Devices -- 4.2 Functionalities -- 4.3 Marker-Based 3D Object Placement -- 5 User Interactions -- 5.1 Virtual 2D Plane Touch. 5.2 3D User Interaction -- 6 Experiments -- 7 Conclusions -- References -- Virtual-Reality Based Vestibular Ocular Motor Screening for Concussion Detection Using Machine-Learning -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Naive Bayes -- 3.2 Decision Tree -- 3.3 Random Forest -- 3.4 Support Vector Classifer -- 3.5 AdaBoost -- 3.6 Gaussian Process Classifier -- 3.7 Logistic Regression -- 3.8 Perceptron -- 3.9 Isolation Forest -- 3.10 One Class SVM -- 4 Experimental Analysis -- 4.1 Data Collection Using Virtual-Reality Headset -- 4.2 Data Splitting for Training and Testing -- 4.3 Qualitative Evaluation -- 4.4 Quantitative Evaluation -- 5 Conclusion -- References -- Posters -- GUILD - A Generator for Usable Images in Large-Scale Datasets -- 1 Introduction -- 2 Related Work -- 2.1 Manual Collection of Datasets -- 2.2 Synthetic Generation of Datasets -- 3 Implementation -- 3.1 Approach -- 3.2 Object Models -- 3.3 Environments -- 3.4 Label Generation -- 4 Evaluation -- 4.1 Evaluation Design -- 4.2 Evaluation Datasets -- 4.3 Accuracy -- 4.4 Generalizability -- 4.5 Variety -- 5 Conclusion and Future Work -- References -- Distributional Semantics of Line Charts for Trend Classification -- 1 Introduction -- 2 Dataset -- 3 Related Work -- 3.1 Information Graphic Description Generation -- 3.2 Prototype Learning -- 3.3 Bag of Words for Computer Vision -- 3.4 Distributional Semantics -- 4 Architecture and Methodology -- 4.1 Forming the Vocabulary -- 4.2 Line Chart Embeddings -- 4.3 Classification -- 5 Implementation -- 6 Experiments and Results -- 6.1 Classification Task -- 6.2 Results -- 7 Discussion -- 8 Conclusion -- References -- Deep Learning Hyperparameter Optimization for Breast Mass Detection in Mammograms -- 1 Introduction -- 2 Background and Motivation -- 2.1 End-to-End Pipeline -- 2.2 Genetic Algorithm -- 2.3 Binary Tournament Selection. 2.4 Simulated Binary Crossover (SBX).
Record Nr.	UNISA-996503470203316

Edizione	[1st ed. 2022.]
Pubbl/distr/stampa	Cham : , : Springer International Publishing : , : Imprint : Springer, , 2022
Descrizione fisica	1 online resource (486 pages)
Disciplina	929.605 006.37
Collana	Lecture Notes in Computer Science
Soggetto topico	Image processing - Digital techniques Computer vision Artificial intelligence Computer engineering Computer networks Social sciences - Data processing Computer Imaging, Vision, Pattern Recognition and Graphics Artificial Intelligence Computer Engineering and Networks Computer Application in Social and Behavioral Sciences
ISBN	9783031207136 3031207130
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Deep Learning I -- Visualization -- Object Detection and Recognition -- Deep Learning II -- Video Analysis and Event Recognition -- Computer Graphics -- ST: Biomedical Imaging Techniques for Cancer Detection, Diagnosis and Management.
Record Nr.	UNINA-9910634049103321

Edizione	[1st ed. 2021.]
Pubbl/distr/stampa	Cham : , : Springer International Publishing : , : Imprint : Springer, , 2021
Descrizione fisica	1 online resource (635 pages)
Disciplina	006.6
Collana	Image Processing, Computer Vision, Pattern Recognition, and Graphics
Soggetto topico	Pattern recognition systems Computer vision Artificial intelligence Computer engineering Computer networks Automated Pattern Recognition Computer Vision Artificial Intelligence Computer Engineering and Networks
ISBN	3-030-90439-3
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Intro -- Preface -- Organization -- Keynote Talks -- Embodied Perception in-the-Wild -- Design Tools for Material Appearance -- Guidance-Enriched Visual Analytics: Challenges and Opportunities -- Learning and Accruing Knowledge over Time Using Modular Architectures -- Combining Brain-Computer Interfaces and Virtual Reality: Novel 3D Interactions and Promising Applications -- Direct Estimation of Appearance Models for Image Segmentation -- Contents - Part I -- Contents - Part II -- Deep Learning I -- Real-World Thermal Image Super-Resolution -- 1 Introduction -- 2 Related Work -- 2.1 RGB Image Super-Resolution -- 2.2 Thermal Image Super-Resolution -- 3 Dataset -- 4 Thermal RealSR -- 4.1 Realistic Degradation Using KernelGAN and Noise Injection -- 4.2 Super-Resolution Model -- 5 Experiments and Results -- 5.1 Evaluation Metrics -- 5.2 Comparison with the State of the Art -- 6 Conclusion -- References -- QR Code Style Transfer Method Based on Conditional Instance Regularization -- 1 Introduction -- 2 Related Works -- 2.1 QR Code Style Transfer System -- 2.2 Structure of the Style Transfer Network -- 3 Style Transfer Network Based on Conditional Instance Regularization -- 3.1 Conditional Instance Regularization -- 3.2 Residual Connected Module -- 3.3 Style Transfer Network Structure -- 3.4 Weighted Fusion Correction for Styled QR Codes -- 3.5 Artistic Style QR Code Dynamics -- 4 Experiment -- 4.1 Training -- 4.2 Single-Style Training -- 4.3 Multi-style Training -- 4.4 Comparison of Experiment Results -- 5 Conclusion -- References -- Multimodal Multi-tasking for Skin Lesion Classification Using Deep Neural Networks -- 1 Introduction -- 2 Methodology -- 2.1 Dataset -- 2.2 ABCD Rule Feature Extraction -- 2.3 Proposed Model -- 2.4 Class Balancing Techniques -- 3 Experiments -- 3.1 ABCD Rule for Multimodal Multi-tasking. 3.2 Role of Segmentation in Lesion Classification -- 4 Results and Discussion -- 5 Conclusion -- References -- DeepSolfège: Recognizing Solfège Hand Signs Using Convolutional Neural Networks -- 1 Introduction -- 2 Related Work -- 3 Dataset -- 3.1 Labels -- 3.2 Preprocessing -- 4 Method -- 4.1 CNN Architecture -- 4.2 Training -- 4.3 Ablation Study -- 5 Evaluation -- 5.1 Real World Application -- 6 Conclusion -- References -- Image Prior Transfer and Ensemble Architectures for Parkinson's Disease Detection -- 1 Introduction -- 2 Background and Related Works -- 3 Proposed Method -- 3.1 Dataset -- 3.2 Preprocessing -- 3.3 Models -- 3.4 Ensemble Architecture 1 -- 3.5 Ensemble Architecture 2 -- 4 Experimental Results -- 5 Occlusion Analysis to Locate Relevant Regions -- 5.1 Occlusion Analysis for Modified ResNet -- 5.2 Occlusion Analysis for Ensemble Architecture - Model 1 -- 6 Conclusion and Future Works -- References -- Computer Graphics I -- BRDF Measurement of Real Materials Using Handheld Cameras -- 1 Introduction -- 2 BRDF Measurement Using Handheld Cameras -- 2.1 Use of Bivariate BRDF -- 2.2 BRDF Sampling -- 2.3 Dense BRDF Estimation -- 3 BRDF Measurement of Real Materials -- 3.1 Experimental Setup -- 3.2 Estimation Results -- 3.3 Measurement Time -- 4 Conclusion -- References -- SORGATE: Extracting Geometry and Texture from Images of Solids of Revolution -- 1 Introduction -- 2 Previous Work -- 3 Design -- 4 Implementation -- 5 Evaluation -- 6 Conclusions and Future Work -- References -- Putting Table Cartograms into Practice -- 1 Introduction -- 2 Related Work -- 3 TCarto: An Optimization Based Algorithm -- 4 Potential Applications -- 5 Experimental Results -- 6 Limitations and Directions for Future Research -- References -- Perceived Naturalness of Interpolation Methods for Character Upper Body Animation -- 1 Introduction -- 2 Prior Work. 3 Methods -- 3.1 Studied Interpolations -- 3.2 Study Design -- 3.3 Experiment Design -- 3.4 Study Procedure -- 3.5 Data Collection and Analysis -- 4 Discussion -- 5 Conclusion and Future Work -- References -- Neuromuscular Control of the Face-Head-Neck Biomechanical Complex with Learning-Based Expression Transfer from Images and Videos -- 1 Introduction -- 2 Related Work -- 3 Musculoskeletal Model -- 3.1 Control -- 4 Expression Learning -- 4.1 Network Architecture -- 4.2 Training Data Generation -- 4.3 Network Training -- 4.4 Expression Transfer Pipeline -- 5 Experiments and Results -- 5.1 Facial Expression Datasets -- 5.2 Action Units and Muscle Activations -- 5.3 Head Orientation -- 5.4 Facial Expression Transfer -- 6 Conclusion and Future Work -- References -- Segmentation -- Synthesized Image Datasets: Towards an Annotation-Free Instance Segmentation Strategy -- 1 Introduction -- 2 Related Work -- 3 Proposed Approach -- 3.1 Segmentation Algorithm -- 3.2 Synthesized Image Generation -- 4 Experimental Results -- 4.1 Case Study -- 4.2 Free Annotation Results -- 5 Conclusions -- References -- Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction -- 1 Introduction -- 2 Related Work -- 3 Method -- 3.1 Road Border Detection -- 3.2 Road Structure Aware GNN -- 3.3 Element-Wise Attention -- 3.4 Joint Multi-task Training -- 4 Experimental Results -- 4.1 Evaluation Metrics -- 4.2 Results -- 5 Conclusions -- References -- Extraction and Merging of Stroke Structure of Chinese Characters -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 The Path Network -- 3.2 Pixel Selection for Path Net -- 3.3 Stroke Merging Algorithm -- 4 Experiment Results -- 5 Conclusion -- References -- Analysis of Multi-temporal Image Series for the Preventive Conservation of Varnished Wooden Surfaces -- 1 Introduction -- 2 Related Works. 2.1 Change and Damage Detection -- 2.2 A-Contrario Framework -- 3 Proposed Approach -- 3.1 Distance Matrix -- 3.2 Number of False Alarms -- 3.3 Maximal Clusters -- 4 Experiments -- 4.1 Dataset -- 4.2 Results -- 5 Conclusions -- References -- Visualization -- Evaluating User Interfaces for a Driver Guidance System to Support Stationary Wireless Charging of Electric Vehicles -- 1 Introduction -- 2 Related Work -- 3 System -- 3.1 Goals -- 3.2 Overview -- 3.3 Visualization Types -- 3.4 Information Output Setup -- 4 User Study -- 4.1 Test Environment -- 4.2 Task -- 4.3 Data Acquisition -- 4.4 Results -- 5 Discussion -- 5.1 Precision -- 5.2 User Experience -- 5.3 Time -- 5.4 Observations -- 6 Conclusions -- References -- MOBA Coach: Exploring and Analyzing Multiplayer Online Battle Arena Data -- 1 Introduction -- 2 Related Work -- 3 MOBA Coach Tool -- 4 Evaluation -- 4.1 Analysis of the Results -- 4.2 Discussion of the Results -- 5 Conclusion and Future Work -- References -- JobNet: 2D and 3D Visualization for Temporal and Structural Association in High-Performance Computing System -- 1 Introduction -- 2 Related Work -- 3 Design and Implementation -- 3.1 Terms and Definitions -- 3.2 Design Rationale -- 4 Case Study -- 4.1 JobNet2D -- 4.2 JobNet3D -- 5 Conclusion -- References -- Evaluation and Selection of Autoencoders for Expressive Dimensionality Reduction of Spatial Ensembles -- 1 Introduction -- 2 Related Work -- 3 Study Setup, Metrics and Selection -- 4 Evaluation -- 5 Discussion and Outlook -- References -- Data-Driven Estimation of Temporal-Sampling Errors in Unsteady Flows-8pt -- 1 Introduction -- 2 Related Work -- 3 Temporal Subsampling Errors in Pathlines -- 3.1 Temporal Subsampling of Simulated Unsteady Flows -- 3.2 Data-Driven Modeling of Errors -- 4 Validation and Results -- 4.1 2D Flow Past a Cylinder -- 4.2 3D Lifted Ethylene Jet Flame. 5 Conclusion -- References -- Applications -- ReGenMorph: Visibly Realistic GAN Generated Face Morphing Attacks by Attack Re-generation -- 1 Introduction -- 2 Related Works -- 3 Methodology -- 3.1 The ReGenMorph Face Morphing Pipeline -- 3.2 Creating ReGenMorph Morphing Attacks -- 4 Experimental Setup -- 4.1 Database -- 4.2 Vulnerability Analyses -- 4.3 Detectability Analyses -- 5 Results -- 5.1 ReGenMorph Image Appearance -- 5.2 Vulnerability of Face Recognition to ReGenMorph -- 5.3 Detectability of ReGenMorph -- 6 Conclusion -- References -- Car Pose Estimation Through Wheel Detection -- 1 Introduction -- 2 Related Work -- 3 Wheel Detection Techniques -- 4 Pose Estimation -- 5 Evaluation -- 5.1 Evaluation Datasets -- 5.2 Algorithm Parameterization -- 5.3 Evaluation Metrics -- 5.4 Wheel Detection Accuracy and Time Performance -- 5.5 Pose Estimation Accuracy -- 6 Conclusion and Future Work -- References -- Improving Automatic Quality Inspection in the Automotive Industry by Combining Simulated and Real Data -- 1 Introduction -- 2 Related Work -- 3 Methods -- 3.1 Baseline -- 3.2 CycleGAN with Semantic Consistency -- 3.3 Detection System's Improvements -- 4 Experiments -- 5 Results and Discussion -- 5.1 Domain Mapping -- 5.2 Detector Fine-Tuning -- 6 Conclusion and Future Work -- References -- PW-MAD: Pixel-Wise Supervision for Generalized Face Morphing Attack Detection-8pt -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 The Proposed PW-MAD -- 3.2 Baselines -- 4 Experimental Setup -- 4.1 The Dataset -- 4.2 Experiments and Evaluation Metrics -- 5 Results and Discussion -- 6 Conclusion -- References -- Integration of a BCI with a Hand Tracking System and a Motorized Robotic Arm to Improve Decoding of Brain Signals Related to Hand and Finger Movements -- 1 Introduction -- 2 System Assembly -- 3 Use-Cases -- 4 Conclusion -- References. Deep Learning II.
Record Nr.	UNINA-9910512174103321

Edizione	[1st ed. 2020.]
Pubbl/distr/stampa	Cham, Switzerland : , : Springer, , [2020]
Descrizione fisica	1 online resource (XXVIII, 777 p. 351 illus., 296 illus. in color.)
Disciplina	006.4
Collana	Image Processing, Computer Vision, Pattern Recognition, and Graphics
Soggetto topico	Artificial intelligence Image Processing and Computer Vision Pattern perception
ISBN	3-030-64559-2
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Object Recognition/Detection/Categorization -- Few-shot Image Recognition with Manifolds -- A scale-aware YOLO model for pedestrian detection -- Image categorization using Agglomerative clustering based smoothed Dirichlet mixtures -- SAT-CNN: A Small Neural Network for Object Recognition from Satellite Imagery -- Domain Adaptive Transfer Learning on Visual Attention Aware Data Augmentation for Fine-grained Visual Categorization -- 3D Reconstruction -- A Light-Weight Monocular Depth Estimation With Edge-Guided Occlusion Fading Reduction -- Iterative Closest Point with Minimal Free Space Constraints -- Minimal Free Space Constraints for Implicit Distance Bounds -- Medical Image Analysis -- Fetal Brain Segmentation using Convolutional Neural Networks with Fusion Strategies -- Fundus2Angio: A Novel Conditional GAN Architecture for Generating Fluorescein Angiography Images from Retinal Fundus Photography -- Multiscale Detection of Cancerous Tissue in High Resolution Slide Scans -- DeepTKAClassi er: Brand Classification of Total Knee Arthroplasty Implants using Explainable Deep Convolutional Neural Networks -- Multi-modal Image Fusion based on Weight Local Features and Novel Sum-Modified-Laplacian in Non-Subsampled Shearlet Transform Domain -- Robust Prostate Cancer Classification with Siamese Neural Networks -- Vision for Robotics -- Simple Camera-to-2D-LiDAR Calibration Method for General Use -- SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds -- Mobile Manipulator Robot Visual Servoing and Guidance for Dynamic Target Grasping -- Statistical Pattern Recognition -- Interpreting Galaxy Deblender GAN from the Discriminator's Perspective -- Variational Bayesian Sequence to Sequence Networks for Memory-Efficient Sign Language Translation -- A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition -- Posters -- Video based fire detection using Xception and ConvLSTM -- Highway Traffic Classification for the Perception Level of Situation Awareness -- 3D-CNN for Facial Emotion Recognition in Videos -- Reducing Triangle Inequality Violations with Deep Learning and Its Application to Image Retrieval -- A Driver Guidance System to Support the Stationary Wireless Charging of Electric Vehicles -- An Efficient Tiny Feature Map Network For Real-Time Semantic Segmentation -- A Modified Syn2Real Network for Nighttime Rainy Image Restoration -- Unsupervised domain adaptation for person re-identification with few and unlabeled target data -- How Does Computer Animation Affect Our Perception Of Emotions in Video Summarization? -- Where's Wally: A Gigapixel Image Study for Face Recognition in Crowds -- Optical Flow Based Background Subtraction with a Moving Camera: Application to Autonomous Driving -- Deep Facial Expression Recognition with Occlusion Regularization -- Semantic Segmentation with Peripheral Vision -- Generator From Edges: Reconstruction of Facial Images -- CD2 : Combined Distances of Contrast Distributions for Image Quality Analysis -- Real-Time Person Tracking and Association on Doorbell Cameras -- MySnapFoodLog: Culturally Sensitive FoodPhoto-Logging App for Dietary BiculturalismStudies -- Hand Gesture Recognition Based on the Fusion of Visual and Touch Sensing Data -- Gastrointestinal Tract Anomaly Detection from Endoscopic Videos using Object Detection Approach -- A multimodal high level video segmentation for content targeted online advertising -- AI Playground: Unreal Engine-based Data Ablation Tool for Deep Learning -- Homework Helper: Providing Valuable Feedback on Math Mistakes -- Interface Design for HCI Classroom: From Learners' Perspective -- Pre-trained Convolutional Neural Network for the Diagnosis of Tuberculosis -- Near-Optimal Concentric Circles Layout -- Facial Expression Recognition and Ordinal Intensity Estimation: A Multilabel Learning Approach -- Prostate MRI Registration Using Siamese Metric Learning -- Unsupervised Anomaly Detection of the First Person in Gait from an Egocentric Camera -- Emotion Categorization from Video-frame Images using a Novel Sequential Voting Technique -- Systematic Optimization of Image Processing Pipelines Using GPUs -- A Hybrid Approach for Improved Image Similarity Using Semantic Segmentation -- Automated classification of Parkinson's Disease using Diffusion Tensor Imaging Data -- Nonlocal Adaptive Biharmonic Regularizer for Image Restoration -- A Robust Approach to Plagiarism Detection in Handwritten Documents -- Optical Coherence Tomography Latent Fingerprint Image Denoising -- CNN, Segmentation or Semantic Embeddings: Evaluating Scene Context for Trajectory Prediction -- Automatic Extraction of Joint Orientations in Rock Mass using PointNet and DBSCAN -- Feature Map Retargeting to Classify Biomedical Journal Figures -- Automatic 3D Object Detection from RGB-D data using PU-GAN -- Nodule Generation of Lung CT Images using a 3D Convolutional LSTM Network -- Conditional GAN for Prediction of Glaucoma Progression with Macular Optical Coherence Tomography.
Record Nr.	UNINA-9910447250403321