Share Catalogue

Pubblicazioni (Istanze)

Vai a Persone/Opere

Home / (Tutto) >> MaZhanyu

Info

Utilizzare la checkbox di selezione a fianco di ciascun documento per attivare le funzionalità di stampa, invio email, download nei formati disponibili del (i) record.

Info

Utilizzare questo link per rimuovere la selezione effettuata.

Export / Download (0)

Esporta in PDF
Esporta in Excel
Esporta in HTML
Esporta in MARC (binario)
Esporta in MARC XML
Esporta in MARC (testo)
Invia tramite E-Mail

Biblioteca

Univ. Federico II (13)
Univ. di Salerno (11)

Tutto
+

MARC Lista (tabellare)

Seleziona tutti

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part V / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Liu Qingshan

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. di Salerno

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part VII / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Liu Qingshan

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. di Salerno

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part X / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Liu Qingshan

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. di Salerno

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part XIII / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Liu Qingshan

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. di Salerno

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part VIII / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. Federico II

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part III / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. Federico II

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part I / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. Federico II

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part II / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. Federico II

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part XI / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. Federico II

Opac:

Controlla la disponibilità qui

Pattern Recognition and Computer Vision [[electronic resource] ] : 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13–15, 2023, Proceedings, Part XII / / edited by Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji

Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024

Materiale a stampa

Lo trovi qui: Univ. Federico II

Opac:

Controlla la disponibilità qui

1 2 3

Autore (Ente)

Autore (Convegno)

Opere

Pubbl/distr/stampa

Imprint: Springer (24)
Springer Nature Singapore (24)

Lingua di pubblicazione

Inglese (24)

Data

Data di pubblicazione

2024 (24)

Soggetto (Persona)

Soggetto (Ente)

Soggetto (Convegno)

Soggetto geografico

Soggetto topico

Computer Communication Networks (24)
Computer Imaging, Vision, Pattern Recognition and Graphics (24)
Computer System Implementation (24)
Computer networks (24)
Computer systems (24)

Altro...

Autore	Liu Qingshan
Edizione	[1st ed. 2024.]
Pubbl/distr/stampa	Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024
Descrizione fisica	1 online resource (542 pages)
Disciplina	621.39 004.6
Altri autori (Persone)	WangHanzi MaZhanyu ZhengWeishi ZhaHongbin ChenXilin WangLiang JiRongrong
Collana	Lecture Notes in Computer Science
Soggetto topico	Computer engineering Computer networks Image processing - Digital techniques Computer vision Computer systems Machine learning Computer Engineering and Networks Computer Imaging, Vision, Pattern Recognition and Graphics Computer Communication Networks Computer System Implementation Machine Learning
ISBN	981-9984-69-6
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Biometric Recognition -- Face Recognition and Pose Recognition -- Structural Pattern Recognition.
Record Nr.	UNISA-996587868903316

Autore	Liu Qingshan
Edizione	[1st ed. 2024.]
Pubbl/distr/stampa	Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024
Descrizione fisica	1 online resource (525 pages)
Disciplina	006
Altri autori (Persone)	WangHanzi MaZhanyu ZhengWeishi ZhaHongbin ChenXilin WangLiang JiRongrong
Collana	Lecture Notes in Computer Science
Soggetto topico	Image processing - Digital techniques Computer vision Artificial intelligence Application software Computer networks Computer systems Machine learning Computer Imaging, Vision, Pattern Recognition and Graphics Artificial Intelligence Computer and Information Systems Applications Computer Communication Networks Computer System Implementation Machine Learning
ISBN	981-9985-40-4
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Document Analysis and Recognition -- Feature Extraction and Feature Selection -- Multimedia Analysis and Reasoning.
Record Nr.	UNISA-996587869003316

Autore	Liu Qingshan
Edizione	[1st ed. 2024.]
Pubbl/distr/stampa	Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024
Descrizione fisica	1 online resource (509 pages)
Disciplina	006
Altri autori (Persone)	WangHanzi MaZhanyu ZhengWeishi ZhaHongbin ChenXilin WangLiang JiRongrong
Collana	Lecture Notes in Computer Science
Soggetto topico	Image processing - Digital techniques Computer vision Artificial intelligence Application software Computer networks Computer systems Machine learning Computer Imaging, Vision, Pattern Recognition and Graphics Artificial Intelligence Computer and Information Systems Applications Computer Communication Networks Computer System Implementation Machine Learning
ISBN	981-9985-49-8
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Intro -- Preface -- Organization -- Contents - Part X -- Neural Network and Deep Learning III -- Dual-Stream Context-Aware Neural Network for Survival Prediction from Whole Slide Images -- 1 Introduction -- 2 Method -- 3 Experiments and Results -- 4 Conclusion -- References -- A Multi-label Image Recognition Algorithm Based on Spatial and Semantic Correlation Interaction -- 1 Introduction -- 2 Related Work -- 2.1 Correlation-Agnostic Algorithms -- 2.2 Spatial Correlation Algorithms -- 2.3 Semantic Correlation Algorithms -- 3 Methodology -- 3.1 Definition of Multi-label Image Recognition -- 3.2 The Framework of SSCI -- 3.3 Loss Function -- 4 Experiments -- 4.1 Evaluation Metrics -- 4.2 Implementation Details -- 4.3 Comparison with Other Mainstream Algorithms -- 4.4 Evaluation of the SSCI Effectiveness -- 5 Conclusion -- References -- Hierarchical Spatial-Temporal Network for Skeleton-Based Temporal Action Segmentation -- 1 Introduction -- 2 Related Work -- 2.1 Temporal Action Segmentation -- 2.2 Skeleton-Based Action Recognition -- 3 Method -- 3.1 Network Architecture -- 3.2 Multi-Branch Transfer Fusion Module -- 3.3 Multi-Scale Temporal Convolution Module -- 3.4 Loss Function -- 4 Experiments -- 4.1 Setup -- 4.2 Effect of Hierarchical Model -- 4.3 Effect of Multiple Modalties -- 4.4 Effect of Multi-modal Fusion Methods -- 4.5 Effect of Multi-Scale Temporal Convolution -- 4.6 Comparision with State-of-the-Art -- 5 Conclusion -- References -- Multi-behavior Enhanced Graph Neural Networks for Social Recommendation -- 1 Introduction -- 2 Related Work -- 3 Preliminaries -- 4 Methodology -- 4.1 Embedding Layer -- 4.2 Propagation Layer -- 4.3 Multi-behavior Integration Layer -- 4.4 Prediction Layer -- 4.5 Model Training -- 5 Experiments -- 5.1 Experimental Settings -- 5.2 Performance Comparison (RQ1) -- 5.3 Ablation Study (RQ2). 5.4 Parameter Analysis (RQ3) -- 6 Conclusion and Future Work -- References -- A Complex-Valued Neural Network Based Robust Image Compression -- 1 Introduction -- 2 Related Works -- 2.1 Neural Image Compression -- 2.2 Adversarial Attack -- 2.3 Complex-Valued Convolutional Neural Networks -- 3 Proposed Method -- 3.1 Overall Framework -- 3.2 Nonlinear Transform -- 4 Experiment Results -- 4.1 Experiment Setup -- 4.2 Results and Comparison -- 4.3 Ablation Study -- 5 Conclusions -- References -- Binarizing Super-Resolution Neural Network Without Batch Normalization -- 1 Introduction -- 2 Related Work -- 3 Method -- 3.1 Batch Normalization in SR Models -- 3.2 Channel-Wise Asymmetric Binarizer for Activations -- 3.3 Smoothness-Controlled Estimator -- 4 Experimentation -- 4.1 Experiment Setup -- 4.2 Ablation Study -- 4.3 Visualization -- 5 Conclusion -- References -- Infrared and Visible Image Fusion via Test-Time Training -- 1 Introduction -- 2 Method -- 2.1 Overall Framework -- 2.2 Training and Testing -- 3 Experiments -- 3.1 Experiment Configuration -- 3.2 Performance Comparison on TNO -- 3.3 Performance Comparison on VIFB -- 3.4 Ablation Study -- 4 Conclusion -- References -- Graph-Based Dependency-Aware Non-Intrusive Load Monitoring -- 1 Introduction -- 2 Proposed Method -- 2.1 Problem Formulation -- 2.2 Co-occurrence Probability Graph -- 2.3 Graph Structure Learning -- 2.4 Graph Attention Neural Network -- 2.5 Encoder-Decoder Module -- 3 Numerical Studies and Discussions -- 3.1 Dataset and Experiment Setup -- 3.2 Metrics and Comparisons -- 4 Conclusion -- References -- Few-Shot Object Detection via Classify-Free RPN -- 1 Introduction -- 2 Related Work -- 2.1 Object Detection -- 2.2 Few-Shot Learning -- 2.3 Few-Shot Object Detection -- 3 Methodology -- 3.1 Problem Setting -- 3.2 Analysis of the Base Class Bias Issue in RPN -- 3.3 Classify-Free RPN. 4 Experiments -- 4.1 Experimental Setup -- 4.2 Comparison with the State-of-the-Art -- 4.3 Ablation Study -- 5 Conclusion -- References -- IPFR: Identity-Preserving Face Reenactment with Enhanced Domain Adversarial Training and Multi-level Identity Priors -- 1 Introduction -- 2 Methods -- 2.1 Target Motion Encoder and 3D Shape Encoder -- 2.2 3D Shape-Aware Warping Module -- 2.3 Identity-Aware Refining Module -- 2.4 Enhanced Domain Discriminator -- 2.5 Training -- 3 Experiment -- 3.1 Experimental Setup -- 3.2 Comparisons -- 3.3 Ablation Study -- 4 Limitation -- 5 Conclusion -- References -- L2MNet: Enhancing Continual Semantic Segmentation with Mask Matching -- 1 Introduction -- 2 Related Work -- 3 Method -- 3.1 Preliminaries and Revisiting -- 3.2 Proposed Learn-to-Match Framework -- 3.3 Training Loss -- 4 Experiments -- 4.1 Experimental Setting -- 4.2 Quantitative Evaluation -- 4.3 Ablation Study -- 5 Conclusion -- References -- Adaptive Channel Pruning for Trainability Protection -- 1 Introduction -- 2 Related Work -- 3 Method -- 3.1 Method Framework and Motivation -- 3.2 Channel Similarity Calculation and Trainability Preservation -- 3.3 Sparse Control and Optimization -- 4 Experiments -- 4.1 Experiments Settings and Evaluation Metrics -- 4.2 Results on Imagenet -- 4.3 Results on Cifar-10 -- 4.4 Results on YOLOX-s -- 4.5 Ablation -- 5 Conclusion -- References -- Exploiting Adaptive Crop and Deformable Convolution for Road Damage Detection -- 1 Introduction -- 2 Related Work -- 3 Methods -- 3.1 Adaptive Image Cropping Based on Vanishing Point Estimation -- 3.2 Feature Learning with Deformable Convolution -- 3.3 Diagonal Intersection over Union Loss Function -- 4 Experiment -- 4.1 Comparative Analysis of Different Datasets -- 4.2 Ablation Analysis -- 5 Conclusion -- References -- Cascaded-Scoring Tracklet Matching for Multi-object Tracking. 1 Introduction -- 2 Related Work -- 2.1 Tracking by Detection -- 2.2 Joint Detection and Tracking -- 3 Proposed Method -- 3.1 Cascaded-Scoring Tracklet Matching -- 3.2 Motion-Guided Based Target Aware -- 3.3 Appearance-Assisted Feature Warper -- 4 Experiments -- 4.1 Experimental Setup -- 4.2 Ablation Studies -- 4.3 Comparison with State-of-the-Art Methods -- 5 Conclusion -- References -- Boosting Generalization Performance in Person Re-identification -- 1 Introduction -- 2 Related Work -- 2.1 Generalizable Person ReID -- 2.2 Vision-Language Learning -- 3 Method -- 3.1 Review of CLIP -- 3.2 A Novel Cross-Modal Framework -- 3.3 Prompt Design Process -- 3.4 Loss Function -- 4 Experiments -- 4.1 Datasets and Evaluation Protocols -- 4.2 Implementation Details -- 4.3 Ablation Study -- 4.4 Comparison with State-of-the-Art Methods -- 4.5 Other Analysis -- 5 Conclusion -- References -- Self-guided Transformer for Video Super-Resolution -- 1 Introduction -- 2 Related Work -- 2.1 Video Super-Resolution -- 2.2 Vision Transformers -- 3 Our Method -- 3.1 Network Overview -- 3.2 Multi-headed Self-attention Module Based on Offset-Guided Window (OGW-MSA) -- 3.3 Feature Aggregation (FA) -- 4 Experiments -- 4.1 Datasets and Experimental Settings -- 4.2 Comparisons with State-of-the-Art Methods -- 4.3 Ablation Study -- 5 Conclusion -- References -- SAMP: Sub-task Aware Model Pruning with Layer-Wise Channel Balancing for Person Search -- 1 Introduction -- 2 Related Work -- 3 The Proposed Method -- 3.1 Framework Overview -- 3.2 Sub-task Aware Channel Importance Estimation -- 3.3 Layer-Wise Channel Balancing -- 3.4 Adaptive OIM Loss for Model Pruning and Finetuning -- 4 Experimental Results and Analysis -- 4.1 Dataset and Evaluation Metric -- 4.2 Implementation Details -- 4.3 Comparison with the State-of-the-Art Approaches -- 4.4 Ablation Study -- 5 Conclusion. References -- MKB: Multi-Kernel Bures Metric for Nighttime Aerial Tracking -- 1 Introduction -- 2 Methodology -- 2.1 Kernel Bures Metric -- 2.2 Multi-Kernel Bures Metric -- 2.3 Objective Loss -- 3 Experiments -- 3.1 Implementation Details -- 3.2 Evaluation Datasets -- 3.3 Comparison Results -- 3.4 Visualization -- 3.5 Ablation Study -- 4 Conclusion -- References -- Deep Arbitrary-Scale Unfolding Network for Color-Guided Depth Map Super-Resolution -- 1 Introduction -- 2 The Proposed Method -- 2.1 Problem Formulation -- 2.2 Algorithm Unfolding -- 2.3 Continuous Up-Sampling Fusion (CUSF) -- 2.4 Loss Function -- 3 Experimental Results -- 3.1 Implementation Details -- 3.2 The Quality Comparison of Different DSR Methods -- 3.3 Ablation Study -- 4 Conclusion -- References -- SSDD-Net: A Lightweight and Efficient Deep Learning Model for Steel Surface Defect Detection -- 1 Introduction -- 2 Methods -- 2.1 LMFE: Light Multiscale Feature Extraction Module -- 2.2 SEFF: Simple Effective Feature Fusion Network -- 2.3 SSDD-Net -- 3 Experiments and Analysis -- 3.1 Implementation Details -- 3.2 Evaluation Metrics -- 3.3 Dataset -- 3.4 Ablation Studies -- 3.5 Comparison with Other SOTA Methods -- 3.6 Comprehensive Performance of SSDD-Net -- 4 Conclusion -- References -- Effective Small Ship Detection with Enhanced-YOLOv7 -- 1 Introduction -- 2 Method -- 2.1 Small Object-Aware Feature Extraction Module (SOAFE) -- 2.2 Small Object-Friendly Scale-Insensitive Regression Scheme (SOFSIR) -- 2.3 Geometric Constraint-Based Non-Maximum Suppression Method (GCNMS) -- 3 Experiments -- 3.1 Experimental Settings -- 3.2 Quantitative Analysis -- 3.3 Ablation Studies -- 3.4 Qualitative Analysis -- 4 Conclusion -- References -- PiDiNeXt: An Efficient Edge Detector Based on Parallel Pixel Difference Networks -- 1 Introduction -- 2 Related Work. 2.1 The Development of Deep Learning Based Edge Detection.
Record Nr.	UNISA-996587868803316

Autore	Liu Qingshan
Edizione	[1st ed. 2024.]
Pubbl/distr/stampa	Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024
Descrizione fisica	1 online resource (524 pages)
Disciplina	006
Altri autori (Persone)	WangHanzi MaZhanyu ZhengWeishi ZhaHongbin ChenXilin WangLiang JiRongrong
Collana	Lecture Notes in Computer Science
Soggetto topico	Image processing - Digital techniques Computer vision Artificial intelligence Application software Computer networks Computer systems Machine learning Computer Imaging, Vision, Pattern Recognition and Graphics Artificial Intelligence Computer and Information Systems Applications Computer Communication Networks Computer System Implementation Machine Learning
ISBN	981-9985-58-7
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Intro -- Preface -- Organization -- Contents - Part XIII -- Medical Image Processing and Analysis -- Growth Simulation Network for Polyp Segmentation -- 1 Introduction -- 2 The Proposed Method -- 2.1 Gaussian Map and Body Map -- 2.2 Overall Architecture -- 2.3 Features Extraction and Fusion Module -- 2.4 Dynamic Attention Guidance Module -- 2.5 Dynamic Simulation Loss -- 3 Experiments -- 3.1 Settings -- 3.2 Comparisons with State-of-the-art -- 3.3 Ablation Study -- 4 Conclusion -- References -- Brain Diffuser: An End-to-End Brain Image to Brain Network Pipeline -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Feature Extraction Module -- 3.2 Brain Diffuser -- 3.3 GCN Classifier -- 3.4 Loss Function -- 4 Experiments -- 4.1 Dataset and Preprocessing -- 4.2 Experiment Configuration -- 4.3 Results and Discussion -- 5 Conclusion -- References -- CCJ-SLC: A Skin Lesion Image Classification Method Based on Contrastive Clustering and Jigsaw Puzzle -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Overview of Our Method -- 3.2 Contrastive Clustering -- 3.3 Jigsaw Puzzle -- 3.4 Loss Function -- 4 Experiments -- 4.1 Dataset and Evaluation Metrics -- 4.2 Baseline Performance -- 4.3 Ablation Experiment -- 4.4 Analysis -- 5 Conclusion -- References -- A Real-Time Network for Fast Breast Lesion Detection in Ultrasound Videos -- 1 Introduction -- 2 Method -- 2.1 Space Time Feature Aggregation (STA) Module -- 3 Experiments and Results -- 3.1 Comparisons with State-of-the-Arts -- 3.2 Ablation Study -- 3.3 Generalizability of Our Network -- 4 Conclusion -- References -- CBAV-Loss: Crossover and Branch Losses for Artery-Vein Segmentation in OCTA Images -- 1 Introduction -- 2 Methods -- 2.1 Overview -- 2.2 Crossover Loss and Branch Loss -- 2.3 Loss Function -- 3 Experiments -- 3.1 Data -- 3.2 Experimental Settings -- 3.3 Evaluation Metrics. 3.4 Ablation Study on CBAV-Loss -- 3.5 Influence of the Proposed Loss on Different Segmentation Networks -- 4 Conclusion -- References -- Leveraging Data Correlations for Skin Lesion Classification -- 1 Introduction -- 2 Related Work -- 2.1 Skin Lesion Classification -- 2.2 Correlation Mining -- 3 Methodology -- 3.1 Feature Enhancement Stage -- 3.2 Label Distribution Learning Stage -- 4 Experiments -- 4.1 Experiment Settings -- 4.2 Hyper Parameters Setting -- 4.3 Comparison with State-of-the-Art Methods -- 4.4 Ablation Studies -- 5 Conclusion -- References -- CheXNet: Combing Transformer and CNN for Thorax Disease Diagnosis from Chest X-ray Images -- 1 Introduction -- 2 Related Work -- 2.1 Label Dependency and Imbalance -- 2.2 Extensive Lesion Location -- 3 Approaches -- 3.1 Label Embedding and MSP Block -- 3.2 Inner Branch -- 3.3 C2T and T2C in IIM -- 4 Experiments -- 4.1 Dataset -- 4.2 Comparison to the State-of-the-Arts -- 4.3 Ablation Study -- 5 Conclusion -- References -- Cross Attention Multi Scale CNN-Transformer Hybrid Encoder Is General Medical Image Learner -- 1 Introduction -- 2 Methods -- 2.1 Dual Encoder -- 2.2 Shallow Fusion Module -- 2.3 Deep Fusion Module -- 2.4 Deep Supervision -- 3 Experiments and Results -- 3.1 Dateset -- 3.2 Implementation Details -- 3.3 Comparison with Other Methods -- 3.4 Ablation Studies -- 4 Conclusion -- References -- Weakly/Semi-supervised Left Ventricle Segmentation in 2D Echocardiography with Uncertain Region-Aware Contrastive Learning -- 1 Introduction -- 2 Methods -- 2.1 Multi-level Regularization of Semi-supervision -- 2.2 Uncertain Region-Aware Contrastive Learning -- 2.3 Differentiable Ejection Fraction Estimation of Weak Supervision -- 3 Datasets and Implementation Details -- 4 Results -- 5 Conclusion -- References. Spatial-Temporal Graph Convolutional Network for Insomnia Classification via Brain Functional Connectivity Imaging of rs-fMRI -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Data Preprocessing -- 3.2 Data Augmentation -- 3.3 Construction of Spatio-Temporal Graph -- 3.4 Spatio-Temporal Graph Convolution (ST-GC) -- 3.5 ST-GCN Building -- 3.6 Edge Importance Learning -- 4 Experiments -- 4.1 Dataset -- 4.2 Evaluation Metrics -- 4.3 Analysis of Different Sliding Window Step Size -- 4.4 Comparison with Other Methods -- 5 Conclusion -- References -- Probability-Based Nuclei Detection and Critical-Region Guided Instance Segmentation -- 1 Introduction -- 2 Related Works on Nucleus Instance Segmentation -- 2.1 Bounding Box-Based Methods -- 2.2 Boundary-Based Methods -- 2.3 Critical Region-Based Methods -- 3 CGIS Method and CPF Feature -- 3.1 Critical-Region Guided Instance Segmentation -- 3.2 Central Probability Field -- 3.3 Nuclear Classification -- 4 Experimental Verification and Analysis -- 4.1 Datasets and Evaluation Metrics -- 4.2 Parameters and Implementation Details -- 4.3 Comparisons with Other Methods -- 4.4 Ablation Study -- 5 Conclusion -- References -- FlashViT: A Flash Vision Transformer with Large-Scale Token Merging for Congenital Heart Disease Detection -- 1 Introduction -- 2 Method -- 2.1 Overview -- 2.2 FlashViT Block -- 2.3 Large-Scale Token Merging Module -- 2.4 Architecture Variants -- 3 Experiments -- 3.1 CHD Dataset -- 3.2 Evaluations on CHD Dataset -- 3.3 Homogenous Pre-training Strategy -- 3.4 Ablation Study -- 4 Conclusion -- References -- Semi-supervised Retinal Vessel Segmentation Through Point Consistency -- 1 Introduction -- 2 Method -- 2.1 Segmentation Module -- 2.2 Point Consistency Module -- 2.3 Semi-supervised Training Through Point Consistency -- 3 Experiments -- 3.1 Datasets -- 3.2 Implementation Details. 3.3 Experimental Results -- 4 Conclusion -- References -- Knowledge Distillation of Attention and Residual U-Net: Transfer from Deep to Shallow Models for Medical Image Classification -- 1 Introduction -- 2 Methods -- 2.1 Res-Transformer Teacher Model Based on U-Net Structure -- 2.2 ResU-Net Student Model Incorporates Residual -- 2.3 Knowledge Distillation -- 3 Data and Experiments -- 3.1 Datasets -- 3.2 Experimental Settings -- 3.3 Results -- 4 Conclusion -- References -- Two-Stage Deep Learning Segmentation for Tiny Brain Regions -- 1 Introduction -- 2 Method -- 2.1 Overall Workflow -- 2.2 Two-Stage Segmentation Network -- 2.3 Contrast Loss Function -- 2.4 Attention Modules -- 3 Experiments -- 3.1 Dataset and Metrics -- 3.2 Comparisons Experiments -- 4 Conclusion -- References -- Encoder Activation Diffusion and Decoder Transformer Fusion Network for Medical Image Segmentation -- 1 Introduction -- 2 Methodology -- 2.1 Lightweight Convolution Modulation -- 2.2 Encoder Activation Diffusion -- 2.3 Multi-scale Decoding Fusion with Transformer -- 3 Experiments -- 3.1 Datasets -- 3.2 Implementation Details -- 3.3 Evaluation Results -- 3.4 Ablation Study -- 4 Conclusion -- References -- Liver Segmentation via Learning Cross-Modality Content-Aware Representation -- 1 Introduce -- 2 Methodology -- 2.1 Overview -- 2.2 Image-to-Image Network -- 2.3 Peer-to-Peer Network -- 3 Experiments -- 3.1 Dataset -- 3.2 Setting -- 3.3 Result -- 4 Conclusion -- References -- Semi-supervised Medical Image Segmentation Based on Multi-scale Knowledge Discovery and Multi-task Ensemble -- 1 Introduction -- 2 Related Works on SSMIS -- 3 Proposed Method -- 3.1 Multi-scale Knowledge Discovery -- 3.2 Multi-task Ensemble Strategy -- 4 Experiments and Analysis -- 4.1 Datasets and Implementation Details -- 4.2 Comparisons with State-of-the-Art Methods -- 4.3 Ablation Studies. 5 Conclusion -- References -- LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation -- 1 Introduction -- 2 Method -- 2.1 Encoder-Decoder Architecture -- 2.2 Location-Adaptive Attention -- 2.3 SimAM-Skip Structure -- 3 Experiments -- 3.1 Dataset -- 3.2 Implementation Details -- 3.3 Evaluation Results -- 3.4 Ablation Study -- 3.5 Discussion -- 4 Conclusions -- References -- Adversarial Keyword Extraction and Semantic-Spatial Feature Aggregation for Clinical Report Guided Thyroid Nodule Segmentation -- 1 Introduction -- 2 Method -- 2.1 Adversarial Keyword Extraction (AKE) -- 2.2 Semantic-Spatial Features Aggregation (SSFA) -- 2.3 The Full Objective Functions -- 3 Experiment -- 3.1 Comparison with the State-of-the-Arts -- 3.2 Ablation Study -- 3.3 Visualization of Generated Keyword Masks -- 4 Conclusion -- References -- A Multi-modality Driven Promptable Transformer for Automated Parapneumonic Effusion Staging -- 1 Introduction -- 2 Related Works -- 2.1 Disease Detection Methods with CT Images -- 2.2 Classification Methods with Time Sequence Videos -- 3 Method -- 3.1 CNN-Based Slice-Level Feature Extraction -- 3.2 Prompt Encoder -- 3.3 Cross-Modality Fusion Transformer -- 4 Experiments -- 4.1 Setting and Implementation -- 4.2 Results -- 4.3 Ablation Study -- 5 Conclusion -- References -- Assessing the Social Skills of Children with Autism Spectrum Disorder via Language-Image Pre-training Models -- 1 Introduction -- 2 Related Works -- 2.1 Behavior Signal Processing System -- 2.2 Language-Image Pre-training Models -- 3 Methodology -- 3.1 Paradigm Design -- 3.2 Language-Image Based Method -- 4 Experimental Results -- 4.1 Database -- 4.2 Results -- 4.3 Discussion -- 5 Conclusion -- References -- PPS: Semi-supervised 3D Biomedical Image Segmentation via Pyramid Pseudo-Labeling Supervision -- 1 Introduction -- 2 Method. 2.1 Overview.
Record Nr.	UNISA-996587868703316

Edizione	[1st ed. 2024.]
Pubbl/distr/stampa	Singapore : , : Springer Nature Singapore : , : Imprint : Springer, , 2024
Descrizione fisica	1 online resource (XIV, 513 p. 157 illus., 152 illus. in color.)
Disciplina	006
Collana	Lecture Notes in Computer Science
Soggetto topico	Image processing - Digital techniques Computer vision Artificial intelligence Application software Computer networks Computer systems Machine learning Computer Imaging, Vision, Pattern Recognition and Graphics Artificial Intelligence Computer and Information Systems Applications Computer Communication Networks Computer System Implementation Machine Learning
ISBN	981-9985-43-9
Formato	Materiale a stampa
Livello bibliografico	Monografia
Lingua di pubblicazione	eng
Nota di contenuto	Intro -- Preface -- Organization -- Contents - Part VIII -- Neural Network and Deep Learning I -- A Quantum-Based Attention Mechanism in Scene Text Detection -- 1 Introduction -- 2 Related Work -- 2.1 Attention Mechanism -- 2.2 Revisit Quantum-State-based Mapping -- 3 Approach -- 3.1 QSM-Based Channel Attention (QCA) Module and QSM-Based Spatial Attention (QSA) Module -- 3.2 Quantum-Based Convolutional Attention Module (QCAM) -- 3.3 Adaptive Channel Information Transfer Module (ACTM) -- 4 Experiments -- 4.1 Implementation Details -- 4.2 Performance Comparison -- 4.3 Ablation Study -- 5 Discussion and Conclusion -- References -- NCMatch: Semi-supervised Learning with Noisy Labels via Noisy Sample Filter and Contrastive Learning -- 1 Introduction -- 2 Related Work -- 2.1 Semi-supervised Learning -- 2.2 Self-supervised Contrastive Learning -- 2.3 Learning with Noisy Labels -- 3 Method -- 3.1 Preliminaries -- 3.2 Overall Framework -- 3.3 Noisy Sample Filter (NSF) -- 3.4 Semi-supervised Contrastive Learning (SSCL) -- 4 Experiments -- 4.1 Datasets -- 4.2 Experimental for SSL -- 4.3 Experimental for SSLNL -- 4.4 Ablation Study -- 5 Conclusion -- References -- Data-Free Low-Bit Quantization via Dynamic Multi-teacher Knowledge Distillation -- 1 Introduction -- 2 Related Work -- 3 Method -- 3.1 Preliminaries -- 3.2 More Insight on 8-Bit Quantized Models -- 3.3 Dynamic Multi-teacher Knowledge Distillation -- 4 Experiments -- 4.1 Experimental Setups -- 4.2 Comparison with Previous Data-Free Quantization Methods -- 4.3 Ablation Studies -- 5 Conclusion -- References -- LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation -- 1 Introduction -- 2 Related Works -- 3 Method -- 3.1 Architecture of LeViT-UNet -- 3.2 LeViT as Encoder -- 3.3 CNNs as Decoder -- 4 Experiments and Results -- 4.1 Dataset -- 4.2 Implementation Details. 4.3 Experiment Results on Synapse Dataset -- 4.4 Experiment Results on ACDC Dataset -- 5 Conclusion -- References -- DUFormer: Solving Power Line Detection Task in Aerial Images Using Semantic Segmentation -- 1 Introduction -- 2 Related Work -- 2.1 Vision Transformer -- 2.2 Semantic Segmentation -- 3 Proposed Architecture -- 3.1 Overview -- 3.2 Double U Block (DUB) -- 3.3 Power Line Aware Block (PLAB) -- 3.4 BiscSE Block -- 3.5 Loss Function -- 4 Experiments -- 4.1 Experimental Settings -- 4.2 Comparative Experiments -- 4.3 Ablation Experiments -- 5 Conclusion -- References -- Space-Transform Margin Loss with Mixup for Long-Tailed Visual Recognition -- 1 Introduction -- 2 Related Work -- 2.1 Mixup and Its Space Transformation -- 2.2 Long-Tailed Learning with Mixup -- 2.3 Re-balanced Loss Function Modification Methods -- 3 Method -- 3.1 Space Transformation in Mixup -- 3.2 Space-Transform Margin Loss Function -- 4 Experiments -- 4.1 Datasets -- 4.2 Implementations Details -- 4.3 Main Results -- 4.4 Feature Visualization and Analysis of STM Loss -- 4.5 Ablation Study -- 5 Conclusion -- References -- A Multi-perspective Squeeze Excitation Classifier Based on Vision Transformer for Few Shot Image Classification -- 1 Introduction -- 2 Related Work -- 3 Method -- 3.1 Problem Definition -- 3.2 Meta-Training Phase -- 3.3 Meta-test Phase -- 4 Experimental Results -- 4.1 Datasets and Training Details -- 4.2 Evaluation Results -- 4.3 Ablation Study -- 5 Conclusion -- References -- ITCNN: Incremental Learning Network Based on ITDA and Tree Hierarchical CNN -- 1 Introduction -- 2 Proposed Network -- 2.1 Network Structure -- 2.2 ITDA -- 2.3 Branch Route -- 2.4 Training Strategies -- 2.5 Optimization Strategies -- 3 Experiments and Results -- 3.1 Experiment on Classification -- 3.2 Experiment on CIL -- 4 Conclusion -- References. Periodic-Aware Network for Fine-Grained Action Recognition -- 1 Introduction -- 2 Related Work -- 2.1 Skeleton-Based Action Recognition -- 2.2 Periodicity Estimation of Videos -- 2.3 Squeeze and Excitation Module -- 3 Method -- 3.1 3D-CNN Backbone -- 3.2 Periodicity Feature Extraction Module -- 3.3 Periodicity Fusion Module -- 4 Experiment -- 4.1 Datasets -- 4.2 Implementation Details -- 4.3 Ablation Study -- 4.4 Comparison with State-of-the-Art Methods -- 5 Conclusion -- References -- Learning Domain-Invariant Representations from Text for Domain Generalization -- 1 Introduction -- 2 Related Work -- 2.1 Domain Generalization -- 2.2 CLIP in Domain Generalization -- 3 Method -- 3.1 Problem Formulation -- 3.2 Text Regularization -- 3.3 CLIP Representations -- 4 Experiments and Results -- 4.1 Datasets and Experimental Settings -- 4.2 Comparison with Existing DG Methods -- 4.3 Ablation Study -- 5 Conclusions -- References -- TSTD:A Cross-modal Two Stages Network with New Trans-decoder for Point Cloud Semantic Segmentation -- 1 Introduction -- 2 Related Works -- 2.1 Image Transformers -- 2.2 Point Cloud Transformer -- 2.3 Joint 2D-3D Network -- 3 Method -- 3.1 Overall Architecture -- 3.2 2D-3D Backprojection -- 3.3 Trans-Decoder -- 4 Experiments -- 4.1 Dataset and Metric -- 4.2 Performance Comparison -- 4.3 Ablation Experiment -- 5 Conclusion -- References -- NeuralMAE: Data-Efficient Neural Architecture Predictor with Masked Autoencoder -- 1 Introduction -- 2 Related Work -- 2.1 Neural Architecture Performance Predictors -- 2.2 Generative Self-supervised Learning -- 3 Method -- 3.1 Overall Framework -- 3.2 Pre-training -- 3.3 Fine-Tuning -- 3.4 Multi-head Attention-Masked Transformer -- 4 Experiments -- 4.1 Implementation Details -- 4.2 Experiments on NAS-Bench-101 -- 4.3 Experiments on NAS-Bench-201 -- 4.4 Experiments on NAS-Bench-301. 4.5 Ablation Study -- 5 Conclusion -- References -- Co-regularized Facial Age Estimation with Graph-Causal Learning -- 1 Introduction -- 2 Method -- 2.1 Problem Formulation -- 2.2 Ordinal Decision Mapping -- 2.3 Bilateral Counterfactual Pooling -- 3 Experiments -- 3.1 Datasets and Evaluation Settings -- 3.2 Comparison with State-of-the-Art Methods -- 3.3 Ablation Study -- 3.4 Performance Under Out-of-Distribution Settings -- 3.5 Qualitative Results -- 4 Conclusion -- References -- Online Distillation and Preferences Fusion for Graph Convolutional Network-Based Sequential Recommendation -- 1 Introduction -- 2 Method -- 2.1 Graph Construction -- 2.2 Collaborative Learning -- 2.3 Feature Fusion -- 3 Experiment -- 3.1 Experimental Setup -- 3.2 Experimental Results -- 3.3 Ablation Studies -- 4 Conclusion -- References -- Grassmann Graph Embedding for Few-Shot Class Incremental Learning -- 1 Introduction -- 2 Related Work -- 3 The Proposed Method -- 3.1 Problem Definition -- 3.2 Overview -- 3.3 Grassmann Manifold Embedding -- 3.4 Graph Structure Preserving on Grassmann Manifold -- 4 Experiment -- 4.1 Experimental Setup -- 4.2 Comparison with State-of-the-Art Methods -- 5 Conclusion -- References -- Global Variational Convolution Network for Semi-supervised Node Classification on Large-Scale Graphs -- 1 Introduction -- 2 Related Work -- 3 Proposed Methods -- 3.1 Positive Pointwise Mutual Information on Large-Scale Graphs -- 3.2 Global Variational Aggregation -- 3.3 Variational Convolution Kernels -- 4 Experiments -- 4.1 Comparison Experiments -- 4.2 Ablation Study -- 4.3 Runtime Study -- 5 Conclusion -- References -- Frequency Domain Distillation for Data-Free Quantization of Vision Transformer -- 1 Introduction -- 2 Related Work -- 2.1 Vision Transformer (ViT) -- 2.2 Network Quantization -- 3 Preliminaries -- 3.1 Quantizer. 3.2 Fast Fourier Transform (FFT) and Frequency Domain -- 4 Method -- 4.1 Our Insights -- 4.2 Frequency Domain Distillation -- 4.3 The Overall Pipeline -- 5 Experimentation -- 5.1 Comparison Experiments -- 5.2 Ablation Study -- 6 Conclusions -- References -- An ANN-Guided Approach to Task-Free Continual Learning with Spiking Neural Networks -- 1 Introduction -- 2 Related Works -- 2.1 Image Generation in SNNs -- 2.2 Continual Learning -- 3 Preliminary -- 3.1 The Referee Module: WGAN -- 3.2 The Player Module: FSVAE -- 4 Methodology -- 4.1 Problem Setting -- 4.2 Overview of Our Model -- 4.3 Adversarial Similarity Expansion -- 4.4 Precise Pruning -- 5 Experimental Results -- 5.1 Dataset Setup -- 5.2 Classification Tasks Under TFCL -- 5.3 The Impact of Different Thresholds and Buffer Sizes -- 5.4 ANN and SNN Under TFCL -- 6 Conclusion -- References -- Multi-adversarial Adaptive Transformers for Joint Multi-agent Trajectory Prediction -- 1 Introduction -- 2 Related Works -- 2.1 Multi-agent Trajectory Prediction -- 2.2 Domain Adaptation -- 3 Proposed Method -- 3.1 Encoder: Processing Multi-aspect Data -- 3.2 Decoder: Generating Multi-modal Trajectories -- 3.3 Adaptation: Learning Doamin Invaint Feature -- 3.4 Loss Function -- 4 Experiments -- 4.1 Dataset -- 4.2 Problem Setting -- 4.3 Evaluation Metrics -- 4.4 Implementation Details -- 4.5 Quantitative Analysis -- 4.6 Ablation Study -- 5 Conclusion -- References -- Enhancing Open-Set Object Detection via Uncertainty-Boxes Identification -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Preliminary -- 3.2 Baseline Setup -- 3.3 Pseudo Proposal Advisor -- 3.4 Uncertainty-Box Detection -- 4 Experiment -- 4.1 Experimental Setup -- 4.2 Comparison with Other Methods -- 4.3 Ablation Studies -- 4.4 Visualization and Qualitative Analysis -- 5 Conclusions -- References. Interventional Supervised Learning for Person Re-identification.
Record Nr.	UNINA-9910799218703321