top

  Info

  • Utilizzare la checkbox di selezione a fianco di ciascun documento per attivare le funzionalità di stampa, invio email, download nei formati disponibili del (i) record.

  Info

  • Utilizzare questo link per rimuovere la selezione effettuata.
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part III / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part III / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (552 pages)
Disciplina 006.37
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Computer vision
Image processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Computer Vision
Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-53311-9
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Global-to-Local Feature Mining Network for RGB-Infrared Person Re-Identification -- Semantic Transition Detection for Self-Supervised Vide Scene Segmentation -- Multi-Task Collaborative Network for Image-text Retrieval -- FGENet:Fine-Grained Extraction Network for Congested Crowd Counting -- MSMV-UNet : A 2.5D Stroke Lesion Segmentation Method based on Multi-slice Feature Fusion -- Non-Local Spatial-Wise and Global Channel-Wise Transformer for Efficient Image Super-Resolution -- MobileViT-FocR: MobileViT with Fixed-One-Centre Loss and Gradient Reversal for Generalised Fake Face Detection -- ASF-Conformer: Audio Scoring Conformer with FFC for Speaker Verification in Noisy Environments -- Prior-Knowledge-Free Video Frame Interpolation with Bidirectional Regularized Implicit Neural Representations -- Two-Stage Reasoning Network with Modality Decomposition for Text VQA -- Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos -- Co-speech Gesture Generation with Variational Auto Encoder -- Differentiable Neural Architecture Search Based on Efficient Architecture for Lightweight Image Super-Resolution -- Learning Collaborative Reinforcement Attention for 3D Face Reconstruction and Dense Alignment -- Exploring Multi-Modal Fusion for Image Manipulation Detection and Localization -- Object-based Spatio-Temporal Heterogeneous Network for VideoQA -- Adaptive Token Selection and Fusion Network for Multimodal Sentiment Analysis -- Exploring Imperceptible Adversarial Examples in YCbCr Color Space -- Fractional-order image moments and applications -- Time-Quality Tradeoff of MuseHash Query Processing Performance -- Dual-Fisheye Image Stitching via Unsupervised Deep Learning -- CA-GAN: Conditional Adaptive Generative Adversarial Network for Text-to-Image Synthesis -- RDC-YOLOv5:Improved Safety Helmet Detection in Adverse Weather -- Sustainable Commercial Fishery Control using Multimedia Forensics Data from Non-trusted, Mobile Edge Nodes -- MC-TCMNER: A Multi-Modal Fusion Model Combining Contrast Learning Method for Traditional Chinese Medicine NER -- C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds -- Pseudo-label based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval -- DFGait: Decomposition Fusion Representation Learning for Multimodal Gait Recognition -- MoPE: Mixture of Pooling Experts Framework for Image-Text Retrieval -- Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation -- Unsupervised Multi-Collaborative Learning Network for 3D Face Reconstruction -- A Region Based Non-overlapping Reference Speech Estimation Method for Speaker Extraction -- Self-Supervised Edge Structure Learning for Multi-View Stereo and Parallel Optimization -- Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks -- A Language-based solution to enable Metaverse Retrieval -- Part-aware Prompt Tuning For Weakly Supervised Referring Expression Grounding -- Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning -- A Multidimensional Taxonomy Model for Music Tangible User Interfaces.
Record Nr. UNINA-9910806193503321
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part I / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part I / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (523 pages)
Disciplina 006.37
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Computer vision
Image processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Computer Vision
Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-53305-4
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Where are Biases? Adversarial Debiasing with Spurious Feature Visualization -- Cross-Modal Hash Retrieval with Category Semantics -- Spatiotemporal Representation Enhanced ViT for Video Recognition -- SCFormer: A Vision Transformer with Split Channel in Sitting Posture Recognition -- Dive into Coarse-to-Fine Strategy in Single Image Deblurring -- TICondition: Expanding Control Capabilities for Text-to-Image Generation with Multi-Modal Conditions -- Enhancing Generative Generalized Zero Shot Learning via Multi-Space Constraints and Adapative Integration -- Joint Image Data Hiding and Rate-Distortion Optimization in Neural Compressed Latent Representations -- GSUNet: A Brain Tumor Segmentation Method Based On 3D Ghost Shuffle U-Net -- ACT: Action-associated and Target-related Representations for Object Navigation -- Foreground Feature Enhancement and Peak & Background Suppression for Fine-Grained Visual Classification -- YOLOv5-SRR: Enhancing YOLOv5 for Effective Underwater Target Detection -- Image Clustering and Generation with HDGMVAE-I -- “Car or Bus?" CLearSeg: CLIP-enhanced Discrimination among Resembling Classes for Few-Shot Semantic Segmentation -- PANDA: Prompt-based Context- and Indoor-aware Pretraining for Vision and Language Navigation -- Cross-Modal Semantic Alignment Learning for Text-based Person Search -- Point Cloud Classification via Learnable Memory Bank -- Adversarially Regularized Low-Light Image Enhancement -- Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided Relation Alignment and Adaptation -- PMGCN:Preserving measuring mapping prototype graph calibration network for few-shot learning -- ARE-CAM: An interpretable approach to quantitatively evaluating the adversarial robustness of deep models based on CAM -- SSK-Yolo:Global feature-driven small object detection network for images -- MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification -- From Skulls to Faces: A Deep Generative Framework for Realistic 3D Craniofacial Reconstruction -- Structure-aware Adaptive Hybrid Interaction Modeling for Image-Text Matching -- Using Saliency and Cropping to Improve Video Memorability -- Contextual Augmentation with Bias Adaptive for Few-shot Video Object Segmentation -- A lightweight local attention network for image super resolution -- Domain Adaptation for Speaker Verification Based on Self-Supervised Learning with Adversarial Training -- Quality Scalable Video Coding based on Neural Representation -- Hierarchical Bi-Directional Temporal Context Mining for Improved Video Compression -- MAMixer: Multivariate Time Series Forecasting via Multi-Axis Mixing -- A Custom GAN-based Robust Algorithm for Medical Image Watermarking -- A Detail-guided Multi-source Fusion Network for Remote Sensing Object Detection -- A Secure and Fair Federated Learning Protocol under the Universal Composability Framework -- Bi-directional Interaction and Dense Aggregation Network for RGB-D Salient Object Detection -- Face Forgery Detection via Texture and Saliency Enhancement.
Record Nr. UNINA-9910806199003321
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part II / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part II / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (540 pages)
Disciplina 006.37
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Computer vision
Image processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Computer Vision
Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-53308-9
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Self-distillation Enhanced Vertical Wavelet Spatial Attention for Person Re-identification -- High Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Value Preprocessing and Block Classification -- HPattack: An Effective Adversarial Attack for Human Parsing -- Dynamic-Static Graph Convolutional Network for Video-Based Facial Expression Recognition -- Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis -- Semantic Importance-Based Deep Image Compression Using A Generative Approach -- Drive-CLIP: Cross-modal Contrastive Safety-Critical Driving Scenario Representation Learning and Zero-shot Driving Risk Analysis -- MRHF: Multi-stage Retrieval and Hierarchical Fusion for Textbook Question Answering -- Multi-scale Decomposition Dehazing with Polarimetric Vision -- CLF-Net: A Few-shot Cross-Language Font Generation Method -- Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation -- Audio-Visual Segmentation By Leveraging Multi-Scaled Features Learning -- Multi-head Hashing with Orthogonal Decomposition for Cross-modal Retrieval -- Fusion Boundary and Gradient Enhancement Network for Camouflage Object Detection -- Find the Cliffhanger: Multi-Modal Trailerness in Soap Operas -- SM-GAN: Single-stage and Multi-object Text Guided Image Editing -- MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-end Speaker Extraction -- NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images -- Improving Small License Plate Detection with Bidirectional Vehicle-plate Relation -- A Purified Stacking Ensemble Framework for Cytology Classification -- SEAS-Net: Segment Exchange Augmentation for Semi-Supervised Brain Tumor Segmentation -- Super-Resolution-Assisted Feature Refined Extraction for Small Objects in Remote Sensing Images -- Lightweight Image Captioning Model Based on Knowledge Distillation -- Irregular License Plate Recognition via Global Information Integration -- TNT-Net: Point Cloud Completion by Transformer in Transformer -- Fourier Transformer for Joint Super-Resolution and Reconstruction of Mr Image -- MVD-NeRF: Resolving Shape-Radiance Ambiguity via Mitigating View Dependency -- DPM-Det: Diffusion Model Object Detection Based on DPM-Solver++ Guided Sampling -- CT-MVSNet: Efficient Multi-View Stereo with Cross-scale Transformer -- A Coarse and Fine Grained Masking Approach for Video-grounded Dialogue -- Deep self-supervised subspace clustering with triple loss -- LigCDnet:Remote Sensing Image Cloud Detection Based on Lightweight Framework -- Gait Recognition Based on Temporal Gait Information Enhancing -- Learning Complementary Instance Representation with Parallel Adaptive Graph-Based Network for Action Detection -- CESegNet:Context-Enhancement Semantic Segmentation Network Based on Transformer -- MoCap-Video Data Retrieval with Deep Cross-Modal Learning -- LRATNet: Local-Relationship-Aware Transformer Network for Table Structure Recognition.
Record Nr. UNINA-9910806191303321
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part IV / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part IV / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (419 pages)
Disciplina 006.37
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Computer vision
Image processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Computer Vision
Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-53302-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto FMM: Special Session on Foundation Models for Multimedia -- Removing Stray-Light for Wild-Field Fundus Image Fusion based on Large Generative Models -- Training-free Region Prediction with Stable Diffusion -- Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites -- GDTNet: A Synergistic Dilated Transformer and CNN by Gate Attention for Abdominal Multi-organ Segmentation -- Fine-Grained Multi-Modal Fundus Image Generation Based on Diffusion Models for Glaucoma Classification -- Adapting Pretrained Large-Scale Vision Models for Face Forgery Detection -- ICDAR: Special Session on Intelligent Cross-Data Analysis and Retrieval -- Towards Cross-modal Point Cloud Retrieval for Indoor Scenes -- Correlation visualization under missing values: a comparison between imputation and direct parameter estimation methods -- IFI: Interpreting for Improving: a Multimodal Transformer with an Interpretability Technique for Recognition of Risk Events -- OOKPIK - A Collection of Out-of-Context Image-Caption Pairs -- LUMOS-DM: Landscape-based Multimodal Scene Retrieval Enhanced by Diffusion Model -- XR-MACCI: Special Session on eXtended Reality and Multimedia - Advancing Content Creation and Interaction -- Mining Landmark Images for Scene Reconstruction from Weakly Annotated Video Collections -- A framework for 3D modeling of construction sites using aerial imagery and semantic NeRFs -- Multimodal 3D Object Retrieval -- An Integrated System for Spatio-Temporal Summarization of 360-degrees Videos -- Brave New Ideas -- Mutant Texts: A Technique for Uncovering Unexpected Inconsistencies in Large-Scale Vision-language Models -- Exploring Artificial Intelligence for Advancing Performance Processes and Events in Io3MT -- Demonstrations -- Implementation of Melody Slot Machines -- E2Evideo: End to End Video and Image Pre-processing and Analysis Tool -- Augmented Reality Photo Presentation and Content-based Image Retrieval on Mobile Devices with AR-Explorer -- Augmented Reality Photo Presentation and Content-based Image Retrieval on Mobile Devices with AR-Explorer -- AI-Based Cropping of Soccer Videos for Different Social Media Representations -- Few-shot Object Detection as a Service: Facilitating Training and Deployment for Domain Experts -- DatAR: Supporting Neuroscience Literature Exploration by Finding Relations between Topics in Augmented Reality -- EmoAda:A Multimodal Emotion Interaction and Psychological Adaptation System -- Video Browser Showdown -- Waseda Meisei SoftBank at Video Browser Showdown 2024 -- Exploring Multimedia Vector Spaces with vitrivr-VR -- A new Retrieval Engine for vitrivr -- VISIONE 5.0: Enhanced User Interface and AI Models for VBS2024 -- PraK Tool: An Interactive Search Tool Based on Video Data Services -- Exquisitor at the Video Browser Showdown 2024: Relevance Feedback Meets Conversational Search -- VERGE in VBS 2024 -- Optimizing the Interactive Video Retrieval Tool Vibro for the Video Browser Showdown 2024 -- diveXplore at the Video Browser Showdown 2024 -- Leveraging LLMs and Generative Models for Interactive Known-Item Video Search -- TalkSee: Interactive Video Retrieval Engine Using Large Language Model -- VideoCLIP 2: An Interactive CLIP-based Video Retrieval System for Novice Users at VBS2024 -- ViewsInsight: Enhancing Video Retrieval for VBS 2024 with a User-Friendly Interaction Mechanism.
Record Nr. UNINA-9910806196303321
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part I / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part I / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (523 pages)
Disciplina 006.37
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Computer vision
Image processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Computer Vision
Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-53305-4
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Where are Biases? Adversarial Debiasing with Spurious Feature Visualization -- Cross-Modal Hash Retrieval with Category Semantics -- Spatiotemporal Representation Enhanced ViT for Video Recognition -- SCFormer: A Vision Transformer with Split Channel in Sitting Posture Recognition -- Dive into Coarse-to-Fine Strategy in Single Image Deblurring -- TICondition: Expanding Control Capabilities for Text-to-Image Generation with Multi-Modal Conditions -- Enhancing Generative Generalized Zero Shot Learning via Multi-Space Constraints and Adapative Integration -- Joint Image Data Hiding and Rate-Distortion Optimization in Neural Compressed Latent Representations -- GSUNet: A Brain Tumor Segmentation Method Based On 3D Ghost Shuffle U-Net -- ACT: Action-associated and Target-related Representations for Object Navigation -- Foreground Feature Enhancement and Peak & Background Suppression for Fine-Grained Visual Classification -- YOLOv5-SRR: Enhancing YOLOv5 for Effective Underwater Target Detection -- Image Clustering and Generation with HDGMVAE-I -- “Car or Bus?" CLearSeg: CLIP-enhanced Discrimination among Resembling Classes for Few-Shot Semantic Segmentation -- PANDA: Prompt-based Context- and Indoor-aware Pretraining for Vision and Language Navigation -- Cross-Modal Semantic Alignment Learning for Text-based Person Search -- Point Cloud Classification via Learnable Memory Bank -- Adversarially Regularized Low-Light Image Enhancement -- Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided Relation Alignment and Adaptation -- PMGCN:Preserving measuring mapping prototype graph calibration network for few-shot learning -- ARE-CAM: An interpretable approach to quantitatively evaluating the adversarial robustness of deep models based on CAM -- SSK-Yolo:Global feature-driven small object detection network for images -- MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification -- From Skulls to Faces: A Deep Generative Framework for Realistic 3D Craniofacial Reconstruction -- Structure-aware Adaptive Hybrid Interaction Modeling for Image-Text Matching -- Using Saliency and Cropping to Improve Video Memorability -- Contextual Augmentation with Bias Adaptive for Few-shot Video Object Segmentation -- A lightweight local attention network for image super resolution -- Domain Adaptation for Speaker Verification Based on Self-Supervised Learning with Adversarial Training -- Quality Scalable Video Coding based on Neural Representation -- Hierarchical Bi-Directional Temporal Context Mining for Improved Video Compression -- MAMixer: Multivariate Time Series Forecasting via Multi-Axis Mixing -- A Custom GAN-based Robust Algorithm for Medical Image Watermarking -- A Detail-guided Multi-source Fusion Network for Remote Sensing Object Detection -- A Secure and Fair Federated Learning Protocol under the Universal Composability Framework -- Bi-directional Interaction and Dense Aggregation Network for RGB-D Salient Object Detection -- Face Forgery Detection via Texture and Saliency Enhancement.
Record Nr. UNISA-996587863403316
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part IV / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part IV / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (419 pages)
Disciplina 006.37
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Computer vision
Image processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Computer Vision
Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-53302-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto FMM: Special Session on Foundation Models for Multimedia -- Removing Stray-Light for Wild-Field Fundus Image Fusion based on Large Generative Models -- Training-free Region Prediction with Stable Diffusion -- Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites -- GDTNet: A Synergistic Dilated Transformer and CNN by Gate Attention for Abdominal Multi-organ Segmentation -- Fine-Grained Multi-Modal Fundus Image Generation Based on Diffusion Models for Glaucoma Classification -- Adapting Pretrained Large-Scale Vision Models for Face Forgery Detection -- ICDAR: Special Session on Intelligent Cross-Data Analysis and Retrieval -- Towards Cross-modal Point Cloud Retrieval for Indoor Scenes -- Correlation visualization under missing values: a comparison between imputation and direct parameter estimation methods -- IFI: Interpreting for Improving: a Multimodal Transformer with an Interpretability Technique for Recognition of Risk Events -- OOKPIK - A Collection of Out-of-Context Image-Caption Pairs -- LUMOS-DM: Landscape-based Multimodal Scene Retrieval Enhanced by Diffusion Model -- XR-MACCI: Special Session on eXtended Reality and Multimedia - Advancing Content Creation and Interaction -- Mining Landmark Images for Scene Reconstruction from Weakly Annotated Video Collections -- A framework for 3D modeling of construction sites using aerial imagery and semantic NeRFs -- Multimodal 3D Object Retrieval -- An Integrated System for Spatio-Temporal Summarization of 360-degrees Videos -- Brave New Ideas -- Mutant Texts: A Technique for Uncovering Unexpected Inconsistencies in Large-Scale Vision-language Models -- Exploring Artificial Intelligence for Advancing Performance Processes and Events in Io3MT -- Demonstrations -- Implementation of Melody Slot Machines -- E2Evideo: End to End Video and Image Pre-processing and Analysis Tool -- Augmented Reality Photo Presentation and Content-based Image Retrieval on Mobile Devices with AR-Explorer -- Augmented Reality Photo Presentation and Content-based Image Retrieval on Mobile Devices with AR-Explorer -- AI-Based Cropping of Soccer Videos for Different Social Media Representations -- Few-shot Object Detection as a Service: Facilitating Training and Deployment for Domain Experts -- DatAR: Supporting Neuroscience Literature Exploration by Finding Relations between Topics in Augmented Reality -- EmoAda:A Multimodal Emotion Interaction and Psychological Adaptation System -- Video Browser Showdown -- Waseda Meisei SoftBank at Video Browser Showdown 2024 -- Exploring Multimedia Vector Spaces with vitrivr-VR -- A new Retrieval Engine for vitrivr -- VISIONE 5.0: Enhanced User Interface and AI Models for VBS2024 -- PraK Tool: An Interactive Search Tool Based on Video Data Services -- Exquisitor at the Video Browser Showdown 2024: Relevance Feedback Meets Conversational Search -- VERGE in VBS 2024 -- Optimizing the Interactive Video Retrieval Tool Vibro for the Video Browser Showdown 2024 -- diveXplore at the Video Browser Showdown 2024 -- Leveraging LLMs and Generative Models for Interactive Known-Item Video Search -- TalkSee: Interactive Video Retrieval Engine Using Large Language Model -- VideoCLIP 2: An Interactive CLIP-based Video Retrieval System for Novice Users at VBS2024 -- ViewsInsight: Enhancing Video Retrieval for VBS 2024 with a User-Friendly Interaction Mechanism.
Record Nr. UNISA-996587863203316
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part V / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part V / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (125 pages)
Disciplina 621.382
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Signal processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Signal, Speech and Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-56435-9
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto RESET: Relational Similarity Extension for V3C1 Video Dataset -- A New Benchmark and OCR-free Method for Document Image Topic Classification -- The Rach3 Dataset: Towards Data-Driven Analysis of Piano Performance Rehearsal -- WikiMuTe: A web-sourced dataset of semantic descriptions for music audio -- PDTW150K: A Dataset for Patent Drawing Retrieval -- Interactive Question Answering for Multimodal Lifelog Retrieval -- Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers -- GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild.
Record Nr. UNINA-9910845096703321
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part V / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
MultiMedia Modeling [[electronic resource] ] : 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part V / / edited by Stevan Rudinac, Alan Hanjalic, Cynthia Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata
Autore Rudinac Stevan
Edizione [1st ed. 2024.]
Pubbl/distr/stampa Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Descrizione fisica 1 online resource (125 pages)
Disciplina 621.382
Altri autori (Persone) HanjalicAlan
LiemCynthia
WorringMarcel
JónssonBjö Þór
LiuBei
YamakataYoko
Collana Lecture Notes in Computer Science
Soggetto topico Signal processing
Pattern recognition systems
Application software
Information storage and retrieval systems
Machine learning
Signal, Speech and Image Processing
Automated Pattern Recognition
Computer and Information Systems Applications
Information Storage and Retrieval
Machine Learning
ISBN 3-031-56435-9
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto RESET: Relational Similarity Extension for V3C1 Video Dataset -- A New Benchmark and OCR-free Method for Document Image Topic Classification -- The Rach3 Dataset: Towards Data-Driven Analysis of Piano Performance Rehearsal -- WikiMuTe: A web-sourced dataset of semantic descriptions for music audio -- PDTW150K: A Dataset for Patent Drawing Retrieval -- Interactive Question Answering for Multimodal Lifelog Retrieval -- Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers -- GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild.
Record Nr. UNISA-996589544903316
Rudinac Stevan  
Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2024
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui