LEADER 06086nam 22008535 450 001 996630866203316 005 20250626165057.0 010 $a9783031729959 010 $a3031729951 024 7 $a10.1007/978-3-031-72995-9 035 $a(MiAaPQ)EBC31803898 035 $a(Au-PeEL)EBL31803898 035 $a(CKB)36672600900041 035 $a(DE-He213)978-3-031-72995-9 035 $a(OCoLC)1474243993 035 $a(EXLCZ)9936672600900041 100 $a20241124d2025 u| 0 101 0 $aeng 135 $aurcnu|||||||| 181 $ctxt$2rdacontent 182 $cc$2rdamedia 183 $acr$2rdacarrier 200 10$aComputer Vision ? ECCV 2024 $e18th European Conference, Milan, Italy, September 29?October 4, 2024, Proceedings, Part XLV /$fedited by Ale? Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol 205 $a1st ed. 2025. 210 1$aCham :$cSpringer Nature Switzerland :$cImprint: Springer,$d2025. 215 $a1 online resource (579 pages) 225 1 $aLecture Notes in Computer Science,$x1611-3349 ;$v15103 311 08$a9783031729942 311 08$a3031729943 327 $aKFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter -- Physical-Based Event Camera Simulator -- V-IRL: Grounding Virtual Intelligence in Real Life -- Adversarial Prompt Tuning for Vision-Language Models -- Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing -- Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation -- CC-SAM: Enhancing SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation -- An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding -- Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-v2) -- PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion -- X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning -- Learning Neural Volumetric Pose Features for Camera Localization -- Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation -- REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices -- Self-Training Room Layout via Geometry-aware Ray-casting -- Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback -- Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective -- Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization -- ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model -- Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach -- Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration -- When Fast Fourier Transform Meets Transformer for Image Restoration -- Dolphins: Multimodal Language Model for Driving -- Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model -- CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection -- Placing Objects in Context via Inpainting for Out-of-distribution Segmentation -- Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents. 330 $aThe multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29?October 4, 2024. The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation. 410 0$aLecture Notes in Computer Science,$x1611-3349 ;$v15103 606 $aImage processing$xDigital techniques 606 $aComputer vision 606 $aImage processing 606 $aComputer networks 606 $aUser interfaces (Computer systems) 606 $aHuman-computer interaction 606 $aMachine learning 606 $aComputers, Special purpose 606 $aComputer Imaging, Vision, Pattern Recognition and Graphics 606 $aImage Processing 606 $aComputer Communication Networks 606 $aUser Interfaces and Human Computer Interaction 606 $aMachine Learning 606 $aSpecial Purpose and Application-Based Systems 615 0$aImage processing$xDigital techniques. 615 0$aComputer vision. 615 0$aImage processing. 615 0$aComputer networks. 615 0$aUser interfaces (Computer systems) 615 0$aHuman-computer interaction. 615 0$aMachine learning. 615 0$aComputers, Special purpose. 615 14$aComputer Imaging, Vision, Pattern Recognition and Graphics. 615 24$aImage Processing. 615 24$aComputer Communication Networks. 615 24$aUser Interfaces and Human Computer Interaction. 615 24$aMachine Learning. 615 24$aSpecial Purpose and Application-Based Systems. 676 $a006.37 700 $aLeonardis$b Ales?$01757791 701 $aRicci$b Elisa$0216674 701 $aRoth$b S?tefan$00 701 $aRussakovsky$b Olga$01767663 701 $aSattler$b Torsten$01767664 701 $aVarol$b Gül$01767665 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a996630866203316 996 $aComputer Vision ? ECCV 2024$94213977 997 $aUNISA