1.

Record Nr.

UNINA9910983078903321

Autore

Leonardis Aleš

Titolo

Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XIV / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol

Pubbl/distr/stampa

Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025

ISBN

9783031726309

3031726308

Edizione

[1st ed. 2025.]

Descrizione fisica

1 online resource (570 pages)

Collana

Lecture Notes in Computer Science, , 1611-3349 ; ; 15072

Altri autori (Persone)

RicciElisa

RothȘtefan

RussakovskyOlga

SattlerTorsten

VarolGül

Disciplina

006.37

Soggetti

Image processing - Digital techniques

Computer vision

Image processing

Computer networks

User interfaces (Computer systems)

Human-computer interaction

Machine learning

Computers, Special purpose

Computer Imaging, Vision, Pattern Recognition and Graphics

Image Processing

Computer Communication Networks

User Interfaces and Human Computer Interaction

Machine Learning

Special Purpose and Application-Based Systems

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Nota di contenuto

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation



-- M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models -- The Hard Positive Truth about Vision-Language Compositionality -- GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing -- Shapefusion: 3D localized human diffusion models -- Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing -- Prompting Language-Informed Distribution for Compositional Zero-Shot Learning -- Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment -- 3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting -- Distribution-Aware Robust Learning from Long-Tailed Data with Noisy Labels -- Free-Viewpoint Video of Outdoor Sports Using a Drone -- Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing -- ConGeo: Robust Cross-view Geo-localization across Ground View Variations -- Generalizable Facial Expression Recognition -- GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views -- Self-Supervised Any-Point Tracking by Contrastive Random Walks -- MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization -- Siamese Vision Transformers are Scalable Audio-visual Learners -- LCM-Lookahead for Encoder-based Text-to-Image Personalization -- Towards Architecture-Agnostic Untrained Networks Priors for Image Reconstruction with Frequency Regularization -- Towards Open-Ended Visual Recognition with Large Language Models -- Ray-Distance Volume Rendering for Neural Scene Reconstruction -- ReNoise: Real Image Inversion Through Iterative Noising -- Attention Decomposition for Cross-Domain Semantic Segmentation -- Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation -- Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework -- RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models.

Sommario/riassunto

The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29–October 4, 2024. The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation. .