Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XVI / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XVI / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (585 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems |
| ISBN | 3-031-72640-5 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | Diffusion Model is a Good Pose Estimator from 3D RF-Vision -- UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues -- Learning 3D-aware GANs from Unposed Images with Template Feature Field -- TAPTR: Tracking Any Point with Transformers as Detection -- Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning -- Point-supervised Panoptic Segmentation via Estimating Pseudo Labels from Learnable Distance -- BRAVE: Broadening the visual encoding of vision-language models -- HUMOS: Human Motion Model Conditioned on Body Shape -- Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields -- MVDiffHD: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction -- FlowCon: Out-of-Distribution Detection using Flow-based Contrastive Learning -- LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation -- Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation -- Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration -- CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians -- Bayesian Evidential Deep Learning for Online Action Detection -- AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation -- Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather -- Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction -- Memory-Efficient Fine-Tuning for Quantized Diffusion Model -- VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing -- MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model -- Human Hair Reconstruction with Strand-Aligned 3D Gaussians -- COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation -- SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders -- Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection -- Global-to-Pixel Regression for Human Mesh Recovery. |
| Record Nr. | UNINA-9910983303103321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part V / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part V / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (563 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems |
| ISBN | 3-031-72652-9 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark -- AttnZero: Efficient Attention Discovery for Vision Transformers -- Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search -- Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search -- UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation -- TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning -- Spectral Subsurface Scattering for Material Classification -- nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding -- Dynamic Neural Radiance Field From Defocused Monocular Video -- PiTe: Pixel-Temporal Alignment for Large Video-Language Model -- CarFormer: Self-Driving with Learned Object-Centric Representations -- FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models -- Plain-Det: A Plain Multi-Dataset Object Detector -- Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation -- Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation -- Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching -- Text-Guided Video Masked Autoencoder -- Diffusion Models for Open-Vocabulary Segmentation -- Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation -- EvSign: Sign Language Recognition and Translation with Streaming Events -- QUAR-VLA: Vision-Language-Action Model for Quadruped Robots -- Zero-shot Object Counting with Good Exemplars -- TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering -- SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds -- PartSTAD: 2D-to-3D Part Segmentation Task Adaptation -- FutureDepth: Learning to Predict the Future Improves Video Depth Estimation -- LLM as Copilot for Coarse-grained Vision-and-Language Navigation. |
| Record Nr. | UNINA-9910983491603321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXXXIII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXXXIII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (565 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems |
| ISBN |
9783031730108
3031730100 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation -- Efficient Training with Denoised Neural Weights -- Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning -- Integration of Global and Local Representations for Fine-grained Cross-modal Alignment -- Local and Global Flatness for Federated Domain Generalization -- SRPose: Two-view Relative Pose Estimation with Sparse Keypoints -- Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models -- Paying More Attention to Images: A Training-Free Method for Alleviating Hallucination in LVLMs -- Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer. -- Implicit Neural Models to Extract Heart Rate from Video -- Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering -- PFGS: High Fidelity Point Cloud Rendering via Feature Splatting -- Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation -- E3M: Zero-Shot Spatio-Temporal Video Grounding with Expectation-Maximization Multimodal Modulation -- EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions -- LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement -- Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs -- Efficient Vision Transformers with Partial Attention -- Generalized Coverage for More Robust Low-Budget Active Learning -- Rasterized Edge Gradients: Handling Discontinuities Differentially -- Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment -- FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning -- LLaVA-UHD: an LMM Perceiving any Aspect Ratio and High-Resolution Images -- Learning Natural Consistency Representation for Face Forgery Video Detection -- ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video -- Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems -- R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model. |
| Record Nr. | UNINA-9910983319103321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XXX / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XXX / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (571 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems Visió per ordinador Reconeixement de formes (Informàtica) |
| Soggetto genere / forma |
Congressos
Llibres electrònics |
| ISBN | 3-031-73404-1 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | SemanticHuman-HD: High Resolution Semantic disentangled 3D Human Generation -- CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians -- Monocular Occupancy Prediction for Scalable Indoor Scenes -- Visual Grounding for Object-Level Generalization in Reinforcement Learning -- 3DEgo: 3D Editing on the Go! -- Efficient Depth-Guided Urban View Synthesis -- Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model -- Domain-adaptive Video Deblurring via Test-time Blurring -- Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures -- NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving -- OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing -- Progressive Pretext Task Learning for Human Trajectory Prediction -- Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM -- Isomorphic Pruning for Vision Models -- Attention Prompting on Image for Large Vision-Language Models -- Learning Cross-hand Policies of High-DOF Reaching and Grasping -- Reprojection Errors as Prompts for Efficient Scene Coordinate Regression -- Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning -- Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment -- REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models -- DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing -- VideoClusterNet: Self-Supervised and Adaptive Face Clustering for Videos -- Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients -- Controlling the World by Sleight of Hand -- Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack -- Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection -- Cross-Domain Learning for Video Anomaly Detection with Limited Supervision. |
| Record Nr. | UNINA-9910983306703321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXXXVIII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXXXVIII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (596 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks Machine learning Computers, Special purpose User interfaces (Computer systems) Human-computer interaction Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks Machine Learning Special Purpose and Application-Based Systems User Interfaces and Human Computer Interaction |
| ISBN |
9783031732232
3031732235 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions -- InstructGIE: Towards Generalizable Image Editing -- HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation -- Navigating Text-to-Image Generative Bias across Indic Languages -- Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning -- CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models -- Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation -- VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation -- A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation -- Towards Scene Graph Anticipation -- Non-Line-of-Sight Estimation of Fast Human Motion with Slow Scanning Imagers -- Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding -- NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration -- Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models -- Image Manipulation Detection With Implicit Neural Representation and Limited Supervision -- Scalar Function Topology Divergence: Comparing Topology of 3D Objects -- Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks -- Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models -- DeTra: A Unified Model for Object Detection and Trajectory Forecasting -- ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems -- Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction -- Common Sense Reasoning for Deep Fake Detection -- Let the Avatar Talk using Texts without Paired Training Data -- NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields -- GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning -- Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks -- AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale. |
| Record Nr. | UNINA-9910983357803321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LIV / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LIV / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (581 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks Machine learning Computers, Special purpose User interfaces (Computer systems) Human-computer interaction Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks Machine Learning Special Purpose and Application-Based Systems User Interfaces and Human Computer Interaction |
| ISBN | 3-031-72949-8 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Record Nr. | UNINA-9910983390503321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XIX / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XIX / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (540 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems |
| ISBN |
9783031726552
3031726553 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation -- AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling -- SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models -- Quantized Prompt for Efficient Generalization of Vision-Language Models -- Online Temporal Action Localization with Memory-Augmented Transformer -- Efficient Cascaded Multiscale Adaptive Network for Image Restoration -- MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model -- Occlusion-Aware Seamless Segmentation -- OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection -- Referring Atomic Video Action Recognition -- Agent3D-Zero: An Agent for Zero-shot 3D Understanding -- Stream Query Denoising for Vectorized HD-Map Construction -- SAGS: Structure-Aware 3D Gaussian Splatting -- Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval -- OneRestore: A Universal Restoration Framework for Composite Degradation -- Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation -- SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks -- Bag of Tricks to Boost Adversarial Transferability -- RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency -- Pixel-GS Density Control with Pixel-aware Gradient for 3D Gaussian Splatting -- WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation -- A Unified Framework for Gradient-based Saliency Map Generation of Black-box Models -- Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance -- COIN-Matting: Confounder Intervention for Image Matting -- SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding -- Audio-driven Talking Face Generation with Stabilized Synchronization Loss -- Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos. |
| Record Nr. | UNINA-9910983050603321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXV / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXV / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (570 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems |
| ISBN |
9783031736506
3031736508 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | MRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning -- Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs -- TrafficNight : An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance -- Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing -- Towards Open Domain Text-Driven Synthesis of Multi-Person Motions -- Generative End-to-End Autonomous Driving -- Learning to Distinguish Samples for Generalized Category Discovery -- COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark -- PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning -- Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem -- WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning -- Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice -- Encapsulating Knowledge in One Prompt -- Cross-Input Certified Training for Universal Perturbations -- Visual Relationship Transformation -- Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data -- Delving into Adversarial Robustness on Document Tampering Localization -- Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing -- Confidence-Based Iterative Generation for Real-World Image Super-Resolution -- Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy -- Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection -- Seeing Faces in Things: A Model and Dataset for Pareidolia -- Cocktail Universal Adversarial Attack on Deep Neural Networks -- Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering -- AMD: Automatic Multi-step Distillation of Large-scale Vision Models -- FairViT: Fair Vision Transformer via Adaptive Masking -- TrojVLM: Backdoor Attack Against Vision Language Models. |
| Record Nr. | UNINA-9910983483203321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XLII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XLII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (583 pages) |
| Disciplina | 006 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems |
| ISBN | 3-031-72946-3 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | Open-Set Recognition in the Age of Vision-Language Models -- Unsqueeze [CLS] Bottleneck to Learn Rich Representations -- Robust Multimodal Learning via Representation Decoupling -- Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models -- WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing -- Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation -- VeCLIP: Improving CLIP Training via Visual-enriched Captions -- Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks -- Learning Representations from Foundation Models for Domain Generalized Stereo Matching -- Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction -- Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer -- Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts -- Event-Adapted Video Super-Resolution -- Look Hear: Gaze Prediction for Speech-directed Human Attention -- Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching -- Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge -- Catastrophic Overfitting: A Potential Blessing in Disguise -- Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework -- SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models -- Visual Alignment Pre-training for Sign Language Translation -- Parrot Captions Teach CLIP to Spot Text -- Solving Motion Planning Tasks with a Scalable Generative Model -- Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models -- Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment -- Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation -- BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow -- Diffusion Reward: Learning Rewards via Conditional Video Diffusion. |
| Record Nr. | UNINA-9910983329703321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||
Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXIII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
| Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXIII / / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Autore | Leonardis Aleš |
| Edizione | [1st ed. 2025.] |
| Pubbl/distr/stampa | Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 |
| Descrizione fisica | 1 online resource (572 pages) |
| Disciplina | 006.37 |
| Altri autori (Persone) |
RicciElisa
RothȘtefan RussakovskyOlga SattlerTorsten VarolGül |
| Collana | Lecture Notes in Computer Science |
| Soggetto topico |
Image processing - Digital techniques
Computer vision Image processing Computer networks User interfaces (Computer systems) Human-computer interaction Machine learning Computers, Special purpose Computer Imaging, Vision, Pattern Recognition and Graphics Image Processing Computer Communication Networks User Interfaces and Human Computer Interaction Machine Learning Special Purpose and Application-Based Systems |
| ISBN |
9783031730368
3031730364 |
| Formato | Materiale a stampa |
| Livello bibliografico | Monografia |
| Lingua di pubblicazione | eng |
| Nota di contenuto | Large-scale Reinforcement Learning for Diffusion Models -- CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion -- FedHARM: Harmonizing Model Architectural Diversity in Federated Learning -- EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS -- Global Counterfactual Directions -- TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving -- RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark -- EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models -- RICA^2: Rubric-Informed, Calibrated Assessment of Actions -- Region-centric Image-Language Pretraining for Open-Vocabulary Detection -- Commonly Interesting Images -- Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities -- CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching -- Caltech Aerial RGB-Thermal Dataset in the Wild -- Diffusion Soup: Model Merging for Text-to-Image Diffusion Models -- Volumetric Rendering with Baked Quadrature Fields -- CityGuessr: City-Level Video Geo-Localization on a Global Scale -- Pseudo-Labelling Should Be Aware of Disguising Channel Activations -- Bayesian Detector Combination for Object Detection with Crowdsourced Annotations -- Revising Densification in Gaussian Splatting -- FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing -- Smoothness, Synthesis, and Sampling: Re-thinking Unsupervised Multi-View Stereo with DIV Loss -- Text Motion Translator: A Bi-Directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions -- UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation -- PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis -- R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding -- A Graph-Based Approach for Category-Agnostic Pose Estimation. |
| Record Nr. | UNINA-9910983335503321 |
Leonardis Aleš
|
||
| Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025 | ||
| Lo trovi qui: Univ. Federico II | ||
| ||