1.

Record Nr.

UNINA9911011778903321

Autore

Liu Meng

Titolo

Intelligent and Efficient Video Moment Localization / / by Meng Liu, Yupeng Hu, Weili Guan, Liqiang Nie

Pubbl/distr/stampa

Cham : , : Springer Nature Switzerland : , : Imprint : Springer, , 2025

ISBN

3-031-87588-5

Edizione

[1st ed. 2025.]

Descrizione fisica

1 online resource (229 pages)

Altri autori (Persone)

HuYupeng

GuanWeili

NieLiqiang

Disciplina

006

Soggetti

Image processing - Digital techniques

Computer vision

Artificial intelligence - Data processing

Quantitative research

Multimedia systems

Computer Imaging, Vision, Pattern Recognition and Graphics

Data Science

Data Analysis and Big Data

Multimedia Information Systems

Lingua di pubblicazione

Inglese

Formato

Materiale a stampa

Livello bibliografico

Monografia

Nota di contenuto

Chapter 1: Introduction -- Chapter 2: Semantic Enhanced Video Moment Localization -- Chapter 3: Semantic Alignment Video Moment Localization -- Chapter 4: Semantic Pruning Video Moment Localization -- Chapter 5: Semantic Collaborative Video Moment Localization -- Chapter 6: Weakly-Supervised Video Moment Localization -- Chapter 7: Efficient Hashing based Video Moment Localization -- Chapter 8: Research Frontiers.

Sommario/riassunto

This book provides a comprehensive exploration of video moment localization, a rapidly emerging research field focused on enabling precise retrieval of specific moments within untrimmed, unsegmented videos. With the rapid growth of digital content and the rise of video-sharing platforms, users face significant challenges when searching for



particular content across vast video archives. This book addresses how video moment localization uses natural language queries to bridge the gap between video content and semantic understanding, offering an intuitive solution for locating specific moments across diverse domains like surveillance, education, and entertainment. This book explores the latest advancements in video moment localization, addressing key issues such as accuracy, efficiency, and scalability. It presents innovative techniques for contextual understanding and cross-modal semantic alignment, including attention mechanisms and dynamic query decomposition. Additionally, the book discusses solutions for enhancing computational efficiency and scalability, such as semantic pruning and efficient hashing, while introducing frameworks for better integration between visual and textual data. It also examines weakly-supervised learning approaches to reduce annotation costs without sacrificing performance. Finally, the book covers real-world applications and offers insights into future research directions.