LEADER 08725nam 22007815 450 001 996466276103316 005 20200722084740.0 010 $a3-030-05710-0 024 7 $a10.1007/978-3-030-05710-7 035 $a(CKB)4100000007279092 035 $a(DE-He213)978-3-030-05710-7 035 $a(MiAaPQ)EBC5926632 035 $a(PPN)232963967 035 $a(EXLCZ)994100000007279092 100 $a20181207d2019 u| 0 101 0 $aeng 135 $aurnn#008mamaa 181 $ctxt$2rdacontent 182 $cc$2rdamedia 183 $acr$2rdacarrier 200 10$aMultiMedia Modeling$b[electronic resource] $e25th International Conference, MMM 2019, Thessaloniki, Greece, January 8?11, 2019, Proceedings, Part I /$fedited by Ioannis Kompatsiaris, Benoit Huet, Vasileios Mezaris, Cathal Gurrin, Wen-Huang Cheng, Stefanos Vrochidis 205 $a1st ed. 2019. 210 1$aCham :$cSpringer International Publishing :$cImprint: Springer,$d2019. 215 $a1 online resource (XXVI, 721 p. 260 illus., 233 illus. in color.) 225 1 $aInformation Systems and Applications, incl. Internet/Web, and HCI ;$v11295 300 $aIncludes index. 311 $a3-030-05709-7 327 $aSentiment-aware Multi-modal Recommendation on Tourist Attractions -- SCOD: Dynamical Spatial Constraints for Object Detection -- STMP: Spatial Temporal Multi-level Proposal Network for Activity Detection -- Hierarchical Vision-Language Alignment for Video Captioning -- Task-Driven Biometric Authentication of Users in Virtual Reality (VR) Environments -- Deep Neural Network Based 3D Articulatory Movement Prediction Using Both Text and Audio Inputs -- Subjective Visual Quality Assessment of Immersive 3D Media Compressed by Open-Source Static 3D Mesh Codecs -- Joint EPC and RAN Caching of Tiled VR Videos for Mobile Networks -- Foveated Ray Tracing for VR Headsets -- Preferred Model of Adaptation to Dark for Virtual Reality Headsets -- From Movement to Events: Improving Soccer Match Annotations -- Multimodal Video Annotation for Retrieval and Discovery of Newsworthy Video in a News Verification Scenario -- Integration of Exploration and Search: A Case Study of the M^3 Model -- Face Swapping for Solving Collateral Privacy Issues in Multimedia Analytics -- Exploring the Impact of Training Data Bias on Automatic Generation of Video Captions -- Fashion Police: Towards Semantic Indexing of Clothing Information In Surveillance Data -- CNN-Based Non-Contact Detection of Food Level in Bottles from RGB Images -- Personalized Recommendation of Photography Based on Deep Learning -- Two-level Attention with Multi-task Learning for Facial Emotion Estimation -- User Interaction for Visual Lifelog Retrieval in a Virtual Environment -- Query-by-Dancing: A Dance Music Retrieval System Based on Body-Motion Similarity -- Joint Visual-Textual Sentiment Analysis Based on Cross-modality Attention Mechanism -- Deep Hashing with Triplet Labels and Unification Binary Code Selection for Fast Image Retrieval -- Incremental Training for Face Recognition -- Character Prediction in TV Series via a Semantic Projection Network -- A Test Collection for Interactive Lifelog Retrieval -- SEPHLA: Challenges and Opportunities within Environment ? Personal Health Archives -- Athens Urban Soundscape (ATHUS): A dataset for urban soundscape quality recognition -- V3C - a Research Video Collection -- Image Aesthetics Assessment using Fully Convolutional Neural Networks -- Detecting tampered videos with multimedia forensics and deep learning -- Improving Robustness of Image Tampering Detection for Compression -- Audiovisual annotation procedure for multi-view field recordings -- A Robust Multi-Athlete Tracking Algorithm by Exploiting Discriminant Features and Long-Term Dependencies -- Early Identification of Oil Spills in Satellite Images Using Deep CNNs -- Point Cloud Colorization Based on Densely Annotated 3D Shape Dataset -- evolve2vec: Learning Network Representations Using Temporal Unfolding -- The Impact of Packet Loss and Google Congestion Control on QoE for WebRTC-based Mobile Multiparty Audiovisual Telemeetings -- Hierarchical Temporal Pooling for Efficient Online Action Recognition -- Generative Adversarial Networks with Enhanced Symmetric Residual Units for Single Image Super-Resolution -- 3D ResNets for 3D object classification -- Four Models for Automatic Recognition of Left and Right Eye in Fundus Images -- On the unsolved problem of Shot Boundary Detection for Music Videos -- Enhancing Scene Text Detection via Fused Semantic Segmentation Network with Attention -- Exploiting Incidence Relation Between Subgroups for Improving Clustering-Based Recommendation Model -- Hierarchical Bayesian Network based Incremental Model for Flood Prediction -- A New Female Body Segmentation and Feature Localisation Method for Image-based Anthropometry -- Greedy Salient Dictionary Learning For Activity Video Summarization -- Accelerating Topic Detection on Web for a Large-Scale Data Set via Stochastic Poisson Deconvolution -- Automatic Segmentation of Brain Tumor Images Based on Region Growing with Co-constraint -- Proposal of an Annotation Method for Integrating Musical Technique Knowledge using a GTTM Time-Span Tree -- A hierarchical level set approach to for RGBD image matting -- A Genetic Programming Approach to Integrate Multilayer CNN Features for Image Classification -- Improving Micro-Expression Recognition Accuracy using Twofold Feature Extraction -- An effective dual-fisheye lens stitching method based on feature points -- 3D Skeletal Gesture Recognition via Sparse Coding of Time-Warping Invariant Riemannian Trajectories -- Efficient Graph based Multi-View Leaning -- DANTE speaker recognition module. An efficient and robust automatic speaker searching solution for terrorism-related scenarios. 330 $aThe two-volume set LNCS 11295 and 11296 constitutes the thoroughly refereed proceedings of the 25th International Conference on MultiMedia Modeling, MMM 2019, held in Thessaloniki, Greece, in January 2019. Of the 172 submitted full papers, 49 were selected for oral presentation and 47 for poster presentation; in addition, 6 demonstration papers, 5 industry papers, 6 workshop papers, and 6 papers for the Video Browser Showdown 2019 were accepted. All papers presented were carefully reviewed and selected from 204 submissions. 410 0$aInformation Systems and Applications, incl. Internet/Web, and HCI ;$v11295 606 $aMultimedia information systems 606 $aOptical data processing 606 $aArtificial intelligence 606 $aPattern recognition 606 $aInformation storage and retrieval 606 $aApplication software 606 $aMultimedia Information Systems$3https://scigraph.springernature.com/ontologies/product-market-codes/I18059 606 $aImage Processing and Computer Vision$3https://scigraph.springernature.com/ontologies/product-market-codes/I22021 606 $aArtificial Intelligence$3https://scigraph.springernature.com/ontologies/product-market-codes/I21000 606 $aPattern Recognition$3https://scigraph.springernature.com/ontologies/product-market-codes/I2203X 606 $aInformation Storage and Retrieval$3https://scigraph.springernature.com/ontologies/product-market-codes/I18032 606 $aInformation Systems Applications (incl. Internet)$3https://scigraph.springernature.com/ontologies/product-market-codes/I18040 615 0$aMultimedia information systems. 615 0$aOptical data processing. 615 0$aArtificial intelligence. 615 0$aPattern recognition. 615 0$aInformation storage and retrieval. 615 0$aApplication software. 615 14$aMultimedia Information Systems. 615 24$aImage Processing and Computer Vision. 615 24$aArtificial Intelligence. 615 24$aPattern Recognition. 615 24$aInformation Storage and Retrieval. 615 24$aInformation Systems Applications (incl. Internet). 676 $a006.7 702 $aKompatsiaris$b Ioannis$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aHuet$b Benoit$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aMezaris$b Vasileios$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aGurrin$b Cathal$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aCheng$b Wen-Huang$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aVrochidis$b Stefanos$4edt$4http://id.loc.gov/vocabulary/relators/edt 801 0$bMiAaPQ 801 1$bMiAaPQ 801 2$bMiAaPQ 906 $aBOOK 912 $a996466276103316 996 $aMultiMedia Modeling$92050609 997 $aUNISA