Record ID: 9910483218903321
Record last modified: 2020-05-20
Title: Machine learning for multimodal interaction : first international workshop, MLMI 2004, Martigny, Switzerland, June 21-23, 2004 : revised selected papers / Samy Bengio, Herve Bourlard (eds.)
Other title: MLMI 2004
Edition: 1st ed. 2005.
Published: Berlin ; New York : Springer, c2005
Description: 1 online resource (XII, 362 p.)
Series: Lecture notes in computer science, 0302-9743 ; 3361
Language: eng
ISBN: 3-540-30568-8
ISBN (print): 3-540-24509-X
DOI: 10.1007/b105752
System control numbers: (CKB)1000000000212713; (SSID)ssj0000195099; (PQKBManifestationID)11183930; (PQKBTitleCode)TC0000195099; (PQKBWorkID)10241871; (PQKB)10086167; (DE-He213)978-3-540-30568-2; (MiAaPQ)EBC3068428; (PPN)123091764; (EXLCZ)991000000000212713

Notes: Bibliographic Level Mode of Issuance: Monograph. Includes bibliographical references and index.

Contents: MLMI 2004 -- Accessing Multimodal Meeting Data: Systems, Problems and Possibilities -- Browsing Recorded Meetings with Ferret -- Meeting Modelling in the Context of Multimodal Research -- Artificial Companions -- Zakim – A Multimodal Software System for Large-Scale Teleconferencing -- Towards Computer Understanding of Human Interactions -- Multistream Dynamic Bayesian Network for Meeting Segmentation -- Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives -- An Integrated Framework for the Management of Video Collection -- The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing -- S-SEER: Selective Perception in a Multimodal Office Activity Recognition System -- Mapping from Speech to Images Using Continuous State Space Models -- An Online Algorithm for Hierarchical Phoneme Classification -- Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks -- Mixture of SVMs for Face Class Modeling -- AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking -- The 2004 ICSI-SRI-UW Meeting Recognition System -- On the Adequacy of Baseform Pronunciations and Pronunciation Variants -- Tandem Connectionist Feature Extraction for Conversational Speech Recognition -- Long-Term Temporal Features for Conversational Speech Recognition -- Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation -- Speech Transcription and Spoken Document Retrieval in Finnish -- A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System -- Shallow Dialogue Processing Using Machine Learning Algorithms (or Not) -- ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings -- Piecing Together the Emotion Jigsaw -- Emotion Analysis in Man-Machine Interaction Systems -- A Hierarchical System for Recognition, Tracking and Pose Estimation -- Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques -- A Shape Based, Viewpoint Invariant Local Descriptor.

Subjects: Machine learning -- Congresses; Human-computer interaction -- Congresses; Machine learning; Human-computer interaction
Dewey classification: 006.3/1
Editors: Bengio, Samy (1688709); Bourlard, Herve, 1956- (1758769)
Corporate body: Workshop on Machine Learning for Multimodal Interaction
Cataloging source: MiAaPQ
Format: BOOK
Uniform title: Machine learning for multimodal interaction (4197021)
Catalog: UNINA