Machine learning for multimodal interaction : 5th international workshop, MLMI 2008, Utrecht, the Netherlands, September 8-10, 2008, proceedings / / Andrei Popescu-Belis, Rainer Stiefelhagen, editors |
Edizione | [1st ed. 2008.] |
Pubbl/distr/stampa | Berlin ; ; Heidelberg : , : Springer-Verlag, , [2008] |
Descrizione fisica | 1 online resource (XII, 364 p.) |
Disciplina | 006.454 |
Collana | Lecture Notes in Computer Science |
Soggetto topico |
Automatic speech recognition
Human-computer interaction Machine learning |
ISBN | 3-540-85853-9 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Face, Gesture and Nonverbal Communication -- Visual Focus of Attention in Dynamic Meeting Scenarios -- Fast and Robust Face Tracking for Analyzing Multiparty Face-to-Face Meetings -- What Does the Face-Turning Action Imply in Consensus Building Communication? -- Distinguishing the Communicative Functions of Gestures -- Optimised Meeting Recording and Annotation Using Real-Time Video Analysis -- Ambiguity Modeling in Latent Spaces -- Audio-Visual Scene Analysis and Speech Processing -- Inclusion of Video Information for Detection of Acoustic Events Using the Fuzzy Integral -- Audio-Visual Clustering for 3D Speaker Localization -- A Hybrid Generative-Discriminative Approach to Speaker Diarization -- A Neural Network Based Regression Approach for Recognizing Simultaneous Speech -- Hilbert Envelope Based Features for Far-Field Speech Recognition -- Multimodal Unit Selection for 2D Audiovisual Text-to-Speech Synthesis -- Social Signal Processing -- Decision-Level Fusion for Audio-Visual Laughter Detection -- Detection of Laughter-in-Interaction in Multichannel Close-Talk Microphone Recordings of Meetings -- Automatic Recognition of Spontaneous Emotions in Speech Using Acoustic and Lexical Features -- Daily Routine Classification from Mobile Phone Data -- Human-Human Spoken Dialogue Processing -- Hybrid Multi-step Disfluency Detection -- Exploring Features and Classifiers for Dialogue Act Segmentation -- Detecting Action Items in Meetings -- Modeling Topic and Role Information in Meetings Using the Hierarchical Dirichlet Process -- Time-Compressing Speech: ASR Transcripts Are an Effective Way to Support Gist Extraction -- Meta Comments for Summarizing Meeting Speech -- HCI and Applications -- A Generic Layout-Tool for Summaries of Meetings in a Constraint-Based Approach -- A Probabilistic Model for User Relevance Feedback on Image Retrieval -- The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings -- Introducing Additional Input Information into Interactive Machine Translation Systems -- Computer Assisted Transcription of Text Images and Multimodal Interaction -- User Requirements and Evaluation of Meeting Browsers and Assistants -- Designing and Evaluating Meeting Assistants, Keeping Humans in Mind -- Making Remote ‘Meeting Hopping’ Work: Assistance to Initiate, Join and Leave Meetings -- Physicality and Cooperative Design -- Developing and Evaluating a Meeting Assistant Test Bed -- Extrinsic Summarization Evaluation: A Decision Audit Task. |
Record Nr. | UNINA-9910484051803321 |
Berlin ; ; Heidelberg : , : Springer-Verlag, , [2008] | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
Machine learning for multimodal interaction : 4th international workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007 : revised selected papers / / Andrei Popescu-Belis, Steve Renals, Hervé Bourlard (editors) |
Edizione | [1st ed. 2008.] |
Pubbl/distr/stampa | Berlin, Germany ; ; New York, New York : , : Springer, , [2008] |
Descrizione fisica | 1 online resource (XI, 308 p.) |
Disciplina | 006.3/1 |
Collana | Information Systems and Applications, incl. Internet/Web, and HCI |
Soggetto topico |
Automatic speech recognition
Human-computer interaction Machine learning |
ISBN | 3-540-78155-2 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Invited Paper -- Robust Real Time Face Tracking for the Analysis of Human Behaviour -- Multimodal Processing -- Conditional Sequence Model for Context-Based Recognition of Gaze Aversion -- Meeting State Recognition from Visual and Aural Labels -- Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers -- HCI, User Studies and Applications -- Automatic Annotation of Dialogue Structure from Simple User Interaction -- Interactive Pattern Recognition -- User Specific Training of a Music Search Engine -- An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing -- Integrating Semantics into Multimodal Interaction Patterns -- Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment -- Image and Video Processing -- Face Recognition in Smart Rooms -- Gaussian Process Latent Variable Models for Human Pose Estimation -- Discourse and Dialogue Processing -- Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech -- Term-Weighting for Summarization of Multi-party Spoken Dialogues -- Automatic Decision Detection in Meeting Speech -- Czech Text-to-Sign Speech Synthesizer -- Speech and Audio Processing -- Using Prosodic Features in Language Models for Meetings -- Posterior-Based Features and Distances in Template Matching for Speech Recognition -- A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems -- Transfer Learning for Tandem ASR Feature Extraction -- Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search -- Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding -- Modeling Vocal Interaction for Segmentation in Meeting Recognition -- Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation -- PASCAL Speech Separation Challenge II -- To Separate Speech -- Microphone Array Beamforming Approach to Blind Speech Separation. |
Record Nr. | UNINA-9910483647703321 |
Berlin, Germany ; ; New York, New York : , : Springer, , [2008] | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
Machine learning for multimodal interaction : 5th international workshop, MLMI 2008, Utrecht, the Netherlands, September 8-10, 2008, proceedings / / Andrei Popescu-Belis, Rainer Stiefelhagen, editors |
Edizione | [1st ed. 2008.] |
Pubbl/distr/stampa | Berlin ; ; Heidelberg : , : Springer-Verlag, , [2008] |
Descrizione fisica | 1 online resource (XII, 364 p.) |
Disciplina | 006.454 |
Collana | Lecture Notes in Computer Science |
Soggetto topico |
Automatic speech recognition
Human-computer interaction Machine learning |
ISBN | 3-540-85853-9 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Face, Gesture and Nonverbal Communication -- Visual Focus of Attention in Dynamic Meeting Scenarios -- Fast and Robust Face Tracking for Analyzing Multiparty Face-to-Face Meetings -- What Does the Face-Turning Action Imply in Consensus Building Communication? -- Distinguishing the Communicative Functions of Gestures -- Optimised Meeting Recording and Annotation Using Real-Time Video Analysis -- Ambiguity Modeling in Latent Spaces -- Audio-Visual Scene Analysis and Speech Processing -- Inclusion of Video Information for Detection of Acoustic Events Using the Fuzzy Integral -- Audio-Visual Clustering for 3D Speaker Localization -- A Hybrid Generative-Discriminative Approach to Speaker Diarization -- A Neural Network Based Regression Approach for Recognizing Simultaneous Speech -- Hilbert Envelope Based Features for Far-Field Speech Recognition -- Multimodal Unit Selection for 2D Audiovisual Text-to-Speech Synthesis -- Social Signal Processing -- Decision-Level Fusion for Audio-Visual Laughter Detection -- Detection of Laughter-in-Interaction in Multichannel Close-Talk Microphone Recordings of Meetings -- Automatic Recognition of Spontaneous Emotions in Speech Using Acoustic and Lexical Features -- Daily Routine Classification from Mobile Phone Data -- Human-Human Spoken Dialogue Processing -- Hybrid Multi-step Disfluency Detection -- Exploring Features and Classifiers for Dialogue Act Segmentation -- Detecting Action Items in Meetings -- Modeling Topic and Role Information in Meetings Using the Hierarchical Dirichlet Process -- Time-Compressing Speech: ASR Transcripts Are an Effective Way to Support Gist Extraction -- Meta Comments for Summarizing Meeting Speech -- HCI and Applications -- A Generic Layout-Tool for Summaries of Meetings in a Constraint-Based Approach -- A Probabilistic Model for User Relevance Feedback on Image Retrieval -- The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings -- Introducing Additional Input Information into Interactive Machine Translation Systems -- Computer Assisted Transcription of Text Images and Multimodal Interaction -- User Requirements and Evaluation of Meeting Browsers and Assistants -- Designing and Evaluating Meeting Assistants, Keeping Humans in Mind -- Making Remote ‘Meeting Hopping’ Work: Assistance to Initiate, Join and Leave Meetings -- Physicality and Cooperative Design -- Developing and Evaluating a Meeting Assistant Test Bed -- Extrinsic Summarization Evaluation: A Decision Audit Task. |
Record Nr. | UNISA-996465870603316 |
Berlin ; ; Heidelberg : , : Springer-Verlag, , [2008] | ||
![]() | ||
Lo trovi qui: Univ. di Salerno | ||
|
Machine learning for multimodal interaction : 4th international workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007 : revised selected papers / / Andrei Popescu-Belis, Steve Renals, Hervé Bourlard (editors) |
Edizione | [1st ed. 2008.] |
Pubbl/distr/stampa | Berlin, Germany ; ; New York, New York : , : Springer, , [2008] |
Descrizione fisica | 1 online resource (XI, 308 p.) |
Disciplina | 006.3/1 |
Collana | Information Systems and Applications, incl. Internet/Web, and HCI |
Soggetto topico |
Automatic speech recognition
Human-computer interaction Machine learning |
ISBN | 3-540-78155-2 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | Invited Paper -- Robust Real Time Face Tracking for the Analysis of Human Behaviour -- Multimodal Processing -- Conditional Sequence Model for Context-Based Recognition of Gaze Aversion -- Meeting State Recognition from Visual and Aural Labels -- Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers -- HCI, User Studies and Applications -- Automatic Annotation of Dialogue Structure from Simple User Interaction -- Interactive Pattern Recognition -- User Specific Training of a Music Search Engine -- An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing -- Integrating Semantics into Multimodal Interaction Patterns -- Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment -- Image and Video Processing -- Face Recognition in Smart Rooms -- Gaussian Process Latent Variable Models for Human Pose Estimation -- Discourse and Dialogue Processing -- Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech -- Term-Weighting for Summarization of Multi-party Spoken Dialogues -- Automatic Decision Detection in Meeting Speech -- Czech Text-to-Sign Speech Synthesizer -- Speech and Audio Processing -- Using Prosodic Features in Language Models for Meetings -- Posterior-Based Features and Distances in Template Matching for Speech Recognition -- A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems -- Transfer Learning for Tandem ASR Feature Extraction -- Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search -- Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding -- Modeling Vocal Interaction for Segmentation in Meeting Recognition -- Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation -- PASCAL Speech Separation Challenge II -- To Separate Speech -- Microphone Array Beamforming Approach to Blind Speech Separation. |
Record Nr. | UNISA-996465581103316 |
Berlin, Germany ; ; New York, New York : , : Springer, , [2008] | ||
![]() | ||
Lo trovi qui: Univ. di Salerno | ||
|
Mastering voice interfaces : creating great voice apps for real users / / Ann Thymé-Gobbel, Charles Jankowski |
Autore | Thymé-Gobbel Ann |
Pubbl/distr/stampa | Berkeley, CA : , : Apress, , [2021] |
Descrizione fisica | 1 online resource (702 pages) |
Disciplina | 006.248392 |
Soggetto topico |
Voice computing
User interfaces (Computer systems) Automatic speech recognition Ambient intelligence |
ISBN | 1-4842-7005-3 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Nota di contenuto | PART 1: Voice System Foundations Chapter 1: Say Hello to Voice Systems Chapter 2: Keeping Voice in Mind Chapter 3: Running a Voice Implementation and Noticing Issues PART 2: Planning Voice System Interactions Chapter 4: Defining your Vision: Building What, How, and Why for Whom Chapter 5: From Discovery to UX and UI Design: Tools of the Voice-First Trade PART 3: Building Voice System Interactions Chapter 6: Applying Human 'Rules of Dialog' to Reach Conversation Resolution Chapter 7: Resolving Incomplete Requests Through Disambiguation Chapter 8: Conveying Reassurance with Confidence and Confirmation Chapter 9: Helping Users Succeed Through Consistency Chapter 10: Creating Robust Coverage for Speech-to-Text Resolution Chapter 11: Reaching Understanding Through Parsing and Intent Resolution Chapter 12: Applying Accuracy Strategies to Avoid Misunderstandings Chapter 13: Choosing Strategies to Recover from Miscommunication Chapter 14: Using Context and Data to Create Smarter Conversations Chapter 15: Creating Secure Personalized Experiences PART 4: Verifying and Deploying Voice System Interactions Chapter 16: Testing and Measuring Performance in Voice Systems Chapter 17: Tuning and Deploying Voice Systems |
Record Nr. | UNINA-9910484819203321 |
Thymé-Gobbel Ann
![]() |
||
Berkeley, CA : , : Apress, , [2021] | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
Multilingual phone recognition in Indian languages / / K. E. Manjunath |
Autore | Manjunath K. E. |
Pubbl/distr/stampa | Cham, Switzerland : , : Springer International Publishing, , [2021] |
Descrizione fisica | 1 online resource (113 pages) |
Disciplina | 410.285 |
Collana | SpringerBriefs in Speech Technology |
Soggetto topico |
Computational linguistics - India
Automatic speech recognition |
ISBN | 3-030-80741-X |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910523880803321 |
Manjunath K. E.
![]() |
||
Cham, Switzerland : , : Springer International Publishing, , [2021] | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
Pattern recognition in speech and language processing / edited by Wu Chou, Biing-Hwang Juang |
Pubbl/distr/stampa | Boca Raton : CRC Press, 2003 |
Descrizione fisica | vi, 394 p. : ill. ; 24 cm |
Disciplina | 006.454 |
Altri autori (Persone) |
Chou, Wu
Juang, Biing Hwang |
Soggetto topico |
Automatic speech recognition
Pattern recognition systems |
ISBN | 0849312329 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNISALENTO-991001428359707536 |
Boca Raton : CRC Press, 2003 | ||
![]() | ||
Lo trovi qui: Univ. del Salento | ||
|
Proceedings / / Integration of Speech and Image Understanding |
Pubbl/distr/stampa | Los Alamitos, Calif., : IEEE Computer Society |
Descrizione fisica | 1 online resource |
Disciplina | 621 |
Soggetto topico |
Automatic speech recognition - Congresses
Image processing Computer vision Artificial intelligence Speech processing systems Automatic speech recognition |
Soggetto genere / forma |
Conference papers and proceedings.
Periodicals. |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Periodico |
Lingua di pubblicazione | eng |
Record Nr. | UNISA-996280563003316 |
Los Alamitos, Calif., : IEEE Computer Society | ||
![]() | ||
Lo trovi qui: Univ. di Salerno | ||
|
Proceedings / / Integration of Speech and Image Understanding |
Pubbl/distr/stampa | Los Alamitos, Calif., : IEEE Computer Society |
Descrizione fisica | 1 online resource |
Disciplina | 621 |
Soggetto topico |
Automatic speech recognition - Congresses
Image processing Computer vision Artificial intelligence Speech processing systems Automatic speech recognition |
Soggetto genere / forma |
Conference papers and proceedings.
Periodicals. |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Periodico |
Lingua di pubblicazione | eng |
Record Nr. | UNINA-9910874416003321 |
Los Alamitos, Calif., : IEEE Computer Society | ||
![]() | ||
Lo trovi qui: Univ. Federico II | ||
|
PVC : proceedings 2014 XXII Annual Pacific Voice Conference : Kraków, Poland, 11-13 April 2014 |
Pubbl/distr/stampa | New York : , : IEEE, , 2014 |
Descrizione fisica | 1 online resource (212 pages) |
Soggetto topico |
Speech processing systems
Automatic speech recognition |
ISBN | 1-4799-3700-2 |
Formato | Materiale a stampa ![]() |
Livello bibliografico | Monografia |
Lingua di pubblicazione | eng |
Record Nr. | UNISA-996279746403316 |
New York : , : IEEE, , 2014 | ||
![]() | ||
Lo trovi qui: Univ. di Salerno | ||
|