top

  Info

  • Utilizzare la checkbox di selezione a fianco di ciascun documento per attivare le funzionalità di stampa, invio email, download nei formati disponibili del (i) record.

  Info

  • Utilizzare questo link per rimuovere la selezione effettuata.
Speaker classification I : fundamentals, features and methods / / Christian MuÌller (editor)
Speaker classification I : fundamentals, features and methods / / Christian MuÌller (editor)
Edizione [1st ed. 2007.]
Pubbl/distr/stampa Berlin ; ; Heidelberg ; ; New York : , : Springer, , [2007]
Descrizione fisica 1 online resource (X, 355 p.)
Disciplina 006.454
Collana Lecture Notes in Artificial Intelligence
Soggetto topico Automatic speech recognition
ISBN 3-540-74200-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Fundamentals -- How Is Individuality Expressed in Voice? An Introduction to Speech Production and Description for Speaker Classification -- Speaker Classification Concepts: Past, Present and Future -- Characteristics -- Speaker Characteristics -- Foreign Accent -- Acoustic Analysis of Adult Speaker Age -- Speech Under Stress: Analysis, Modeling and Recognition -- Speaker Characteristics and Emotion Classification -- Emotions in Speech: Juristic Implications -- Applications -- Application of Speaker Classification in Human Machine Dialog Systems -- Speaker Classification in Forensic Phonetics and Acoustics -- Forensic Automatic Speaker Classification in the “Coming Paradigm Shift” -- The Many Roles of Speaker Classification in Speaker Verification and Identification -- Methods and Features -- Frame Based Features -- Higher-Level Features in Speaker Recognition -- Enhancing Speaker Discrimination at the Feature Level -- Classification Methods for Speaker Recognition -- Multi-stream Fusion for Speaker Classification -- Evaluation -- Evaluations of Automatic Speaker Classification Systems -- An Introduction to Application-Independent Evaluation of Speaker Recognition Systems.
Record Nr. UNINA-9910483019603321
Berlin ; ; Heidelberg ; ; New York : , : Springer, , [2007]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Speaker classification I : fundamentals, features and methods / / Christian MuÌller (editor)
Speaker classification I : fundamentals, features and methods / / Christian MuÌller (editor)
Edizione [1st ed. 2007.]
Pubbl/distr/stampa Berlin ; ; Heidelberg ; ; New York : , : Springer, , [2007]
Descrizione fisica 1 online resource (X, 355 p.)
Disciplina 006.454
Collana Lecture Notes in Artificial Intelligence
Soggetto topico Automatic speech recognition
ISBN 3-540-74200-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Fundamentals -- How Is Individuality Expressed in Voice? An Introduction to Speech Production and Description for Speaker Classification -- Speaker Classification Concepts: Past, Present and Future -- Characteristics -- Speaker Characteristics -- Foreign Accent -- Acoustic Analysis of Adult Speaker Age -- Speech Under Stress: Analysis, Modeling and Recognition -- Speaker Characteristics and Emotion Classification -- Emotions in Speech: Juristic Implications -- Applications -- Application of Speaker Classification in Human Machine Dialog Systems -- Speaker Classification in Forensic Phonetics and Acoustics -- Forensic Automatic Speaker Classification in the “Coming Paradigm Shift” -- The Many Roles of Speaker Classification in Speaker Verification and Identification -- Methods and Features -- Frame Based Features -- Higher-Level Features in Speaker Recognition -- Enhancing Speaker Discrimination at the Feature Level -- Classification Methods for Speaker Recognition -- Multi-stream Fusion for Speaker Classification -- Evaluation -- Evaluations of Automatic Speaker Classification Systems -- An Introduction to Application-Independent Evaluation of Speaker Recognition Systems.
Record Nr. UNISA-996465305303316
Berlin ; ; Heidelberg ; ; New York : , : Springer, , [2007]
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
Speech and computer : 24th International Conference, SPECOM 2022, Gurugram, India, November 14-16, 2022, proceedings / / edited by S. R. Mahadeva Prasanna, [and three others]
Speech and computer : 24th International Conference, SPECOM 2022, Gurugram, India, November 14-16, 2022, proceedings / / edited by S. R. Mahadeva Prasanna, [and three others]
Pubbl/distr/stampa Cham, Switzerland : , : Springer, , [2022]
Descrizione fisica 1 online resource (737 pages)
Disciplina 050
Collana Lecture Notes in Computer Science
Soggetto topico Automatic speech recognition
ISBN 3-031-20980-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNISA-996500064003316
Cham, Switzerland : , : Springer, , [2022]
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
Speech and computer : 24th International Conference, SPECOM 2022, Gurugram, India, November 14-16, 2022, proceedings / / edited by S. R. Mahadeva Prasanna, [and three others]
Speech and computer : 24th International Conference, SPECOM 2022, Gurugram, India, November 14-16, 2022, proceedings / / edited by S. R. Mahadeva Prasanna, [and three others]
Pubbl/distr/stampa Cham, Switzerland : , : Springer, , [2022]
Descrizione fisica 1 online resource (737 pages)
Disciplina 050
Collana Lecture Notes in Computer Science
Soggetto topico Automatic speech recognition
ISBN 3-031-20980-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNINA-9910624307903321
Cham, Switzerland : , : Springer, , [2022]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Speech and language processing : an introduction to natural language processing, computational linguistics, and speech recognition / Daniel Jurafsky, James H. Martin
Speech and language processing : an introduction to natural language processing, computational linguistics, and speech recognition / Daniel Jurafsky, James H. Martin
Autore Jurafsky, Dan, 1962-
Edizione [2nd ed.]
Pubbl/distr/stampa Upper Saddle River, N.J. : Pearson Prentice Hall, c2009
Descrizione fisica 10024 p. : ill. ; 25 cm
Disciplina 006.3
Altri autori (Persone) Martin, James H., 1959-
Collana Prentice Hall series in artificial intelligence
Soggetto topico Computational linguistics
Automatic speech recognition
ISBN 9780135041963
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNISALENTO-991003569239707536
Jurafsky, Dan, 1962-  
Upper Saddle River, N.J. : Pearson Prentice Hall, c2009
Materiale a stampa
Lo trovi qui: Univ. del Salento
Opac: Controlla la disponibilità qui
Speech processing for IP networks : Media Resource Control Protocol (MRCP) / / Dave Burke
Speech processing for IP networks : Media Resource Control Protocol (MRCP) / / Dave Burke
Autore Burke Dave
Pubbl/distr/stampa Chichester ; , : John Wiley & Sons, , c2007
Descrizione fisica 1 online resource (372 p.)
Disciplina 004.62
006.4/54
006.454
Soggetto topico Speech processing systems
Automatic speech recognition
TCP/IP (Computer network protocol)
ISBN 1-280-83878-7
9786610838783
0-470-06059-X
0-470-06060-3
Classificazione 54.32
54.79
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto PART I. BACKGROUND. 1. Introduction. 1.1 Introduction to Speech Applications. 1.2 The MRCP Value Proposition. 1.3 History of MRCP Standardisation. 1.3.1 Internet Engineering Task Force. 1.3.2 World Wide Web Consortium. 1.3.3 MRCP: From Humble Beginnings Toward IETF Standard. 1.4 Summary. 2. Basic Principles of Speech Processing. 2.1 Human Speech Production. 2.1.1 Speech Sounds: Phonemics and Phonetics. 2.2 Speech Recognition. 2.2.1 Endpoint Detection. 2.2.2 Mel-Cepstrum. 2.2.3 Hidden Markov Models. 2.2.4 Language Modelling. 2.3 Speaker Verification and Identification. 2.3.1 Feature Extraction. 2.3.2 Statistical Modelling. 2.4 Speech Synthesis. 2.4.1 Front-end Processing. 2.4.2 Back-end Synthesis. 2.5 Summary. 3. Overview of MRCP. 3.1 Architecture. 3.2 Media Resource Types. 3.3 Network Scenarios. 3.3.1 VoiceXML IVR Service Node. 3.3.2 IP PBX with Voicemail. 3.3.3 Advanced Media Gateway. 3.4 Protocol Operation. 3.4.1 Establishing Communication Channels. 3.4.2 Controlling a Media Resource. 3.4.3 Walkthrough Examples. 3.5 Security. 3.6 Summary. PART II. MEDIA AND CONTROL SESSIONS. 4. Session Initiation Protocol. 4.1 Introduction. 4.2 Walkthrough Example. 4.3 SIP URIs. 4.4 Transport. 4.5 Media Negotiation. 4.5.1 Session Description Protocol. 4.5.2 Offer/Answer Model. 4.6 SIP Servers. 4.6.1 Registrars. 4.6.2 Proxy Servers. 4.6.3 Redirect Servers. 4.7 SIP Extensions. 4.7.1 Capability Discovery. 4.8 Security. 4.8.1 Transport and Network Layer Security. 4.8.2 Authentication. 4.8.3 S/MIME. 4.9 Summary. 5. Session Initiation in MRCP. 5.1 Introduction. 5.2 Initiating the Media Session. 5.3 Initiating the Control Session. 5.4 Session Initiation Examples. 5.4.1 Single Media Resource. 5.4.2 Adding and Removing Media Resources. 5.4.3 Distributed Media Source/Sink. 5.5 Locating Media Resource Servers. 5.5.1 Requesting Server Capabilities. 5.5.2 Media Resource Brokers. 5.6 Security. 5.7 Summary. 6. The Media Session. 6.1 Media Encoding. 6.1.1 Pulse Code Modulation (PCM). 6.1.2 Linear Predictive Coding (LPC). 6.2 Media Transport. 6.2.1 Real-Time Protocol (RTP). 6.2.2 DTMF. 6.3 Security. 6.4 Summary. 7. The Control Session. 7.1 Message Structure. 7.1.1 Request Message. 7.1.2 Response Message. 7.1.3 Event Message. 7.1.4 Message Bodies. 7.2 Generic Methods. 7.3 Generic Headers. 7.4 Security. 7.5 Summary. PART III. DATA REPRESENTATION FORMATS. 8. Speech Synthesis Markup Language (SSML). 8.1 Introduction. 8.2 Document Structure. 8.3 Recorded Audio. 8.4 Pronunciation. 8.4.1 Phonemic/Phonetic Content. 8.4.2 Substitution. 8.4.3 Interpreting Text . 8.5 Prosody. 8.5.1 Prosodic Boundaries. 8.5.2 Emphasis. 8.5.3 Speaking Voice. 8.5.4 Prosodic Control. 8.6 Markers . 8.7 Metadata. 8.8 Summary. 9. Speech Recognition Grammar Specification (SRGS). 9.1 Introduction. 9.2 Document Structure. 9.3 Rules, Tokens, and Sequences. 9.4 Alternatives. 9.5 Rule References. 9.5.1 Special Rules. 9.6 Repeats. 9.7 DTMF Grammars. 9.8 Semantic Interpretation. 9.8.1 Semantic Literals. 9.8.2 Semantic Scripts. 9.9 Summary. 10. Natural Language Semantics Markup Language (NLSML). 10.1 Introduction. 10.2 Document Structure. 10.3 Speech Recognition Results. 10.3.1 Serialising Semantic Interpretation Results. 10.4 Voice Enrollment Results. 10.5 Speaker Verification Results. 10.6 Summary. 11. Pronunciation Lexicon Specification (PLS). 11.1 Introduction. 11.2 Document Structure. 11.3 Lexical Entries. 11.4 Abbreviations and Acronyms. 11.5 Multiple Orthographies. 11.6 Multiple Pronunciations. 11.7 Summary. PART IV. MEDIA RESOURCES. 12. Speech Synthesiser Resource. 12.1 Overview. 12.2 Methods. 12.2.1 SPEAK. 12.2.2 PAUSE. 12.2.3 RESUME. 12.2.4 STOP. 12.2.5 BARGE-IN-OCCURRED. 12.2.6 CONTROL. 12.2.7 DEFINE-LEXICON. 12.3 Events. 12.3.1 SPEECH-MARKER. 12.3.2 SPEAK-COMPLETE. 12.4 Headers. 12.5 Summary. 13. Speech Recogniser Resource. 13.1 Overview. 13.2 Recognition Methods. 13.2.1 RECOGNIZE. 13.2.2 DEFINE-GRAMMAR. 13.2.3 START-INPUT-TIMERS. 13.2.4 GET-RESULT. 13.2.5 STOP. 13.2.6 INTERPRET. 13.3 Enrollment Methods. 13.3.1 START-PHRASE-ENROLLMENT. 13.3.2 ENROLLMENT-ROLLBACK. 13.3.3 END-PHRASE-ENROLLMENT. 13.3.4 MODIFY-PHRASE. 13.3.5 DELETE-PHRASE. 13.4 Events. 13.4.1 START-OF-INPUT. 13.4.2 RECOGNITION-COMPLETE. 13.4.3 INTERPRETATION-COMPLETE. 13.5 Recognition Headers. 13.6 Enrollment Headers. 13.7 Summary. 14. Recorder Resource. 14.1 Overview. 14.2 Methods. 14.2.1 RECORD. 14.2.2 START-INPUT-TIMERS. 14.2.3 STOP. 14.3 Events. 14.3.1 START-OF-INPUT. 14.3.2 RECORD-COMPLETE. 14.4 Headers. 14.5 Summary. 15. Speaker Verification Resource. 15.1 Overview. 15.2 Methods. 15.2.1 START-SESSION. 15.2.2 END-SESSION. 15.2.3 VERIFY. 15.2.4 VERIFY-FROM-BUFFER. 15.2.5 VERIFY-ROLLBACK. 15.2.6 START-INPUT-TIMERS. 15.2.7 GET-INTERMEDIATE-RESULT. 15.2.8 STOP. 15.2.9 CLEAR-BUFFER. 15.2.10 QUERY-VOICEPRINT. 15.2.11 DELETE-VOICEPRINT. 15.3 Events. 15.3.1 START-OF-INPUT. 15.3.2 VERIFICATION-COMPLETE. 15.4 Headers. 15.5 Summary. PART V. PROGRAMMING SPEECH APPLICATIONS. 16. Voice eXtensible Markup Language (VoiceXML). 16.1 Introduction. 16.2 Document Structure. 16.2.1 Applications and Dialogs. 16.3 Dialogs. 16.3.1 Forms. 16.3.2 Menus. 16.3.3 Mixed Initiative Dialogs. 16.4 Media Playback. 16.5 Media Recording. 16.6 Speech and DTMF Recognition. 16.6.1 Specifying Grammars. 16.6.2 Grammar Scope and Activation. 16.6.3 Configuring Recognition Settings. 16.6.4 Processing Recognition Results. 16.7 Flow Control. 16.7.1 Executable Content. 16.7.2 Variables, Scopes, and Expressions. 16.7.3 Document and Dialog Transitions . 16.7.4 Event Handling. 16.8 Resource Fetching. 16.9 Call Transfer. 16.10 Summary. 17. VoiceXML and MRCP Interworking. 17.1 Introduction. 17.2 Interworking Fundamentals. 17.2.1 Play Prompts. 17.2.2 Play and Recognise. 17.2.3 Record. 17.3 Application Example. 17.3.1 VoiceXML Scripts. 17.3.2 MRCP Flows. 17.4 Summary. Appendix A. MRCP Version 1. A.1 Overview. A.2 Session Management and Message Transport. A.3 General Protocol Details. A.4 Speech Synthesiser Resource. A.5 Speech Recogniser Resource. Appendix B. XML Primer. B.1 Background. B.2 Basic Concepts. B.3 Namespaces. B.4 Document Schemas. Appendix C. HTTP Primer. C.1 Background. C.2 Basic Concepts. C.2.1 GET Method. C.2.2 POST Method. C.3 Caching. C.4 Cookies. C.5 Security. References. Index. Acronyms.
Record Nr. UNINA-9910143588103321
Burke Dave  
Chichester ; , : John Wiley & Sons, , c2007
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Speech processing for IP networks : Media Resource Control Protocol (MRCP) / / Dave Burke
Speech processing for IP networks : Media Resource Control Protocol (MRCP) / / Dave Burke
Autore Burke Dave
Pubbl/distr/stampa Chichester ; , : John Wiley & Sons, , c2007
Descrizione fisica 1 online resource (372 p.)
Disciplina 004.62
006.4/54
006.454
Soggetto topico Speech processing systems
Automatic speech recognition
TCP/IP (Computer network protocol)
ISBN 1-280-83878-7
9786610838783
0-470-06059-X
0-470-06060-3
Classificazione 54.32
54.79
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto PART I. BACKGROUND. 1. Introduction. 1.1 Introduction to Speech Applications. 1.2 The MRCP Value Proposition. 1.3 History of MRCP Standardisation. 1.3.1 Internet Engineering Task Force. 1.3.2 World Wide Web Consortium. 1.3.3 MRCP: From Humble Beginnings Toward IETF Standard. 1.4 Summary. 2. Basic Principles of Speech Processing. 2.1 Human Speech Production. 2.1.1 Speech Sounds: Phonemics and Phonetics. 2.2 Speech Recognition. 2.2.1 Endpoint Detection. 2.2.2 Mel-Cepstrum. 2.2.3 Hidden Markov Models. 2.2.4 Language Modelling. 2.3 Speaker Verification and Identification. 2.3.1 Feature Extraction. 2.3.2 Statistical Modelling. 2.4 Speech Synthesis. 2.4.1 Front-end Processing. 2.4.2 Back-end Synthesis. 2.5 Summary. 3. Overview of MRCP. 3.1 Architecture. 3.2 Media Resource Types. 3.3 Network Scenarios. 3.3.1 VoiceXML IVR Service Node. 3.3.2 IP PBX with Voicemail. 3.3.3 Advanced Media Gateway. 3.4 Protocol Operation. 3.4.1 Establishing Communication Channels. 3.4.2 Controlling a Media Resource. 3.4.3 Walkthrough Examples. 3.5 Security. 3.6 Summary. PART II. MEDIA AND CONTROL SESSIONS. 4. Session Initiation Protocol. 4.1 Introduction. 4.2 Walkthrough Example. 4.3 SIP URIs. 4.4 Transport. 4.5 Media Negotiation. 4.5.1 Session Description Protocol. 4.5.2 Offer/Answer Model. 4.6 SIP Servers. 4.6.1 Registrars. 4.6.2 Proxy Servers. 4.6.3 Redirect Servers. 4.7 SIP Extensions. 4.7.1 Capability Discovery. 4.8 Security. 4.8.1 Transport and Network Layer Security. 4.8.2 Authentication. 4.8.3 S/MIME. 4.9 Summary. 5. Session Initiation in MRCP. 5.1 Introduction. 5.2 Initiating the Media Session. 5.3 Initiating the Control Session. 5.4 Session Initiation Examples. 5.4.1 Single Media Resource. 5.4.2 Adding and Removing Media Resources. 5.4.3 Distributed Media Source/Sink. 5.5 Locating Media Resource Servers. 5.5.1 Requesting Server Capabilities. 5.5.2 Media Resource Brokers. 5.6 Security. 5.7 Summary. 6. The Media Session. 6.1 Media Encoding. 6.1.1 Pulse Code Modulation (PCM). 6.1.2 Linear Predictive Coding (LPC). 6.2 Media Transport. 6.2.1 Real-Time Protocol (RTP). 6.2.2 DTMF. 6.3 Security. 6.4 Summary. 7. The Control Session. 7.1 Message Structure. 7.1.1 Request Message. 7.1.2 Response Message. 7.1.3 Event Message. 7.1.4 Message Bodies. 7.2 Generic Methods. 7.3 Generic Headers. 7.4 Security. 7.5 Summary. PART III. DATA REPRESENTATION FORMATS. 8. Speech Synthesis Markup Language (SSML). 8.1 Introduction. 8.2 Document Structure. 8.3 Recorded Audio. 8.4 Pronunciation. 8.4.1 Phonemic/Phonetic Content. 8.4.2 Substitution. 8.4.3 Interpreting Text . 8.5 Prosody. 8.5.1 Prosodic Boundaries. 8.5.2 Emphasis. 8.5.3 Speaking Voice. 8.5.4 Prosodic Control. 8.6 Markers . 8.7 Metadata. 8.8 Summary. 9. Speech Recognition Grammar Specification (SRGS). 9.1 Introduction. 9.2 Document Structure. 9.3 Rules, Tokens, and Sequences. 9.4 Alternatives. 9.5 Rule References. 9.5.1 Special Rules. 9.6 Repeats. 9.7 DTMF Grammars. 9.8 Semantic Interpretation. 9.8.1 Semantic Literals. 9.8.2 Semantic Scripts. 9.9 Summary. 10. Natural Language Semantics Markup Language (NLSML). 10.1 Introduction. 10.2 Document Structure. 10.3 Speech Recognition Results. 10.3.1 Serialising Semantic Interpretation Results. 10.4 Voice Enrollment Results. 10.5 Speaker Verification Results. 10.6 Summary. 11. Pronunciation Lexicon Specification (PLS). 11.1 Introduction. 11.2 Document Structure. 11.3 Lexical Entries. 11.4 Abbreviations and Acronyms. 11.5 Multiple Orthographies. 11.6 Multiple Pronunciations. 11.7 Summary. PART IV. MEDIA RESOURCES. 12. Speech Synthesiser Resource. 12.1 Overview. 12.2 Methods. 12.2.1 SPEAK. 12.2.2 PAUSE. 12.2.3 RESUME. 12.2.4 STOP. 12.2.5 BARGE-IN-OCCURRED. 12.2.6 CONTROL. 12.2.7 DEFINE-LEXICON. 12.3 Events. 12.3.1 SPEECH-MARKER. 12.3.2 SPEAK-COMPLETE. 12.4 Headers. 12.5 Summary. 13. Speech Recogniser Resource. 13.1 Overview. 13.2 Recognition Methods. 13.2.1 RECOGNIZE. 13.2.2 DEFINE-GRAMMAR. 13.2.3 START-INPUT-TIMERS. 13.2.4 GET-RESULT. 13.2.5 STOP. 13.2.6 INTERPRET. 13.3 Enrollment Methods. 13.3.1 START-PHRASE-ENROLLMENT. 13.3.2 ENROLLMENT-ROLLBACK. 13.3.3 END-PHRASE-ENROLLMENT. 13.3.4 MODIFY-PHRASE. 13.3.5 DELETE-PHRASE. 13.4 Events. 13.4.1 START-OF-INPUT. 13.4.2 RECOGNITION-COMPLETE. 13.4.3 INTERPRETATION-COMPLETE. 13.5 Recognition Headers. 13.6 Enrollment Headers. 13.7 Summary. 14. Recorder Resource. 14.1 Overview. 14.2 Methods. 14.2.1 RECORD. 14.2.2 START-INPUT-TIMERS. 14.2.3 STOP. 14.3 Events. 14.3.1 START-OF-INPUT. 14.3.2 RECORD-COMPLETE. 14.4 Headers. 14.5 Summary. 15. Speaker Verification Resource. 15.1 Overview. 15.2 Methods. 15.2.1 START-SESSION. 15.2.2 END-SESSION. 15.2.3 VERIFY. 15.2.4 VERIFY-FROM-BUFFER. 15.2.5 VERIFY-ROLLBACK. 15.2.6 START-INPUT-TIMERS. 15.2.7 GET-INTERMEDIATE-RESULT. 15.2.8 STOP. 15.2.9 CLEAR-BUFFER. 15.2.10 QUERY-VOICEPRINT. 15.2.11 DELETE-VOICEPRINT. 15.3 Events. 15.3.1 START-OF-INPUT. 15.3.2 VERIFICATION-COMPLETE. 15.4 Headers. 15.5 Summary. PART V. PROGRAMMING SPEECH APPLICATIONS. 16. Voice eXtensible Markup Language (VoiceXML). 16.1 Introduction. 16.2 Document Structure. 16.2.1 Applications and Dialogs. 16.3 Dialogs. 16.3.1 Forms. 16.3.2 Menus. 16.3.3 Mixed Initiative Dialogs. 16.4 Media Playback. 16.5 Media Recording. 16.6 Speech and DTMF Recognition. 16.6.1 Specifying Grammars. 16.6.2 Grammar Scope and Activation. 16.6.3 Configuring Recognition Settings. 16.6.4 Processing Recognition Results. 16.7 Flow Control. 16.7.1 Executable Content. 16.7.2 Variables, Scopes, and Expressions. 16.7.3 Document and Dialog Transitions . 16.7.4 Event Handling. 16.8 Resource Fetching. 16.9 Call Transfer. 16.10 Summary. 17. VoiceXML and MRCP Interworking. 17.1 Introduction. 17.2 Interworking Fundamentals. 17.2.1 Play Prompts. 17.2.2 Play and Recognise. 17.2.3 Record. 17.3 Application Example. 17.3.1 VoiceXML Scripts. 17.3.2 MRCP Flows. 17.4 Summary. Appendix A. MRCP Version 1. A.1 Overview. A.2 Session Management and Message Transport. A.3 General Protocol Details. A.4 Speech Synthesiser Resource. A.5 Speech Recogniser Resource. Appendix B. XML Primer. B.1 Background. B.2 Basic Concepts. B.3 Namespaces. B.4 Document Schemas. Appendix C. HTTP Primer. C.1 Background. C.2 Basic Concepts. C.2.1 GET Method. C.2.2 POST Method. C.3 Caching. C.4 Cookies. C.5 Security. References. Index. Acronyms.
Record Nr. UNINA-9910829978003321
Burke Dave  
Chichester ; , : John Wiley & Sons, , c2007
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Speech recognition and processing / / [compiled by John F. Buydos]
Speech recognition and processing / / [compiled by John F. Buydos]
Autore Buydos John F.
Pubbl/distr/stampa [Washington, D.C.] : , : Library of Congress, Science, Technology & Business Division, Science Reference Services, , [1994]
Descrizione fisica 1 online resource
Collana LC science tracer bullet
Soggetto topico Automatic speech recognition
Soggetto genere / forma Bibliographies.
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNINA-9910704915003321
Buydos John F.  
[Washington, D.C.] : , : Library of Congress, Science, Technology & Business Division, Science Reference Services, , [1994]
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Speech recognition over digital channels [[electronic resource] ] : robustness and standards / / Antonio M. Peinado, José C. Segura
Speech recognition over digital channels [[electronic resource] ] : robustness and standards / / Antonio M. Peinado, José C. Segura
Autore Peinado Antonio
Pubbl/distr/stampa Chichester, England ; ; Hoboken, NJ, : John Wiley, c2006
Descrizione fisica 1 online resource (275 p.)
Disciplina 006.454
621.384
Altri autori (Persone) SeguraJosé C
Soggetto topico Automatic speech recognition
Signal processing
Soggetto genere / forma Electronic books.
ISBN 1-280-60598-7
9786610605989
0-470-02472-0
0-470-02401-1
Classificazione 17.46
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Speech recognition with HMMs -- Networks and degradation -- Speech compression and architectures for RSR -- Robustness against transmission channel errors -- Front-end processing for robust feature extraction -- Standards for distributed speech recognition.
Record Nr. UNINA-9910143579803321
Peinado Antonio  
Chichester, England ; ; Hoboken, NJ, : John Wiley, c2006
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Speech recognition over digital channels [[electronic resource] ] : robustness and standards / / Antonio M. Peinado, José C. Segura
Speech recognition over digital channels [[electronic resource] ] : robustness and standards / / Antonio M. Peinado, José C. Segura
Autore Peinado Antonio
Pubbl/distr/stampa Chichester, England ; ; Hoboken, NJ, : John Wiley, c2006
Descrizione fisica 1 online resource (275 p.)
Disciplina 006.454
621.384
Altri autori (Persone) SeguraJosé C
Soggetto topico Automatic speech recognition
Signal processing
ISBN 1-280-60598-7
9786610605989
0-470-02472-0
0-470-02401-1
Classificazione 17.46
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Speech recognition with HMMs -- Networks and degradation -- Speech compression and architectures for RSR -- Robustness against transmission channel errors -- Front-end processing for robust feature extraction -- Standards for distributed speech recognition.
Record Nr. UNINA-9910829974603321
Peinado Antonio  
Chichester, England ; ; Hoboken, NJ, : John Wiley, c2006
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui