Audio source separation and speech enhancement / edited by Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot |
Author | Vincent, Emmanuel |
Edition | [1st edition] |
Publication/distribution/printing | [Place of publication not identified] : Wiley, [2018] |
Physical description | 1 online resource (593 pages) |
Discipline | 006.454 |
Topical subject | Speech processing systems ; Automatic speech recognition |
ISBN | 1-119-27991-7 ; 1-119-27988-7 ; 1-119-27986-0 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Record no. | UNINA-9910812041603321 |
Find it here: Univ. Federico II |
|
Audiotex update |
Publication/distribution/printing | Boston, MA : WV Pub. Co |
Physical description | 1 online resource |
Discipline | 384 |
Topical subject | Telecommunication ; Communication and traffic ; Automatic speech recognition |
Genre/form subject | Periodicals. |
Format | Printed material |
Bibliographic level | Periodical |
Language of publication | eng |
Record no. | UNINA-9910140933903321 |
Find it here: Univ. Federico II |
|
Audiotex update |
Publication/distribution/printing | Boston, MA : WV Pub. Co |
Physical description | 1 online resource |
Discipline | 384 |
Topical subject | Telecommunication ; Communication and traffic ; Automatic speech recognition |
Genre/form subject | Periodicals. |
Format | Printed material |
Bibliographic level | Periodical |
Language of publication | eng |
Record no. | UNISA-996199393803316 |
Find it here: Univ. di Salerno |
|
Automatic speech and speaker recognition : large margin and kernel methods / [edited by] Joseph Keshet, Samy Bengio |
Publication/distribution/printing | Chichester, U.K. : J. Wiley & Sons, 2009 |
Physical description | 1 online resource (271 p.) |
Discipline | 006.4 ; 006.4/54 ; 006.454 |
Other authors (persons) | Keshet, Joseph ; Bengio, Samy |
Topical subject | Automatic speech recognition |
ISBN | 1-282-34941-4 ; 9786612349416 ; 0-470-74204-6 ; 0-470-74203-8 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | List of Contributors -- Preface -- I Foundations -- 1 Introduction (Samy Bengio and Joseph Keshet) -- 1.1 The Traditional Approach to Speech Processing -- 1.2 Potential Problems of the Probabilistic Approach -- 1.3 Support Vector Machines for Binary Classification -- 1.4 Outline -- References -- 2 Theory and Practice of Support Vector Machines Optimization (Shai Shalev-Shwartz and Nathan Srebro) -- 2.1 Introduction -- 2.2 SVM and L2-regularized Linear Prediction -- 2.3 Optimization Accuracy From a Machine Learning Perspective -- 2.4 Stochastic Gradient Descent -- 2.5 Dual Decomposition Methods -- 2.6 Summary -- References -- 3 From Binary Classification to Categorial Prediction (Koby Crammer) -- 3.1 Multi-category Problems -- 3.2 Hypothesis Class -- 3.3 Loss Functions -- 3.4 Hinge Loss Functions -- 3.5 A Generalized Perceptron Algorithm -- 3.6 A Generalized Passive / Aggressive Algorithm -- 3.7 A Batch Formulation -- 3.8 Concluding Remarks -- 3.9 Appendix. Derivations of the Duals of the Passive / Aggressive Algorithm and the Batch Formulation -- References -- II Acoustic Modeling -- 4 A Large Margin Algorithm for Forced Alignment (Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan) -- 4.1 Introduction -- 4.2 Problem Setting -- 4.3 Cost and Risk -- 4.4 A Large Margin Approach for Forced Alignment -- 4.5 An Iterative Algorithm -- 4.6 Efficient Evaluation of the Alignment Function -- 4.7 Base Alignment Functions -- 4.8 Experimental Results -- 4.9 Discussion -- References -- 5 A Kernel Wrapper for Phoneme Sequence Recognition (Joseph Keshet and Dan Chazan) -- 5.1 Introduction -- 5.2 Problem Setting -- 5.3 Frame-based Phoneme Classifier -- 5.4 Kernel-based Iterative Algorithm for Phoneme Recognition -- 5.5 Nonlinear Feature Functions -- 5.6 Preliminary Experimental Results -- 5.7 Discussion: Can we Hope for Better Results? -- References -- 6 Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (Mark J. F. Gales) -- 6.1 Introduction -- 6.2 Temporal Correlation Modeling -- 6.3 Dynamic Kernels -- 6.4 Augmented Statistical Models -- 6.5 Experimental Results -- 6.6 Conclusions -- Acknowledgements -- References -- 7 Large Margin Training of Continuous Density Hidden Markov Models (Fei Sha and Lawrence K. Saul) -- 7.1 Introduction -- 7.2 Background -- 7.3 Large Margin Training -- 7.4 Experimental Results -- 7.5 Conclusion -- References -- III Language Modeling -- 8 A Survey of Discriminative Language Modeling Approaches for Large Vocabulary Continuous Speech Recognition (Brian Roark) -- 8.1 Introduction -- 8.2 General Framework -- 8.3 Further Developments -- 8.4 Summary and Discussion -- References -- 9 Large Margin Methods for Part-of-Speech Tagging (Yasemin Altun) -- 9.1 Introduction -- 9.2 Modeling Sequence Labeling -- 9.3 Sequence Boosting -- 9.4 Hidden Markov Support Vector Machines -- 9.5 Experiments -- 9.6 Discussion -- References -- 10 A Proposal for a Kernel Based Algorithm for Large Vocabulary Continuous Speech Recognition (Joseph Keshet) -- 10.1 Introduction -- 10.2 Segment Models and Hidden Markov Models -- 10.3 Kernel Based Model -- 10.4 Large Margin Training -- 10.5 Implementation Details -- 10.6 Discussion -- Acknowledgements -- References -- IV Applications -- 11 Discriminative Keyword Spotting (David Grangier, Joseph Keshet and Samy Bengio) -- 11.1 Introduction -- 11.2 Previous Work -- 11.3 Discriminative Keyword Spotting -- 11.4 Experiments and Results -- 11.5 Conclusions -- Acknowledgements -- References -- 12 Kernel-based Text-independent Speaker Verification (Johnny Mariéthoz, Samy Bengio and Yves Grandvalet) -- 12.1 Introduction -- 12.2 Generative Approaches -- 12.3 Discriminative Approaches -- 12.4 Benchmarking Methodology -- 12.5 Kernels for Speaker Verification -- 12.6 Parameter Sharing -- 12.7 Is the Margin Useful for This Problem? -- 12.8 Comparing all Methods -- 12.9 Conclusion -- References -- 13 Spectral Clustering for Speech Separation (Francis R. Bach and Michael I. Jordan) -- 13.1 Introduction -- 13.2 Spectral Clustering and Normalized Cuts -- 13.3 Cost Functions for Learning the Similarity Matrix -- 13.4 Algorithms for Learning the Similarity Matrix -- 13.5 Speech Separation as Spectrogram Segmentation -- 13.6 Spectral Clustering for Speech Separation -- 13.7 Conclusions -- References -- Index. |
Record no. | UNINA-9910146404403321 |
Find it here: Univ. Federico II |
|
Automatic speech and speaker recognition : large margin and kernel methods / [edited by] Joseph Keshet, Samy Bengio |
Publication/distribution/printing | Chichester, U.K. : J. Wiley & Sons, 2009 |
Physical description | 1 online resource (271 p.) |
Discipline | 006.4 ; 006.4/54 ; 006.454 |
Other authors (persons) | Keshet, Joseph ; Bengio, Samy |
Topical subject | Automatic speech recognition |
ISBN | 1-282-34941-4 ; 9786612349416 ; 0-470-74204-6 ; 0-470-74203-8 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | List of Contributors -- Preface -- I Foundations -- 1 Introduction (Samy Bengio and Joseph Keshet) -- 1.1 The Traditional Approach to Speech Processing -- 1.2 Potential Problems of the Probabilistic Approach -- 1.3 Support Vector Machines for Binary Classification -- 1.4 Outline -- References -- 2 Theory and Practice of Support Vector Machines Optimization (Shai Shalev-Shwartz and Nathan Srebro) -- 2.1 Introduction -- 2.2 SVM and L2-regularized Linear Prediction -- 2.3 Optimization Accuracy From a Machine Learning Perspective -- 2.4 Stochastic Gradient Descent -- 2.5 Dual Decomposition Methods -- 2.6 Summary -- References -- 3 From Binary Classification to Categorial Prediction (Koby Crammer) -- 3.1 Multi-category Problems -- 3.2 Hypothesis Class -- 3.3 Loss Functions -- 3.4 Hinge Loss Functions -- 3.5 A Generalized Perceptron Algorithm -- 3.6 A Generalized Passive / Aggressive Algorithm -- 3.7 A Batch Formulation -- 3.8 Concluding Remarks -- 3.9 Appendix. Derivations of the Duals of the Passive / Aggressive Algorithm and the Batch Formulation -- References -- II Acoustic Modeling -- 4 A Large Margin Algorithm for Forced Alignment (Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan) -- 4.1 Introduction -- 4.2 Problem Setting -- 4.3 Cost and Risk -- 4.4 A Large Margin Approach for Forced Alignment -- 4.5 An Iterative Algorithm -- 4.6 Efficient Evaluation of the Alignment Function -- 4.7 Base Alignment Functions -- 4.8 Experimental Results -- 4.9 Discussion -- References -- 5 A Kernel Wrapper for Phoneme Sequence Recognition (Joseph Keshet and Dan Chazan) -- 5.1 Introduction -- 5.2 Problem Setting -- 5.3 Frame-based Phoneme Classifier -- 5.4 Kernel-based Iterative Algorithm for Phoneme Recognition -- 5.5 Nonlinear Feature Functions -- 5.6 Preliminary Experimental Results -- 5.7 Discussion: Can we Hope for Better Results? -- References -- 6 Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (Mark J. F. Gales) -- 6.1 Introduction -- 6.2 Temporal Correlation Modeling -- 6.3 Dynamic Kernels -- 6.4 Augmented Statistical Models -- 6.5 Experimental Results -- 6.6 Conclusions -- Acknowledgements -- References -- 7 Large Margin Training of Continuous Density Hidden Markov Models (Fei Sha and Lawrence K. Saul) -- 7.1 Introduction -- 7.2 Background -- 7.3 Large Margin Training -- 7.4 Experimental Results -- 7.5 Conclusion -- References -- III Language Modeling -- 8 A Survey of Discriminative Language Modeling Approaches for Large Vocabulary Continuous Speech Recognition (Brian Roark) -- 8.1 Introduction -- 8.2 General Framework -- 8.3 Further Developments -- 8.4 Summary and Discussion -- References -- 9 Large Margin Methods for Part-of-Speech Tagging (Yasemin Altun) -- 9.1 Introduction -- 9.2 Modeling Sequence Labeling -- 9.3 Sequence Boosting -- 9.4 Hidden Markov Support Vector Machines -- 9.5 Experiments -- 9.6 Discussion -- References -- 10 A Proposal for a Kernel Based Algorithm for Large Vocabulary Continuous Speech Recognition (Joseph Keshet) -- 10.1 Introduction -- 10.2 Segment Models and Hidden Markov Models -- 10.3 Kernel Based Model -- 10.4 Large Margin Training -- 10.5 Implementation Details -- 10.6 Discussion -- Acknowledgements -- References -- IV Applications -- 11 Discriminative Keyword Spotting (David Grangier, Joseph Keshet and Samy Bengio) -- 11.1 Introduction -- 11.2 Previous Work -- 11.3 Discriminative Keyword Spotting -- 11.4 Experiments and Results -- 11.5 Conclusions -- Acknowledgements -- References -- 12 Kernel-based Text-independent Speaker Verification (Johnny Mariéthoz, Samy Bengio and Yves Grandvalet) -- 12.1 Introduction -- 12.2 Generative Approaches -- 12.3 Discriminative Approaches -- 12.4 Benchmarking Methodology -- 12.5 Kernels for Speaker Verification -- 12.6 Parameter Sharing -- 12.7 Is the Margin Useful for This Problem? -- 12.8 Comparing all Methods -- 12.9 Conclusion -- References -- 13 Spectral Clustering for Speech Separation (Francis R. Bach and Michael I. Jordan) -- 13.1 Introduction -- 13.2 Spectral Clustering and Normalized Cuts -- 13.3 Cost Functions for Learning the Similarity Matrix -- 13.4 Algorithms for Learning the Similarity Matrix -- 13.5 Speech Separation as Spectrogram Segmentation -- 13.6 Spectral Clustering for Speech Separation -- 13.7 Conclusions -- References -- Index. |
Record no. | UNINA-9910829918203321 |
Find it here: Univ. Federico II |
|
Computer speech & language [e-journal] |
Publication/distribution/printing | London : Academic Press, c1986- |
Discipline | 006 |
Topical subject | Speech processing systems ; Automatic speech recognition |
Genre/form subject | Periodicals. |
ISSN | 1095-8363 |
Format | Printed material |
Bibliographic level | Periodical |
Language of publication | eng |
Variant titles | Computer speech and language |
Record no. | UNISA-996205855403316 |
Find it here: Univ. di Salerno |
|
Computer speech & language [e-journal] |
Publication/distribution/printing | London : Academic Press, c1986- |
Discipline | 006 |
Topical subject | Speech processing systems ; Automatic speech recognition |
Genre/form subject | Periodicals. |
ISSN | 1095-8363 |
Format | Printed material |
Bibliographic level | Periodical |
Language of publication | eng |
Variant titles | Computer speech and language |
Record no. | UNINA-9910333251403321 |
Find it here: Univ. Federico II |
|
Constructive dialogue modelling : speech interaction and rational agents / Kristiina Jokinen |
Author | Jokinen, Kristiina |
Publication/distribution/printing | Chichester, U.K. : Wiley, 2009 |
Physical description | 1 online resource (180 p.) |
Discipline | 004.01/9 |
Series | Wiley series in agent technology |
Topical subject | Human-computer interaction ; Automatic speech recognition ; Intelligent agents (Computer software) ; Dialogue - Computer simulation |
ISBN | 1-282-18857-7 ; 9786612188572 ; 0-470-51127-3 ; 0-470-51124-9 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Foreword. -- Preface. -- Acknowledgements. -- Introduction. -- Two Metaphors for Interaction Design. -- Design Models for Interactive Systems. -- Human Aspects in Dialogue System Design. -- Dialogue Models. -- Brief History. -- Modelling Approaches. -- Dialogue Management. -- Constructive Dialogue Model (CDM). -- Basic Principles of Communication. -- Full-blown Communication. -- Conversations with Computer Agents. -- Construction of Dialogue and Domain Information. -- Coherence and Context - Aboutness. -- Information Structure of Utterances - New and Old Information. -- Definitions of NewInfo and Topic. -- Topic Shifting. -- Information Management as Feedback Giving Activity. -- Information Management and Rational Agents. -- Dialogue Systems. -- Desiderata for Dialogue Agents. -- Technical Aspects in CDM. -- Summary. -- Constructive Information Technology. -- Learning and Adaptation. -- Cognitive Systems and Group Intelligence. -- Interaction and Affordance. -- Conclusions and Future Views. -- References. -- Index. |
Record no. | UNINA-9910139917403321 |
Find it here: Univ. Federico II |
|
Constructive dialogue modelling : speech interaction and rational agents / Kristiina Jokinen |
Author | Jokinen, Kristiina |
Publication/distribution/printing | Chichester, U.K. : Wiley, 2009 |
Physical description | 1 online resource (180 p.) |
Discipline | 004.01/9 |
Series | Wiley series in agent technology |
Topical subject | Human-computer interaction ; Automatic speech recognition ; Intelligent agents (Computer software) ; Dialogue - Computer simulation |
ISBN | 1-282-18857-7 ; 9786612188572 ; 0-470-51127-3 ; 0-470-51124-9 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Foreword. -- Preface. -- Acknowledgements. -- Introduction. -- Two Metaphors for Interaction Design. -- Design Models for Interactive Systems. -- Human Aspects in Dialogue System Design. -- Dialogue Models. -- Brief History. -- Modelling Approaches. -- Dialogue Management. -- Constructive Dialogue Model (CDM). -- Basic Principles of Communication. -- Full-blown Communication. -- Conversations with Computer Agents. -- Construction of Dialogue and Domain Information. -- Coherence and Context - Aboutness. -- Information Structure of Utterances - New and Old Information. -- Definitions of NewInfo and Topic. -- Topic Shifting. -- Information Management as Feedback Giving Activity. -- Information Management and Rational Agents. -- Dialogue Systems. -- Desiderata for Dialogue Agents. -- Technical Aspects in CDM. -- Summary. -- Constructive Information Technology. -- Learning and Adaptation. -- Cognitive Systems and Group Intelligence. -- Interaction and Affordance. -- Conclusions and Future Views. -- References. -- Index. |
Record no. | UNINA-9910826259603321 |
Find it here: Univ. Federico II |
|
Data dependency on measurement uncertainties in speaker recognition evaluation [electronic resource] / Jin Chu Wu ... [and others] |
Publication/distribution/printing | Gaithersburg, MD : U.S. Dept. of Commerce, National Institute of Standards and Technology, [2011] |
Physical description | 1 online resource (18 pages) : color illustrations |
Other authors (persons) | Wu, Jin Chu |
Series | NISTIR |
Topical subject | Automatic speech recognition ; Biometric identification |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Record no. | UNINA-9910701247503321 |
Find it here: Univ. Federico II |
|