Audio source separation and speech enhancement / edited by Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot
Author Vincent, Emmanuel
Edition [1st edition]
Publication/distribution/printing [Place of publication not identified] : Wiley, [2018]
Physical description 1 online resource (593 pages)
Discipline 006.454
Topical subject Speech processing systems
Automatic speech recognition
ISBN 1-119-27991-7
1-119-27988-7
1-119-27986-0
Format Printed material
Bibliographic level Monograph
Language of publication eng
Record no. UNINA-9910812041603321
Find it here: Univ. Federico II
Audiotex update
Publication/distribution/printing Boston, MA : WV Pub. Co
Physical description 1 online resource
Discipline 384
Topical subject Telecommunication
Communication and traffic
Automatic speech recognition
Genre/form subject Periodicals.
Format Printed material
Bibliographic level Periodical
Language of publication eng
Record no. UNINA-9910140933903321
Find it here: Univ. Federico II
Audiotex update
Publication/distribution/printing Boston, MA : WV Pub. Co
Physical description 1 online resource
Discipline 384
Topical subject Telecommunication
Communication and traffic
Automatic speech recognition
Genre/form subject Periodicals.
Format Printed material
Bibliographic level Periodical
Language of publication eng
Record no. UNISA-996199393803316
Find it here: Univ. di Salerno
Automatic speech and speaker recognition : large margin and kernel methods / [edited by] Joseph Keshet, Samy Bengio
Publication/distribution/printing Chichester, U.K. : J. Wiley & Sons, 2009
Physical description 1 online resource (271 p.)
Discipline 006.4
006.4/54
006.454
Other authors (Persons) Keshet, Joseph
Bengio, Samy
Topical subject Automatic speech recognition
ISBN 1-282-34941-4
9786612349416
0-470-74204-6
0-470-74203-8
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note List of Contributors -- Preface -- I Foundations -- 1 Introduction (Samy Bengio and Joseph Keshet) -- 1.1 The Traditional Approach to Speech Processing -- 1.2 Potential Problems of the Probabilistic Approach -- 1.3 Support Vector Machines for Binary Classification -- 1.4 Outline -- References -- 2 Theory and Practice of Support Vector Machines Optimization (Shai Shalev-Shwartz and Nathan Srebro) -- 2.1 Introduction -- 2.2 SVM and L2-regularized Linear Prediction -- 2.3 Optimization Accuracy From a Machine Learning Perspective -- 2.4 Stochastic Gradient Descent -- 2.5 Dual Decomposition Methods -- 2.6 Summary -- References -- 3 From Binary Classification to Categorial Prediction (Koby Crammer) -- 3.1 Multi-category Problems -- 3.2 Hypothesis Class -- 3.3 Loss Functions -- 3.4 Hinge Loss Functions -- 3.5 A Generalized Perceptron Algorithm -- 3.6 A Generalized Passive / Aggressive Algorithm -- 3.7 A Batch Formulation -- 3.8 Concluding Remarks -- 3.9 Appendix. Derivations of the Duals of the Passive / Aggressive Algorithm and the Batch Formulation -- References -- II Acoustic Modeling -- 4 A Large Margin Algorithm for Forced Alignment (Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan) -- 4.1 Introduction -- 4.2 Problem Setting -- 4.3 Cost and Risk -- 4.4 A Large Margin Approach for Forced Alignment -- 4.5 An Iterative Algorithm -- 4.6 Efficient Evaluation of the Alignment Function -- 4.7 Base Alignment Functions -- 4.8 Experimental Results -- 4.9 Discussion -- References -- 5 A Kernel Wrapper for Phoneme Sequence Recognition (Joseph Keshet and Dan Chazan) -- 5.1 Introduction -- 5.2 Problem Setting -- 5.3 Frame-based Phoneme Classifier -- 5.4 Kernel-based Iterative Algorithm for Phoneme Recognition -- 5.5 Nonlinear Feature Functions -- 5.6 Preliminary Experimental Results -- 5.7 Discussion: Can We Hope for Better Results? -- References -- 6 Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (Mark J. F. Gales) -- 6.1 Introduction -- 6.2 Temporal Correlation Modeling -- 6.3 Dynamic Kernels -- 6.4 Augmented Statistical Models -- 6.5 Experimental Results -- 6.6 Conclusions -- Acknowledgements -- References -- 7 Large Margin Training of Continuous Density Hidden Markov Models (Fei Sha and Lawrence K. Saul) -- 7.1 Introduction -- 7.2 Background -- 7.3 Large Margin Training -- 7.4 Experimental Results -- 7.5 Conclusion -- References -- III Language Modeling -- 8 A Survey of Discriminative Language Modeling Approaches for Large Vocabulary Continuous Speech Recognition (Brian Roark) -- 8.1 Introduction -- 8.2 General Framework -- 8.3 Further Developments -- 8.4 Summary and Discussion -- References -- 9 Large Margin Methods for Part-of-Speech Tagging (Yasemin Altun) -- 9.1 Introduction -- 9.2 Modeling Sequence Labeling -- 9.3 Sequence Boosting -- 9.4 Hidden Markov Support Vector Machines -- 9.5 Experiments -- 9.6 Discussion -- References -- 10 A Proposal for a Kernel Based Algorithm for Large Vocabulary Continuous Speech Recognition (Joseph Keshet) -- 10.1 Introduction -- 10.2 Segment Models and Hidden Markov Models -- 10.3 Kernel Based Model -- 10.4 Large Margin Training -- 10.5 Implementation Details -- 10.6 Discussion -- Acknowledgements -- References -- IV Applications -- 11 Discriminative Keyword Spotting (David Grangier, Joseph Keshet and Samy Bengio) -- 11.1 Introduction -- 11.2 Previous Work -- 11.3 Discriminative Keyword Spotting -- 11.4 Experiments and Results -- 11.5 Conclusions -- Acknowledgements -- References -- 12 Kernel-based Text-independent Speaker Verification (Johnny Mariéthoz, Samy Bengio and Yves Grandvalet) -- 12.1 Introduction -- 12.2 Generative Approaches -- 12.3 Discriminative Approaches -- 12.4 Benchmarking Methodology -- 12.5 Kernels for Speaker Verification -- 12.6 Parameter Sharing -- 12.7 Is the Margin Useful for This Problem? -- 12.8 Comparing all Methods -- 12.9 Conclusion -- References -- 13 Spectral Clustering for Speech Separation (Francis R. Bach and Michael I. Jordan) -- 13.1 Introduction -- 13.2 Spectral Clustering and Normalized Cuts -- 13.3 Cost Functions for Learning the Similarity Matrix -- 13.4 Algorithms for Learning the Similarity Matrix -- 13.5 Speech Separation as Spectrogram Segmentation -- 13.6 Spectral Clustering for Speech Separation -- 13.7 Conclusions -- References -- Index.
Record no. UNINA-9910146404403321
Find it here: Univ. Federico II
Automatic speech and speaker recognition : large margin and kernel methods / [edited by] Joseph Keshet, Samy Bengio
Publication/distribution/printing Chichester, U.K. : J. Wiley & Sons, 2009
Physical description 1 online resource (271 p.)
Discipline 006.4
006.4/54
006.454
Other authors (Persons) Keshet, Joseph
Bengio, Samy
Topical subject Automatic speech recognition
ISBN 1-282-34941-4
9786612349416
0-470-74204-6
0-470-74203-8
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note List of Contributors -- Preface -- I Foundations -- 1 Introduction (Samy Bengio and Joseph Keshet) -- 1.1 The Traditional Approach to Speech Processing -- 1.2 Potential Problems of the Probabilistic Approach -- 1.3 Support Vector Machines for Binary Classification -- 1.4 Outline -- References -- 2 Theory and Practice of Support Vector Machines Optimization (Shai Shalev-Shwartz and Nathan Srebro) -- 2.1 Introduction -- 2.2 SVM and L2-regularized Linear Prediction -- 2.3 Optimization Accuracy From a Machine Learning Perspective -- 2.4 Stochastic Gradient Descent -- 2.5 Dual Decomposition Methods -- 2.6 Summary -- References -- 3 From Binary Classification to Categorial Prediction (Koby Crammer) -- 3.1 Multi-category Problems -- 3.2 Hypothesis Class -- 3.3 Loss Functions -- 3.4 Hinge Loss Functions -- 3.5 A Generalized Perceptron Algorithm -- 3.6 A Generalized Passive / Aggressive Algorithm -- 3.7 A Batch Formulation -- 3.8 Concluding Remarks -- 3.9 Appendix. Derivations of the Duals of the Passive / Aggressive Algorithm and the Batch Formulation -- References -- II Acoustic Modeling -- 4 A Large Margin Algorithm for Forced Alignment (Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan) -- 4.1 Introduction -- 4.2 Problem Setting -- 4.3 Cost and Risk -- 4.4 A Large Margin Approach for Forced Alignment -- 4.5 An Iterative Algorithm -- 4.6 Efficient Evaluation of the Alignment Function -- 4.7 Base Alignment Functions -- 4.8 Experimental Results -- 4.9 Discussion -- References -- 5 A Kernel Wrapper for Phoneme Sequence Recognition (Joseph Keshet and Dan Chazan) -- 5.1 Introduction -- 5.2 Problem Setting -- 5.3 Frame-based Phoneme Classifier -- 5.4 Kernel-based Iterative Algorithm for Phoneme Recognition -- 5.5 Nonlinear Feature Functions -- 5.6 Preliminary Experimental Results -- 5.7 Discussion: Can We Hope for Better Results? -- References -- 6 Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (Mark J. F. Gales) -- 6.1 Introduction -- 6.2 Temporal Correlation Modeling -- 6.3 Dynamic Kernels -- 6.4 Augmented Statistical Models -- 6.5 Experimental Results -- 6.6 Conclusions -- Acknowledgements -- References -- 7 Large Margin Training of Continuous Density Hidden Markov Models (Fei Sha and Lawrence K. Saul) -- 7.1 Introduction -- 7.2 Background -- 7.3 Large Margin Training -- 7.4 Experimental Results -- 7.5 Conclusion -- References -- III Language Modeling -- 8 A Survey of Discriminative Language Modeling Approaches for Large Vocabulary Continuous Speech Recognition (Brian Roark) -- 8.1 Introduction -- 8.2 General Framework -- 8.3 Further Developments -- 8.4 Summary and Discussion -- References -- 9 Large Margin Methods for Part-of-Speech Tagging (Yasemin Altun) -- 9.1 Introduction -- 9.2 Modeling Sequence Labeling -- 9.3 Sequence Boosting -- 9.4 Hidden Markov Support Vector Machines -- 9.5 Experiments -- 9.6 Discussion -- References -- 10 A Proposal for a Kernel Based Algorithm for Large Vocabulary Continuous Speech Recognition (Joseph Keshet) -- 10.1 Introduction -- 10.2 Segment Models and Hidden Markov Models -- 10.3 Kernel Based Model -- 10.4 Large Margin Training -- 10.5 Implementation Details -- 10.6 Discussion -- Acknowledgements -- References -- IV Applications -- 11 Discriminative Keyword Spotting (David Grangier, Joseph Keshet and Samy Bengio) -- 11.1 Introduction -- 11.2 Previous Work -- 11.3 Discriminative Keyword Spotting -- 11.4 Experiments and Results -- 11.5 Conclusions -- Acknowledgements -- References -- 12 Kernel-based Text-independent Speaker Verification (Johnny Mariéthoz, Samy Bengio and Yves Grandvalet) -- 12.1 Introduction -- 12.2 Generative Approaches -- 12.3 Discriminative Approaches -- 12.4 Benchmarking Methodology -- 12.5 Kernels for Speaker Verification -- 12.6 Parameter Sharing -- 12.7 Is the Margin Useful for This Problem? -- 12.8 Comparing all Methods -- 12.9 Conclusion -- References -- 13 Spectral Clustering for Speech Separation (Francis R. Bach and Michael I. Jordan) -- 13.1 Introduction -- 13.2 Spectral Clustering and Normalized Cuts -- 13.3 Cost Functions for Learning the Similarity Matrix -- 13.4 Algorithms for Learning the Similarity Matrix -- 13.5 Speech Separation as Spectrogram Segmentation -- 13.6 Spectral Clustering for Speech Separation -- 13.7 Conclusions -- References -- Index.
Record no. UNINA-9910829918203321
Find it here: Univ. Federico II
Computer speech & language [e-journal]
Publication/distribution/printing London : Academic Press, c1986-
Discipline 006
Topical subject Speech processing systems
Automatic speech recognition
Genre/form subject Periodicals.
ISSN 1095-8363
Format Printed material
Bibliographic level Periodical
Language of publication eng
Variant titles Computer speech and language
Record no. UNISA-996205855403316
Find it here: Univ. di Salerno
Computer speech & language [e-journal]
Publication/distribution/printing London : Academic Press, c1986-
Discipline 006
Topical subject Speech processing systems
Automatic speech recognition
Genre/form subject Periodicals.
ISSN 1095-8363
Format Printed material
Bibliographic level Periodical
Language of publication eng
Variant titles Computer speech and language
Record no. UNINA-9910333251403321
Find it here: Univ. Federico II
Constructive dialogue modelling : speech interaction and rational agents / Kristiina Jokinen
Author Jokinen, Kristiina
Publication/distribution/printing Chichester, U.K. : Wiley, 2009
Physical description 1 online resource (180 p.)
Discipline 004.01/9
Series Wiley series in agent technology
Topical subject Human-computer interaction
Automatic speech recognition
Intelligent agents (Computer software)
Dialogue - Computer simulation
ISBN 1-282-18857-7
9786612188572
0-470-51127-3
0-470-51124-9
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Foreword. -- Preface. -- Acknowledgements. -- Introduction. -- Two Metaphors for Interaction Design. -- Design Models for Interactive Systems. -- Human Aspects in Dialogue System Design. -- Dialogue Models. -- Brief History. -- Modelling Approaches. -- Dialogue Management. -- Constructive Dialogue Model (CDM). -- Basic Principles of Communication. -- Full-blown Communication. -- Conversations with Computer Agents. -- Construction of Dialogue and Domain Information. -- Coherence and Context - Aboutness. -- Information Structure of Utterances - New and Old Information. -- Definitions of NewInfo and Topic. -- Topic Shifting. -- Information Management as Feedback Giving Activity. -- Information Management and Rational Agents. -- Dialogue Systems. -- Desiderata for Dialogue Agents. -- Technical Aspects in CDM. -- Summary. -- Constructive Information Technology. -- Learning and Adaptation. -- Cognitive Systems and Group Intelligence. -- Interaction and Affordance. -- Conclusions and Future Views. -- References. -- Index.
Record no. UNINA-9910139917403321
Find it here: Univ. Federico II
Constructive dialogue modelling : speech interaction and rational agents / Kristiina Jokinen
Author Jokinen, Kristiina
Publication/distribution/printing Chichester, U.K. : Wiley, 2009
Physical description 1 online resource (180 p.)
Discipline 004.01/9
Series Wiley series in agent technology
Topical subject Human-computer interaction
Automatic speech recognition
Intelligent agents (Computer software)
Dialogue - Computer simulation
ISBN 1-282-18857-7
9786612188572
0-470-51127-3
0-470-51124-9
Format Printed material
Bibliographic level Monograph
Language of publication eng
Contents note Foreword. -- Preface. -- Acknowledgements. -- Introduction. -- Two Metaphors for Interaction Design. -- Design Models for Interactive Systems. -- Human Aspects in Dialogue System Design. -- Dialogue Models. -- Brief History. -- Modelling Approaches. -- Dialogue Management. -- Constructive Dialogue Model (CDM). -- Basic Principles of Communication. -- Full-blown Communication. -- Conversations with Computer Agents. -- Construction of Dialogue and Domain Information. -- Coherence and Context - Aboutness. -- Information Structure of Utterances - New and Old Information. -- Definitions of NewInfo and Topic. -- Topic Shifting. -- Information Management as Feedback Giving Activity. -- Information Management and Rational Agents. -- Dialogue Systems. -- Desiderata for Dialogue Agents. -- Technical Aspects in CDM. -- Summary. -- Constructive Information Technology. -- Learning and Adaptation. -- Cognitive Systems and Group Intelligence. -- Interaction and Affordance. -- Conclusions and Future Views. -- References. -- Index.
Record no. UNINA-9910826259603321
Find it here: Univ. Federico II
Data dependency on measurement uncertainties in speaker recognition evaluation [electronic resource] / Jin Chu Wu ... [and others]
Publication/distribution/printing Gaithersburg, MD : U.S. Dept. of Commerce, National Institute of Standards and Technology, [2011]
Physical description 1 online resource (18 pages) : color illustrations
Other authors (Persons) Wu, Jin Chu
Series NISTIR
Topical subject Automatic speech recognition
Biometric identification
Format Printed material
Bibliographic level Monograph
Language of publication eng
Record no. UNINA-9910701247503321
Find it here: Univ. Federico II