Share Catalogue

Search results: Automatic speech recognition


Audio source separation and speech enhancement / edited by Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot

Vincent, Emmanuel

[Place of publication not identified] : Wiley, [2018]

Printed material

Available at: Univ. Federico II

Audiotex update

Boston, MA : WV Pub. Co

Printed material

Available at: Univ. Federico II

Audiotex update

Boston, MA : WV Pub. Co

Printed material

Available at: Univ. di Salerno

Automatic speech and speaker recognition : large margin and kernel methods / [edited by] Joseph Keshet, Samy Bengio

Chichester, U.K. : J. Wiley & Sons, 2009

Printed material

Available at: Univ. Federico II

Automatic speech and speaker recognition : large margin and kernel methods / [edited by] Joseph Keshet, Samy Bengio

Chichester, U.K. : J. Wiley & Sons, 2009

Printed material

Available at: Univ. Federico II

Computer speech & language [e-journal]

London : Academic Press, c1986-

Printed material

Available at: Univ. di Salerno

Computer speech & language [e-journal]

London : Academic Press, c1986-

Printed material

Available at: Univ. Federico II

Constructive dialogue modelling : speech interaction and rational agents / Kristiina Jokinen

Jokinen, Kristiina

Chichester, U.K. : Wiley, 2009

Printed material

Available at: Univ. Federico II

Constructive dialogue modelling : speech interaction and rational agents / Kristiina Jokinen

Jokinen, Kristiina

Chichester, U.K. : Wiley, 2009

Printed material

Available at: Univ. Federico II

Data dependency on measurement uncertainties in speaker recognition evaluation [electronic resource] / Jin Chu Wu ... [and others]

Gaithersburg, MD : U.S. Dept. of Commerce, National Institute of Standards and Technology, [2011]

Printed material

Available at: Univ. Federico II


Works

Machine Learning for Multimodal Interaction (4)
Audiotex update (2)
Voice technology news (2)
Computer speech & language (2)
.. Annual Pacific Voice Conference (2)
PVC (2)

Publisher

IEEE (20)
IEEE Xplore (10)
Wiley (8)
Institute of Electrical and Electronics Engineers (6)
Springer (6)

Language of publication

English (84)

Publication date

2009 (10)
2014 (7)
2010 (6)
2018 (6)
1994 (4)
2006 (4)
2007 (4)
2008 (4)
2012 (4)
2021 (4)

Topical subject

Automatic speech recognition (84)
Speech processing systems (36)
Natural language processing (Computer science) (13)
Computational linguistics (10)
Human-computer interaction (7)


Author	Vincent, Emmanuel
Edition	[1st edition]
Publication	[Place of publication not identified] : Wiley, [2018]
Physical description	1 online resource (593 pages)
Discipline	006.454
Topical subject	Speech processing systems; Automatic speech recognition
ISBN	1-119-27991-7; 1-119-27988-7; 1-119-27986-0
Format	Printed material
Bibliographic level	Monograph
Language of publication	eng
Record no.	UNINA-9910812041603321

Publication	Boston, MA : WV Pub. Co
Physical description	1 online resource
Discipline	384
Topical subject	Telecommunication; Communication and traffic; Automatic speech recognition
Genre / form subject	Periodicals.
Format	Printed material
Bibliographic level	Periodical
Language of publication	eng
Record no.	UNINA-9910140933903321

Publication	Boston, MA : WV Pub. Co
Physical description	1 online resource
Discipline	384
Topical subject	Telecommunication; Communication and traffic; Automatic speech recognition
Genre / form subject	Periodicals.
Format	Printed material
Bibliographic level	Periodical
Language of publication	eng
Record no.	UNISA-996199393803316

Publication	Chichester, U.K. : J. Wiley & Sons, 2009
Physical description	1 online resource (271 p.)
Discipline	006.4; 006.4/54; 006.454
Other authors (persons)	Keshet, Joseph; Bengio, Samy
Topical subject	Automatic speech recognition
ISBN	1-282-34941-4; 9786612349416; 0-470-74204-6; 0-470-74203-8
Format	Printed material
Bibliographic level	Monograph
Language of publication	eng
Contents note	List of Contributors -- Preface -- I Foundations -- 1 Introduction (Samy Bengio and Joseph Keshet) -- 1.1 The Traditional Approach to Speech Processing -- 1.2 Potential Problems of the Probabilistic Approach -- 1.3 Support Vector Machines for Binary Classification -- 1.4 Outline -- References -- 2 Theory and Practice of Support Vector Machines Optimization (Shai Shalev-Shwartz and Nathan Srebro) -- 2.1 Introduction -- 2.2 SVM and L2-regularized Linear Prediction -- 2.3 Optimization Accuracy From a Machine Learning Perspective -- 2.4 Stochastic Gradient Descent -- 2.5 Dual Decomposition Methods -- 2.6 Summary -- References -- 3 From Binary Classification to Categorial Prediction (Koby Crammer) -- 3.1 Multi-category Problems -- 3.2 Hypothesis Class -- 3.3 Loss Functions -- 3.4 Hinge Loss Functions -- 3.5 A Generalized Perceptron Algorithm -- 3.6 A Generalized Passive / Aggressive Algorithm -- 3.7 A Batch Formulation -- 3.8 Concluding Remarks -- 3.9 Appendix. Derivations of the Duals of the Passive / Aggressive Algorithm and the Batch Formulation -- References -- II Acoustic Modeling -- 4 A Large Margin Algorithm for Forced Alignment (Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan) -- 4.1 Introduction -- 4.2 Problem Setting -- 4.3 Cost and Risk -- 4.4 A Large Margin Approach for Forced Alignment -- 4.5 An Iterative Algorithm -- 4.6 Efficient Evaluation of the Alignment Function -- 4.7 Base Alignment Functions -- 4.8 Experimental Results -- 4.9 Discussion -- References -- 5 A Kernel Wrapper for Phoneme Sequence Recognition (Joseph Keshet and Dan Chazan) -- 5.1 Introduction -- 5.2 Problem Setting -- 5.3 Frame-based Phoneme Classifier -- 5.4 Kernel-based Iterative Algorithm for Phoneme Recognition -- 5.5 Nonlinear Feature Functions -- 5.6 Preliminary Experimental Results -- 5.7 Discussion: Can we Hope for Better Results? -- References -- 6 Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (Mark J. F. Gales) -- 6.1 Introduction -- 6.2 Temporal Correlation Modeling -- 6.3 Dynamic Kernels -- 6.4 Augmented Statistical Models -- 6.5 Experimental Results -- 6.6 Conclusions -- Acknowledgements -- References -- 7 Large Margin Training of Continuous Density Hidden Markov Models (Fei Sha and Lawrence K. Saul) -- 7.1 Introduction -- 7.2 Background -- 7.3 Large Margin Training -- 7.4 Experimental Results -- 7.5 Conclusion -- References -- III Language Modeling -- 8 A Survey of Discriminative Language Modeling Approaches for Large Vocabulary Continuous Speech Recognition (Brian Roark) -- 8.1 Introduction -- 8.2 General Framework -- 8.3 Further Developments -- 8.4 Summary and Discussion -- References -- 9 Large Margin Methods for Part-of-Speech Tagging (Yasemin Altun) -- 9.1 Introduction -- 9.2 Modeling Sequence Labeling -- 9.3 Sequence Boosting -- 9.4 Hidden Markov Support Vector Machines -- 9.5 Experiments -- 9.6 Discussion -- References -- 10 A Proposal for a Kernel Based Algorithm for Large Vocabulary Continuous Speech Recognition (Joseph Keshet) -- 10.1 Introduction -- 10.2 Segment Models and Hidden Markov Models -- 10.3 Kernel Based Model -- 10.4 Large Margin Training -- 10.5 Implementation Details -- 10.6 Discussion -- Acknowledgements -- References -- IV Applications -- 11 Discriminative Keyword Spotting (David Grangier, Joseph Keshet and Samy Bengio) -- 11.1 Introduction -- 11.2 Previous Work -- 11.3 Discriminative Keyword Spotting -- 11.4 Experiments and Results -- 11.5 Conclusions -- Acknowledgements -- References -- 12 Kernel-based Text-independent Speaker Verification (Johnny Mariéthoz, Samy Bengio and Yves Grandvalet) -- 12.1 Introduction -- 12.2 Generative Approaches -- 12.3 Discriminative Approaches -- 12.4 Benchmarking Methodology -- 12.5 Kernels for Speaker Verification -- 12.6 Parameter Sharing -- 12.7 Is the Margin Useful for This Problem? -- 12.8 Comparing all Methods -- 12.9 Conclusion -- References -- 13 Spectral Clustering for Speech Separation (Francis R. Bach and Michael I. Jordan) -- 13.1 Introduction -- 13.2 Spectral Clustering and Normalized Cuts -- 13.3 Cost Functions for Learning the Similarity Matrix -- 13.4 Algorithms for Learning the Similarity Matrix -- 13.5 Speech Separation as Spectrogram Segmentation -- 13.6 Spectral Clustering for Speech Separation -- 13.7 Conclusions -- References -- Index.
Record no.	UNINA-9910146404403321

Publication	Chichester, U.K. : J. Wiley & Sons, 2009
Physical description	1 online resource (271 p.)
Discipline	006.4; 006.4/54; 006.454
Other authors (persons)	Keshet, Joseph; Bengio, Samy
Topical subject	Automatic speech recognition
ISBN	1-282-34941-4; 9786612349416; 0-470-74204-6; 0-470-74203-8
Format	Printed material
Bibliographic level	Monograph
Language of publication	eng
Contents note	List of Contributors -- Preface -- I Foundations -- 1 Introduction (Samy Bengio and Joseph Keshet) -- 1.1 The Traditional Approach to Speech Processing -- 1.2 Potential Problems of the Probabilistic Approach -- 1.3 Support Vector Machines for Binary Classification -- 1.4 Outline -- References -- 2 Theory and Practice of Support Vector Machines Optimization (Shai Shalev-Shwartz and Nathan Srebro) -- 2.1 Introduction -- 2.2 SVM and L2-regularized Linear Prediction -- 2.3 Optimization Accuracy From a Machine Learning Perspective -- 2.4 Stochastic Gradient Descent -- 2.5 Dual Decomposition Methods -- 2.6 Summary -- References -- 3 From Binary Classification to Categorial Prediction (Koby Crammer) -- 3.1 Multi-category Problems -- 3.2 Hypothesis Class -- 3.3 Loss Functions -- 3.4 Hinge Loss Functions -- 3.5 A Generalized Perceptron Algorithm -- 3.6 A Generalized Passive / Aggressive Algorithm -- 3.7 A Batch Formulation -- 3.8 Concluding Remarks -- 3.9 Appendix. Derivations of the Duals of the Passive / Aggressive Algorithm and the Batch Formulation -- References -- II Acoustic Modeling -- 4 A Large Margin Algorithm for Forced Alignment (Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan) -- 4.1 Introduction -- 4.2 Problem Setting -- 4.3 Cost and Risk -- 4.4 A Large Margin Approach for Forced Alignment -- 4.5 An Iterative Algorithm -- 4.6 Efficient Evaluation of the Alignment Function -- 4.7 Base Alignment Functions -- 4.8 Experimental Results -- 4.9 Discussion -- References -- 5 A Kernel Wrapper for Phoneme Sequence Recognition (Joseph Keshet and Dan Chazan) -- 5.1 Introduction -- 5.2 Problem Setting -- 5.3 Frame-based Phoneme Classifier -- 5.4 Kernel-based Iterative Algorithm for Phoneme Recognition -- 5.5 Nonlinear Feature Functions -- 5.6 Preliminary Experimental Results -- 5.7 Discussion: Can we Hope for Better Results? -- References -- 6 Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (Mark J. F. Gales) -- 6.1 Introduction -- 6.2 Temporal Correlation Modeling -- 6.3 Dynamic Kernels -- 6.4 Augmented Statistical Models -- 6.5 Experimental Results -- 6.6 Conclusions -- Acknowledgements -- References -- 7 Large Margin Training of Continuous Density Hidden Markov Models (Fei Sha and Lawrence K. Saul) -- 7.1 Introduction -- 7.2 Background -- 7.3 Large Margin Training -- 7.4 Experimental Results -- 7.5 Conclusion -- References -- III Language Modeling -- 8 A Survey of Discriminative Language Modeling Approaches for Large Vocabulary Continuous Speech Recognition (Brian Roark) -- 8.1 Introduction -- 8.2 General Framework -- 8.3 Further Developments -- 8.4 Summary and Discussion -- References -- 9 Large Margin Methods for Part-of-Speech Tagging (Yasemin Altun) -- 9.1 Introduction -- 9.2 Modeling Sequence Labeling -- 9.3 Sequence Boosting -- 9.4 Hidden Markov Support Vector Machines -- 9.5 Experiments -- 9.6 Discussion -- References -- 10 A Proposal for a Kernel Based Algorithm for Large Vocabulary Continuous Speech Recognition (Joseph Keshet) -- 10.1 Introduction -- 10.2 Segment Models and Hidden Markov Models -- 10.3 Kernel Based Model -- 10.4 Large Margin Training -- 10.5 Implementation Details -- 10.6 Discussion -- Acknowledgements -- References -- IV Applications -- 11 Discriminative Keyword Spotting (David Grangier, Joseph Keshet and Samy Bengio) -- 11.1 Introduction -- 11.2 Previous Work -- 11.3 Discriminative Keyword Spotting -- 11.4 Experiments and Results -- 11.5 Conclusions -- Acknowledgements -- References -- 12 Kernel-based Text-independent Speaker Verification (Johnny Mariéthoz, Samy Bengio and Yves Grandvalet) -- 12.1 Introduction -- 12.2 Generative Approaches -- 12.3 Discriminative Approaches -- 12.4 Benchmarking Methodology -- 12.5 Kernels for Speaker Verification -- 12.6 Parameter Sharing -- 12.7 Is the Margin Useful for This Problem? -- 12.8 Comparing all Methods -- 12.9 Conclusion -- References -- 13 Spectral Clustering for Speech Separation (Francis R. Bach and Michael I. Jordan) -- 13.1 Introduction -- 13.2 Spectral Clustering and Normalized Cuts -- 13.3 Cost Functions for Learning the Similarity Matrix -- 13.4 Algorithms for Learning the Similarity Matrix -- 13.5 Speech Separation as Spectrogram Segmentation -- 13.6 Spectral Clustering for Speech Separation -- 13.7 Conclusions -- References -- Index.
Record no.	UNINA-9910829918203321

Publication	London : Academic Press, c1986-
Discipline	006
Topical subject	Speech processing systems; Automatic speech recognition
Genre / form subject	Periodicals.
ISSN	1095-8363
Format	Printed material
Bibliographic level	Periodical
Language of publication	eng
Variant titles	Computer speech and language
Record no.	UNISA-996205855403316

Publication	Gaithersburg, MD : U.S. Dept. of Commerce, National Institute of Standards and Technology, [2011]
Physical description	1 online resource (18 pages) : color illustrations
Other authors (persons)	Wu, Jin Chu
Series	NISTIR
Topical subject	Automatic speech recognition; Biometric identification
Format	Printed material
Bibliographic level	Monograph
Language of publication	eng
Record no.	UNINA-9910701247503321

Author	Jokinen, Kristiina
Publication	Chichester, U.K. : Wiley, 2009
Physical description	1 online resource (180 p.)
Discipline	004.01/9
Series	Wiley series in agent technology
Topical subject	Human-computer interaction; Automatic speech recognition; Intelligent agents (Computer software); Dialogue - Computer simulation
ISBN	1-282-18857-7; 9786612188572; 0-470-51127-3; 0-470-51124-9
Format	Printed material
Bibliographic level	Monograph
Language of publication	eng
Contents note	Foreword. -- Preface. -- Acknowledgements. -- Introduction. -- Two Metaphors for Interaction Design. -- Design Models for Interactive Systems. -- Human Aspects in Dialogue System Design. -- Dialogue Models. -- Brief History. -- Modelling Approaches. -- Dialogue Management. -- Constructive Dialogue Model (CDM). -- Basic Principles of Communication. -- Full-blown Communication. -- Conversations with Computer Agents. -- Construction of Dialogue and Domain Information. -- Coherence and Context - Aboutness. -- Information Structure of Utterances - New and Old Information. -- Definitions of NewInfo and Topic. -- Topic Shifting. -- Information Management as Feedback Giving Activity. -- Information Management and Rational Agents. -- Dialogue Systems. -- Desiderata for Dialogue Agents. -- Technical Aspects in CDM. -- Summary. -- Constructive Information Technology. -- Learning and Adaptation. -- Cognitive Systems and Group Intelligence. -- Interaction and Affordance. -- Conclusions and Future Views. -- References. -- Index.
Record no.	UNINA-9910139917403321

Author	Jokinen, Kristiina
Publication	Chichester, U.K. : Wiley, 2009
Physical description	1 online resource (180 p.)
Discipline	004.01/9
Series	Wiley series in agent technology
Topical subject	Human-computer interaction; Automatic speech recognition; Intelligent agents (Computer software); Dialogue - Computer simulation
ISBN	1-282-18857-7; 9786612188572; 0-470-51127-3; 0-470-51124-9
Format	Printed material
Bibliographic level	Monograph
Language of publication	eng
Contents note	Foreword. -- Preface. -- Acknowledgements. -- Introduction. -- Two Metaphors for Interaction Design. -- Design Models for Interactive Systems. -- Human Aspects in Dialogue System Design. -- Dialogue Models. -- Brief History. -- Modelling Approaches. -- Dialogue Management. -- Constructive Dialogue Model (CDM). -- Basic Principles of Communication. -- Full-blown Communication. -- Conversations with Computer Agents. -- Construction of Dialogue and Domain Information. -- Coherence and Context - Aboutness. -- Information Structure of Utterances - New and Old Information. -- Definitions of NewInfo and Topic. -- Topic Shifting. -- Information Management as Feedback Giving Activity. -- Information Management and Rational Agents. -- Dialogue Systems. -- Desiderata for Dialogue Agents. -- Technical Aspects in CDM. -- Summary. -- Constructive Information Technology. -- Learning and Adaptation. -- Cognitive Systems and Group Intelligence. -- Interaction and Affordance. -- Conclusions and Future Views. -- References. -- Index.
Record no.	UNINA-9910826259603321