LEADER 05573nam 22007215 450 001 9910299856803321 005 20200702062151.0 010 $a1-4939-1456-1 024 7 $a10.1007/978-1-4939-1456-2 035 $a(CKB)3710000000261345 035 $a(EBL)1965080 035 $a(OCoLC)894114450 035 $a(SSID)ssj0001372506 035 $a(PQKBManifestationID)11761799 035 $a(PQKBTitleCode)TC0001372506 035 $a(PQKBWorkID)11304581 035 $a(PQKB)10147657 035 $a(DE-He213)978-1-4939-1456-2 035 $a(MiAaPQ)EBC1965080 035 $a(PPN)182093298 035 $a(EXLCZ)993710000000261345 100 $a20141014d2015 u| 0 101 0 $aeng 135 $aur|n|---||||| 181 $ctxt 182 $cc 183 $acr 200 10$aSpeech and Audio Processing for Coding, Enhancement and Recognition$b[electronic resource] /$fedited by Tokunbo Ogunfunmi, Roberto Togneri, Madihally (Sim) Narasimha 205 $a1st ed. 2015. 210 1$aNew York, NY :$cSpringer New York :$cImprint: Springer,$d2015. 215 $a1 online resource (347 p.) 300 $aDescription based upon print version of record. 311 $a1-4939-1455-3 320 $aIncludes bibliographical references. 327 $aFrom ?Harmonic Telegraph? to Cellular Phones -- Challenges in Speech Coding Research -- Recent Speech Coding Technologies and Standards -- Ensemble Learning Approaches in Speech Recognition -- Dynamic and Deep Networks For Speech Modeling and Recognition -- Speech Based Emotion Recognition -- Speaker Diarization: Challenges and Emerging Research -- Maximum a posteriori spectral estimation with source log-spectral priors for multichannel speech enhancement -- Modulation Processing for Speech Enhancement. 330 $aThis book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech recognition, emotion recognition and speaker diarization are also presented, along with recent advances and new paradigms in these areas. ·         Offers readers a single-source reference on the significant applications of speech and audio processing to speech coding, speech enhancement and speech/speaker recognition. Enables readers involved in algorithm development and implementation issues for speech coding to understand the historical development and future challenges in speech coding research; ·         Discusses speech coding methods yielding bit-streams that are multi-rate and scalable for Voice-over-IP (VoIP) Networks; ·         Presents an overview of recent developments in conversational speech coding technologies, important new algorithmic advances, and recent standardization activities in ITU-T, 3GPP, 3GPP2, MPEG and IETF that offer a significantly improved user experience during voice calls on existing and future communication systems; ·         Presents an overview of ensemble learning efforts based on different machine learning techniques that have emerged in automatic speech recognition in recent years; ·         Emphasizes signal processing for efficient time-domain and spectral-domain representations, reduction of noise, channel and session variabilities, extraction of temporal and spectral features for recognition and modeling; ·         Informs readers of the latest research and developments in advanced statistical estimation and deep neural networks for speech recognition; ·         Presents readers with the architectural framework and key approaches involved in the ?hot? research areas of emotion recognition and speaker diairization systems; ·         Provides readers with a more enriching view of state of the art research in speech enhancement arising from novel multi-microphone and time-frequency solutions. 606 $aSignal processing 606 $aImage processing 606 $aSpeech processing systems 606 $aUser interfaces (Computer systems) 606 $aMultimedia information systems 606 $aSignal, Image and Speech Processing$3https://scigraph.springernature.com/ontologies/product-market-codes/T24051 606 $aUser Interfaces and Human Computer Interaction$3https://scigraph.springernature.com/ontologies/product-market-codes/I18067 606 $aMultimedia Information Systems$3https://scigraph.springernature.com/ontologies/product-market-codes/I18059 615 0$aSignal processing. 615 0$aImage processing. 615 0$aSpeech processing systems. 615 0$aUser interfaces (Computer systems). 615 0$aMultimedia information systems. 615 14$aSignal, Image and Speech Processing. 615 24$aUser Interfaces and Human Computer Interaction. 615 24$aMultimedia Information Systems. 676 $a005.437 676 $a006.7 676 $a4019 676 $a620 702 $aOgunfunmi$b Tokunbo$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aTogneri$b Roberto$4edt$4http://id.loc.gov/vocabulary/relators/edt 702 $aNarasimha$b Madihally (Sim)$4edt$4http://id.loc.gov/vocabulary/relators/edt 906 $aBOOK 912 $a9910299856803321 996 $aSpeech and audio processing for coding, enhancement and recognition$91465850 997 $aUNINA