Audio source separation and speech enhancement /

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and be...

Full description

Saved in:

Bibliographic Details
Other Authors:	Vincent, Emmanuel (Research scientist) (Editor), Virtanen, Tuomas (Editor), Gannot, Sharon (Editor)
Format:	Electronic eBook
Language:	English
Published:	Hoboken, NJ : John Wiley & Sons, 2018.
Subjects:	Speech processing systems. Automatic speech recognition.
Online Access:	CONNECT

MARC


LEADER	00000cam a2200000 i 4500
001	in00006066699
006	m o d
007	cr \|\|\|\|\|\|\|\|\|\|\|
008	180430s2018 nju ob 001 0 eng
005	20220713131325.9
010			\|a 2018021195
035			\|a 1WRLDSHRon1033578988
040			\|a DLC \|b eng \|e rda \|e pn \|c DLC \|d OCLCF \|d N$T \|d YDX \|d UIU \|d EBLCP \|d NLE \|d MERER \|d UAB \|d RECBK \|d DLC \|d U3W \|d OCLCQ \|d DG1 \|d OCLCQ \|d COO \|d OCLCQ \|d BRF \|d UKAHL \|d VT2 \|d OCLCO
019			\|a 1100461341 \|a 1192349952 \|a 1240507930
020			\|a 9781119279914 \|q (epub)
020			\|a 1119279917 \|q (epub)
020			\|a 9781119279884 \|q (pdf)
020			\|a 1119279887 \|q (pdf)
020			\|a 9781119279860 \|q (electronic bk. ; \|q oBook)
020			\|a 1119279860 \|q (electronic bk. ; \|q oBook)
020			\|z 9781119279891 \|q (cloth)
020			\|a 1119279895
020			\|a 9781119279891
035			\|a (OCoLC)1033578988 \|z (OCoLC)1100461341 \|z (OCoLC)1192349952 \|z (OCoLC)1240507930
042			\|a pcc
050	1	0	\|a TK7882.S65
082	0	0	\|a 006.4/54 \|2 23
049			\|a TXMM
245	0	0	\|a Audio source separation and speech enhancement / \|c edited by Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot.
264		1	\|a Hoboken, NJ : \|b John Wiley & Sons, \|c 2018.
300			\|a 1 online resource
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
504			\|a Includes bibliographical references and index.
588	0		\|a Print version record and CIP data provided by publisher.
520			\|a Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software. Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting. Key features: -Consolidated perspective on audio source separation and speech enhancement.-Both historical perspective and latest advances in the field, e.g. deep neural networks.-Diverse disciplines: array processing, machine learning, and statistical signal processing.-Covers the most important techniques for both single-channel and multichannel processing. This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.
505	0		\|a Intro; Table of Contents; List of Authors; Preface; Acknowledgment; Notations; Acronyms; About the Companion Website; Part I: Prerequisites; Chapter 1: Introduction; 1.1 Why are Source Separation and Speech Enhancement Needed?; 1.2 What are the Goals of Source Separation and Speech Enhancement?; 1.3 How can Source Separation and Speech Enhancement be Addressed?; 1.4 Outline; Bibliography; Chapter 2: Time-Frequency Processing: Spectral Properties; 2.1 Time-Frequency Analysis and Synthesis; 2.2 Source Properties in the Time-Frequency Domain; 2.3 Filtering in the Time-Frequency Domain
505	8		\|a 2.4 SummaryBibliography; Chapter 3: Acoustics: Spatial Properties; 3.1 Formalization of the Mixing Process; 3.2 Microphone Recordings; 3.3 Artificial Mixtures; 3.4 Impulse Response Models; 3.5 Summary; Bibliography; Chapter 4: Multichannel Source Activity Detection, Localization, and Tracking; 4.1 Basic Notions in Multichannel Spatial Audio; 4.2 Multi-Microphone Source Activity Detection; 4.3 Source Localization; 4.4 Summary; Bibliography; Part II: Single-Channel Separation and Enhancement; Chapter 5: Spectral Masking and Filtering; 5.1 Time-Frequency Masking
505	8		\|a 5.2 Mask Estimation Given the Signal Statistics5.3 Perceptual Improvements; 5.4 Summary; Bibliography; Chapter 6: Single-Channel Speech Presence Probability Estimation and Noise Tracking; 6.1 Speech Presence Probability and its Estimation; 6.2 Noise Power Spectrum Tracking; 6.3 Evaluation Measures; 6.4 Summary; Bibliography; Chapter 7: Single-Channel Classification and Clustering Approaches; 7.1 Source Separation by Computational Auditory Scene Analysis; 7.2 Source Separation by Factorial HMMs; 7.3 Separation Based Training; 7.4 Summary; Bibliography
505	8		\|a Chapter 8: Nonnegative Matrix Factorization8.1 NMF and Source Separation; 8.2 NMF Theory and Algorithms; 8.3 NMF Dictionary Learning Methods; 8.4 Advanced NMF Models; 8.5 Summary; Bibliography; Chapter 9: Temporal Extensions of Nonnegative Matrix Factorization; 9.1 Convolutive NMF; 9.2 Overview of Dynamical Models; 9.3 Smooth NMF; 9.4 Nonnegative State-Space Models; 9.5 Discrete Dynamical Models; 9.6 The Use of Dynamic Models in Source Separation; 9.7 Which Model to Use?; 9.8 Summary; 9.9 Standard Distributions; Bibliography; Part III: Multichannel Separation and Enhancement
505	8		\|a Chapter 10: Spatial Filtering10.1 Fundamentals of Array Processing; 10.2 Array Topologies; 10.3 Data-Independent Beamforming; 10.4 Data-Dependent Spatial Filters: Design Criteria; 10.5 Generalized Sidelobe Canceler Implementation; 10.6 Postfilters; 10.7 Summary; Bibliography; Chapter 11: Multichannel Parameter Estimation; 11.1 Multichannel Speech Presence Probability Estimators; 11.2 Covariance Matrix Estimators Exploiting SPP; 11.3 Methods for Weakly Guided and Strongly Guided RTF Estimation; 11.4 Summary; Bibliography; Chapter 12: Multichannel Clustering and Classification Approaches
590			\|a O'Reilly Online Learning Platform: Academic Edition (SAML SSO Access)
650		0	\|a Speech processing systems.
650		0	\|a Automatic speech recognition.
700	1		\|a Vincent, Emmanuel \|c (Research scientist), \|e editor.
700	1		\|a Virtanen, Tuomas, \|e editor.
700	1		\|a Gannot, Sharon, \|e editor.
730	0		\|a WORLDSHARE SUB RECORDS
776	0	8	\|i Print version: \|t Audio source separation and speech enhancement. \|d Hoboken, NJ : John Wiley & Sons, 2018 \|z 9781119279891 \|w (DLC) 2018013163
856	4	0	\|u https://go.oreilly.com/middle-tennessee-state-university/library/view/-/9781119279891/?ar \|z CONNECT \|3 O'Reilly \|t 0
949			\|a ho0
994			\|a 92 \|b TXM
998			\|a wi \|d z
999	f	f	\|s 8c367c90-c53e-4654-a748-0d44432973e4 \|i f3cca35e-b2d7-49a5-a9ba-8e29527d7305 \|t 0
952	f	f	\|a Middle Tennessee State University \|b Main \|c James E. Walker Library \|d Electronic Resources \|t 0 \|e TK7882.S65 \|h Library of Congress classification
856	4	0	\|3 O'Reilly \|t 0 \|u https://go.oreilly.com/middle-tennessee-state-university/library/view/-/9781119279891/?ar \|z CONNECT

Audio source separation and speech enhancement /

MARC

Similar Items