Lookup NU author(s): Dr Bin Gao, Dr Wai Lok Woo, Emeritus Professor Satnam Dlay
Full text for this publication is not currently held within this repository. Alternative links are provided below where available.
A new unsupervised single-channel source separation method is presented. The proposed method requires no training knowledge, and the separation system is based on nonuniform time-frequency (TF) analysis and feature extraction. Unlike conventional approaches, which concentrate on the spectrogram or its variants, we develop our separation algorithms using an alternative TF representation based on the gammatone filterbank. In particular, we show that the monaural mixed audio signal is considerably more separable in this nonuniform TF domain, and we provide an analysis of signal separability to verify this finding. In addition, we derive two new algorithms that extend the recently published Itakura-Saito nonnegative matrix factorization to a convolutive model for nonstationary source signals. These formulations are based on the Quasi-EM framework and the multiplicative gradient descent (MGD) rule, respectively. Experimental tests show that the proposed method is efficient in extracting the sources' spectral-temporal features, which are characterized by a large dynamic range of energy, leading to a significant improvement in source separation performance.
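For readers unfamiliar with the Itakura-Saito (IS) factorization that the abstract builds on, the following is a minimal sketch of the standard, non-convolutive IS-NMF with multiplicative updates. It is not the convolutive Quasi-EM or MGD algorithm derived in the paper. The function names (`is_divergence`, `is_nmf`), the toy matrix `V`, and the square-root exponent (a majorization-minimization variant chosen here because it keeps the cost non-increasing) are all assumptions made for illustration.

```python
import math
import random


def is_divergence(x, y):
    """Itakura-Saito divergence d(x | y) for positive scalars."""
    r = x / y
    return r - math.log(r) - 1.0


def matmul(W, H):
    """Plain-Python product of W (F x K) and H (K x N)."""
    return [[sum(w * H[k][n] for k, w in enumerate(row))
             for n in range(len(H[0]))] for row in W]


def total_cost(V, W, H):
    """Sum of IS divergences between V and its model W @ H."""
    Vh = matmul(W, H)
    return sum(is_divergence(v, vh)
               for rv, rvh in zip(V, Vh) for v, vh in zip(rv, rvh))


def is_nmf(V, K, n_iter=100, seed=0):
    """Hypothetical helper: factor positive V (F x N) as W (F x K) @ H (K x N)."""
    rng = random.Random(seed)
    F, N = len(V), len(V[0])
    W = [[rng.uniform(0.5, 1.5) for _ in range(K)] for _ in range(F)]
    H = [[rng.uniform(0.5, 1.5) for _ in range(N)] for _ in range(K)]
    costs = [total_cost(V, W, H)]
    for _ in range(n_iter):
        # Update H with W fixed: ratio of the negative to the positive
        # part of the gradient, damped by a square root (assumption).
        Vh = matmul(W, H)
        for k in range(K):
            for n in range(N):
                num = sum(W[f][k] * V[f][n] / Vh[f][n] ** 2 for f in range(F))
                den = sum(W[f][k] / Vh[f][n] for f in range(F))
                H[k][n] *= math.sqrt(num / den)
        # Update W with the new H fixed, using the same rule.
        Vh = matmul(W, H)
        for f in range(F):
            for k in range(K):
                num = sum(H[k][n] * V[f][n] / Vh[f][n] ** 2 for n in range(N))
                den = sum(H[k][n] / Vh[f][n] for n in range(N))
                W[f][k] *= math.sqrt(num / den)
        costs.append(total_cost(V, W, H))
    return W, H, costs


# Toy "power spectrogram" whose rows differ by several orders of magnitude.
V = [[1e-4, 2e-4, 1e-4, 3e-4],
     [5.0, 9.0, 4.0, 12.0],
     [300.0, 100.0, 500.0, 200.0]]
W, H, costs = is_nmf(V, K=2)
```

The IS divergence depends only on the ratio x / y, so low-energy and high-energy TF bins contribute on an equal footing. This scale invariance is the usual reason it is preferred for audio features with a large dynamic range of energy.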
Author(s): Gao B, Woo WL, Dlay SS
Publication type: Article
Publication status: Published
Journal: IEEE Transactions on Circuits and Systems I
Year: 2013
Volume: 60
Issue: 3
Pages: 662-675
Print publication date: 01/10/2012
ISSN (print): 1932-4545
ISSN (electronic): 1940-9990
Publisher: IEEE
URL: http://dx.doi.org/10.1109/TCSI.2012.2215735
DOI: 10.1109/TCSI.2012.2215735