TY - JOUR
T1 - Source Counting and Separation Based on Simplex Analysis
AU - Laufer-Goldshtein, Bracha
AU - Talmon, Ronen
AU - Gannot, Sharon
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2018/12/15
Y1 - 2018/12/15
N2 - Blind source separation is addressed, using a novel data-driven approach, based on a well-established probabilistic model. The proposed method is specifically designed for separation of multichannel audio mixtures. The algorithm relies on spectral decomposition of the correlation matrix between different time frames. The probabilistic model implies that the column space of the correlation matrix is spanned by the probabilities of the various speakers across time. The number of speakers is recovered by the eigenvalue decay, and the eigenvectors form a simplex of the speakers' probabilities. Time frames dominated by each of the speakers are identified exploiting convex geometry tools on the recovered simplex. The mixing acoustic channels are estimated utilizing the identified sets of frames, and a linear umixing is performed to extract the individual speakers. The derived simplexes are visually demonstrated for mixtures of two, three, and four speakers. We also conduct a comprehensive experimental study, showing high separation capabilities in various reverberation conditions.
AB - Blind source separation is addressed, using a novel data-driven approach, based on a well-established probabilistic model. The proposed method is specifically designed for separation of multichannel audio mixtures. The algorithm relies on spectral decomposition of the correlation matrix between different time frames. The probabilistic model implies that the column space of the correlation matrix is spanned by the probabilities of the various speakers across time. The number of speakers is recovered by the eigenvalue decay, and the eigenvectors form a simplex of the speakers' probabilities. Time frames dominated by each of the speakers are identified exploiting convex geometry tools on the recovered simplex. The mixing acoustic channels are estimated utilizing the identified sets of frames, and a linear umixing is performed to extract the individual speakers. The derived simplexes are visually demonstrated for mixtures of two, three, and four speakers. We also conduct a comprehensive experimental study, showing high separation capabilities in various reverberation conditions.
KW - Blind audio source separation (BASS)
KW - relative transfer function (RTF)
KW - simplex
KW - spectral decomposition
UR - http://www.scopus.com/inward/record.url?scp=85055017005&partnerID=8YFLogxK
U2 - 10.1109/TSP.2018.2876349
DO - 10.1109/TSP.2018.2876349
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85055017005
SN - 1053-587X
VL - 66
SP - 6458
EP - 6473
JO - IEEE Transactions on Signal Processing
JF - IEEE Transactions on Signal Processing
IS - 24
M1 - 8493325
ER -