TY - JOUR
T1 - Global and local simplex representations for multichannel source separation
AU - Laufer-Goldshtein, Bracha
AU - Talmon, Ronen
AU - Gannot, Sharon
N1 - Publisher Copyright: © 2014 IEEE.
PY - 2020
Y1 - 2020
N2 - The problem of blind audio source separation (BASS) in noisy and reverberant conditions is addressed by a novel approach, termed Global and LOcal Simplex Separation (GLOSS), which integrates full- and narrow-band simplex representations. We show that the eigenvectors of the correlation matrix between time frames in a certain frequency band form a simplex that organizes the frames according to the speaker activities in the corresponding band. We propose to build two simplex representations: One global based on a broad frequency band and one local based on a narrow band. In turn, the two representations are combined to determine the dominant speaker in each time-frequency (TF) bin. Using the identified dominating speakers, a spectral mask is computed and is utilized for extracting each of the speakers using spatial beamforming followed by spectral postfiltering. The performance of the proposed algorithm is demonstrated using real-life recordings in various noisy and reverberant conditions.
KW - Blind audio source separation (BASS)
KW - beamformer
KW - relative transfer function (RTF)
KW - simplex
KW - spectral mask
UR - http://www.scopus.com/inward/record.url?scp=85082390350&partnerID=8YFLogxK
U2 - 10.1109/TASLP.2020.2975423
DO - 10.1109/TASLP.2020.2975423
M3 - Article
AN - SCOPUS:85082390350
SN - 2329-9290
VL - 28
SP - 914
EP - 928
JO - IEEE/ACM Transactions on Audio, Speech, and Language Processing
JF - IEEE/ACM Transactions on Audio, Speech, and Language Processing
M1 - 9004553
ER -