Antoine Liutkus, Fabian-Robert Stöter Inria and LIRMM, University of Montpellier, France [email protected]
- is_blind: no
- additional_training_data: no
- Code: https://github.com/sigsep/sigsep-mus-oracle
- Demos: Not available
The Multichannel Wiener Filter (MWF) exploiting the Local Gaussian Model (LGM) has been initially proposed in the following paper:
Duong, Ngoc QK, Emmanuel Vincent, and Rémi Gribonval. "Under-determined reverberant audio source separation using a full-rank spatial covariance model." IEEE Transactions on Audio, Speech, and Language Processing 18.7 (2010): 1830-1840.
Its core feature is to extend the single channel Wiener filter by exploiting interchannel correlations of the sources.
We write
The local Gaussian model assumes
Gallager, Robert G. "Circularly-symmetric Gaussian random vectors." Technical report, MIT (2008).
This is written:
-
$v_j(f,t)$ is the Power Spectral Density (PSD) of source$j$ at TF bin$(f,t)$ . It can be understood as the energy at that TF bin. -
$R_j(f)$ is the Spatial Covariance Matrix (SCM) of source$j$ at frequency$f$ . It is a$2\times 2$ matrix that encodes the correlations between left and right channels for this source at that frequency. The SCM can be understood as encoding how much one channel gives any information about the other through correlations. Note that in the LGM, the SCM is assumed to be constant over time, which basically means we expect all sources to have a consistant spatial configuration throughout the song.
The LGM model can be shown to generalize several previously proposed models, such as the linear instantaneous and the convolutive, that assume some deterministic relationship between left and right channels. Its strength is to relax such approaches by introducing some stochasticity: channels are only assumed correlated, and not necessarily either independent or deterministically related.
One advantage of the LGM is that it allows for straightforward separation, if we have the true parameters
$\hat{y}j(f,t)=v_j(f,t)R_j(f)\left[\sum_j' v{j'}(f,t)R_{j'}(f)\right]^\dagger x(f,t),$
where
This submission is an oracle, meaning that it knows the true sources to compute the optimal parameters
Given the true sources
- A. Liutkus and F.-R. Stöter, The 2018 Signal Separation Evaluation Campaign, Proceedings of LVA/ICA, 2018
@inproceedings{sisec2018, title={The 2018 signal separation evaluation campaign}, author={A. Liutkus and F.-R. St{"o}ter and N. Ito}, booktitle={International Conference on Latent Variable Analysis and Signal Separation}, year={2018}, }