Bad performance when using for speech enhancement #33

jkzhang7 · 2020-08-11T17:27:13Z

Hi, very nice work. I noticed that some people use Conv-TasNet for speech enhancement and get good results. However, I ran into problems when using this code for speech enhancement. I am trying to separate clean speech and noise from noisy speech, using the VCTK dataset. The output waveforms look very strange...

When I changed the mask activation to sigmoid, the results were still not good.

I wonder if anyone has thoughts on how to solve this problem. Thanks in advance!

Andong-Li-speech · 2020-08-14T16:15:53Z

It seems to be caused by the choice of loss function, i.e., SI-SDR. SI-SDR does not constrain the magnitude of the waveform, which may cause the chopping effect. I think you can replace the SI-SDR loss with another loss such as SNR or a waveform-domain L1 loss.
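To illustrate the point about magnitude, here is a minimal NumPy sketch (not code from this repository; the function names `si_sdr` and `snr` and the synthetic signals are assumptions for illustration). It shows that scaling an estimate by 10 leaves SI-SDR essentially unchanged while plain SNR drops sharply, so an SI-SDR loss gives the network no reason to keep the output at a sensible level.

```python
import numpy as np

def si_sdr(estimate, target, eps=1e-8):
    """Scale-invariant SDR in dB (zero-mean variant)."""
    estimate = estimate - estimate.mean()
    target = target - target.mean()
    # Project the estimate onto the target; the residual counts as noise.
    alpha = np.dot(estimate, target) / (np.dot(target, target) + eps)
    projection = alpha * target
    noise = estimate - projection
    return 10 * np.log10((np.sum(projection ** 2) + eps) / (np.sum(noise ** 2) + eps))

def snr(estimate, target, eps=1e-8):
    """Plain SNR in dB; sensitive to the scale of the estimate."""
    noise = estimate - target
    return 10 * np.log10((np.sum(target ** 2) + eps) / (np.sum(noise ** 2) + eps))

# Synthetic stand-ins for a clean target and a slightly noisy estimate.
rng = np.random.default_rng(0)
target = rng.standard_normal(16000)
estimate = target + 0.1 * rng.standard_normal(16000)
loud = 10.0 * estimate  # same estimate, 10x the amplitude

print("SI-SDR:", si_sdr(estimate, target), si_sdr(loud, target))
print("SNR:   ", snr(estimate, target), snr(loud, target))
```

In training, the negative of either metric would be used as the loss. The SNR version penalizes the over-scaled output, which is consistent with the suggestion above.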

jkzhang7 · 2020-08-18T23:06:36Z

@Andong-Li-speech Hi, thanks for your suggestions! The results are still not very good after changing the loss function to the SNR loss, but it works much better! If you are also working on this, what kind of loss function are you using? Thanks a lot in advance!

LittleFlyingSheep · 2020-08-28T11:59:54Z

@jkzhang7 Hi, did you get better performance? I am facing the same problem now. Best wishes to you!

forestlee95 · 2021-06-08T03:43:01Z

@LittleFlyingSheep Hi, have you solved this problem? I seem to have the same problem: the magnitude of the separated waveform is too large and it does not sound good. Thanks a lot if you could give me some advice.

LittleFlyingSheep · 2021-06-08T05:57:55Z

@forestlee95 One way I chose to solve it is to scale the waveform manually. I take the max value of the noisy input and use it to rescale the output. This gives relatively good performance. It is only a workaround, though. If you have any other methods, please let me know.
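A minimal sketch of one reading of this workaround, assuming the goal is to make the output's peak amplitude match the peak of the noisy input (the helper name `match_peak` and the toy arrays are hypothetical, not from this repository):

```python
import numpy as np

def match_peak(output, mixture, eps=1e-8):
    """Rescale `output` so its peak absolute amplitude matches that of `mixture`."""
    gain = np.max(np.abs(mixture)) / (np.max(np.abs(output)) + eps)
    return output * gain

# Toy example: the model output is far louder than the noisy input.
mixture = np.array([0.1, -0.5, 0.3])
output = np.array([2.0, -8.0, 4.0])
rescaled = match_peak(output, mixture)
print(rescaled)
```

This only fixes the overall level after inference; it does not change what the model learned, so a loss that constrains magnitude is likely still the cleaner fix.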

sewichou · 2022-03-22T14:35:13Z

@LittleFlyingSheep @jkzhang7 Hi, I am looking for the speech enhancement performance of Conv-TasNet on the VCTK dataset. Do you have any performance figures for it? Much appreciated.

LittleFlyingSheep · 2022-03-22T14:35:35Z

Received.

yyd19948 · 2022-08-31T07:23:35Z

Hi, very nice work. I noticed that some people use Conv-TasNet for speech enhancement and get good results. However, I ran into problems when using this code for speech enhancement. I am trying to separate clean speech and noise from noisy speech, using the VCTK dataset. The output waveforms look very strange...

When I changed the mask activation to sigmoid, the results were still not good.

I wonder if anyone has thoughts on how to solve this problem. Thanks in advance!

How did you solve it? I ran into the same problem while testing.

LittleFlyingSheep · 2022-10-11T08:05:56Z

Received.
