Output structure for frame-level predictions #11

cwitkowitz · 2022-11-28T02:46:07Z

The output structure of waveform-to-label models seems to be more targeted towards intervallic predictions, such as music tagging or instrument recognition. It would be nice to have a more straightforward way to output frame-level predictions, such as for frame-wise music transcription.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output structure for frame-level predictions #11

Output structure for frame-level predictions #11

cwitkowitz commented Nov 28, 2022

Output structure for frame-level predictions #11

Output structure for frame-level predictions #11

Comments

cwitkowitz commented Nov 28, 2022