Bug: Tutorials importing test audio files do not work on Google Colab #147
Comments
I think the way the tutorials are meant to work may differ from the way you're using them. I would recommend testing this again once that tutorial is merged into mainline, as that should let us see whether the "Open in Colab" button works as expected. Another fix for what you're showing is to stop referencing these files by relative paths and instead use wget or something similar to download them from GitHub onto Colab. |
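For reference, the wget idea can also be written in plain Python, which avoids relying on a shell command inside the notebook. This is only a sketch: `download_file` is a hypothetical helper (not part of senselab), and the URL in the usage comment is a placeholder rather than a confirmed location of the test file.

```python
import urllib.request
from pathlib import Path


def download_file(url: str, dest: str) -> Path:
    """Download `url` to `dest`, creating parent folders as needed.

    Hypothetical helper for illustration; not part of senselab.
    """
    dest_path = Path(dest)
    dest_path.parent.mkdir(parents=True, exist_ok=True)
    # urlretrieve writes the response body to the given local filename.
    urllib.request.urlretrieve(url, str(dest_path))
    return dest_path


# Usage in a notebook cell (placeholder URL; substitute the raw GitHub URL
# of the audio file the tutorial needs):
# wav_path = download_file(
#     "https://example.com/audio_48khz_mono_16bits.wav",
#     "tutorial_data/audio_48khz_mono_16bits.wav",
# )
# audio = Audio.from_filepath(str(wav_path))
```

The tutorial would then pass the returned path to `Audio.from_filepath` instead of a path relative to the repository checkout.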
@wilke0818 nice catch! Are you suggesting including a piece of code for downloading some audio as part of the tutorials? Something like this (https://github.com/sensein/fab/blob/main/tutorials/voice_anonymization/voice_anonymization.ipynb):
# This variable holds the web address from which we'll download the EmoDB dataset.
# It's like a treasure map guiding us to the wonderful voice recordings!
dataset_url = "http://emodb.bilderbar.info/download/download.zip"
# The data_folder variable points to the location where we'll store all the data and audio recordings.
# Think of it as our backstage area, well-organized and ready to showcase the talents of our voices!
data_folder = "./data/"
# The dataset_name variable will be the name we give to the EmoDB dataset once we download it.
# Just a friendly label to recognize it easily when we work with it later on.
dataset_name = "emodb_dataset"
|
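The three variables above only describe where the data comes from and where it should go. A minimal sketch of a download step that could consume them follows; it assumes the URL points at a zip archive, and `download_and_extract` is a hypothetical name, not necessarily what the linked notebook does.

```python
import urllib.request
import zipfile
from pathlib import Path


def download_and_extract(dataset_url: str, data_folder: str, dataset_name: str) -> Path:
    """Fetch a zip archive and unpack it into data_folder/dataset_name.

    Hypothetical helper; skips the download if the target folder exists.
    """
    target_dir = Path(data_folder) / dataset_name
    if target_dir.is_dir():
        return target_dir
    Path(data_folder).mkdir(parents=True, exist_ok=True)
    archive_path = Path(data_folder) / f"{dataset_name}.zip"
    urllib.request.urlretrieve(dataset_url, str(archive_path))
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target_dir)
    archive_path.unlink()  # remove the archive once it has been unpacked
    return target_dir


# Usage with the variables defined above:
# dataset_dir = download_and_extract(dataset_url, data_folder, dataset_name)
```

Because the function returns early when the folder is already present, re-running the notebook cell does not download the archive again.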
Yeah I mean technically it could be anything. I was thinking (not certain this would work) in colab
|
Yours works if you want an entire dataset, though at that point it might be better to load it from HuggingFace and convert it to a SenselabDataset, which is the approach I used in the SER tutorial. |
Also, my note to Miles above was that the code you have may work once it is pulled into mainline. I have found that, when working with the notebooks, there is an odd GitHub/Colab interaction where Colab tries to use notebooks from the main branch on GitHub. |
oh wow! |
I agree that for most cases having one or two files is more than enough |
what branch/PR are you referring to? |
I'm not exactly sure what you mean by this, as |
I just re-tested it and I see that I was mistaken (I thought |
how about differentiating the flow on
|
Gotcha. I wasn't sure if the missing |
@fabiocat93 not sure how much sense it makes to differentiate the two cases. The "local flow" I guess only really affects those who are running this after setting everything up for development, which probably will not be most people. I feel like the tutorial shouldn't assume anything regarding. Also in both cases it seems like we need to have |
@900miles, can you handle this issue in all the tutorials? You may create a utility for downloading an existing dataset to be processed. I have tentatively assigned this to you |
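One possible shape for such a utility, sketched under assumptions: it returns the repository copy of a file when the relative path resolves (a local development checkout) and otherwise downloads the file into a cache folder (Colab or a bare pip install). The name `resolve_test_file`, the `base_url` parameter, and the `tutorial_data` default are all hypothetical.

```python
import urllib.request
from pathlib import Path


def resolve_test_file(relative_path: str, base_url: str, cache_dir: str = "tutorial_data") -> Path:
    """Return a usable local path for a tutorial's test file.

    Hypothetical helper: prefers the file at `relative_path`; if it is
    missing, downloads `<base_url>/<file name>` into `cache_dir` once.
    """
    local = Path(relative_path)
    if local.is_file():
        return local
    cached = Path(cache_dir) / local.name
    if not cached.is_file():
        cached.parent.mkdir(parents=True, exist_ok=True)
        url = f"{base_url.rstrip('/')}/{local.name}"
        urllib.request.urlretrieve(url, str(cached))
    return cached


# Usage (base_url is a placeholder for wherever the test files are hosted):
# path = resolve_test_file(
#     "../src/tests/data_for_testing/audio_48khz_mono_16bits.wav",
#     base_url="https://example.com/data_for_testing",
# )
# audio = Audio.from_filepath(str(path))
```

With this approach the tutorial code is identical in both environments, so the notebook does not need to detect whether it is running on Colab.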
Description
Any tutorial that imports test audio files (e.g. Audio.from_filepath("../src/tests/data_for_testing/audio_48khz_mono_16bits.wav")) does not work on Google Colab, as there is no audio file to load in that environment. This affects most if not all of the tutorials that we currently have.
Steps to Reproduce
Open a notebook tutorial, for example speech_to_text.ipynb. Add !pip install senselab to the top of the file, and then run.
Expected Results
The tutorial runs as expected.
Actual Results
When running the following code block:
I get the following error:
Additional Notes
No response