Chap 15, pg 513 : ModuleNotFoundError: No module named 'torchdata.datapipes' #199

Emmanuel-Ibekwe · 2024-12-24T15:50:12Z

`
from torchtext.datasets import IMDB
train_dataset = IMDB(split='train')
test_dataset = IMDB(split='test')

`
I keep getting this error despite manually installing torchdata. When I tried installing the exact version of torchtext used in the chapter, version 0.10.0, pip couldn't recognize as a valid version.

I can't find any solution to it online

kostuyn · 2025-01-04T00:17:38Z

@Emmanuel-Ibekwe I installed 0.17.0 version the package and it work (for colab)
!pip install portalocker --quiet
!pip install torchtext==0.17.0 --quiet

after installed - Runtime -> Restart runtime option in the Colab menu

(last version of torchtext has a problem pytorch/text#2272)

rasbt · 2025-01-04T16:07:32Z

@Emmanuel-Ibekwe It looks like you are right, and the PyTorch maintainers removed torchtext 0.10.0 from PyPi for some reason. The ch15 notebook here on GitHub should be updated to work with newer versions of torchtext though as @kostuyn mentioned. It would require installing portalocker as well as described above. Let us know in case this still doesn't work.

Emmanuel-Ibekwe · 2025-01-07T15:06:17Z

Thanks @rasbt and @kostuyn for the responses. I did find out through chatgpt (great tool) that the datasets package from the Huggingface community has the imdb dataset. So I used it.
Using the datasets package I got values for the various training and validation accuracies of different epochs that were different from the ones in the text. The model overfitted. At some point both accuracies maintained an accuracy score of 100%. But the model performed terribly on the test dataset. I got an accuracy of 68.5%.
Thanks one more time.

Edit: built a custom dataset for the imdb dataset from torch.utils.data to help in data loading.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chap 15, pg 513 : ModuleNotFoundError: No module named 'torchdata.datapipes' #199

Chap 15, pg 513 : ModuleNotFoundError: No module named 'torchdata.datapipes' #199

Emmanuel-Ibekwe commented Dec 24, 2024 •

edited

Loading

kostuyn commented Jan 4, 2025 •

edited

Loading

rasbt commented Jan 4, 2025

Emmanuel-Ibekwe commented Jan 7, 2025 •

edited

Loading

Chap 15, pg 513 : ModuleNotFoundError: No module named 'torchdata.datapipes' #199

Chap 15, pg 513 : ModuleNotFoundError: No module named 'torchdata.datapipes' #199

Comments

Emmanuel-Ibekwe commented Dec 24, 2024 • edited Loading

kostuyn commented Jan 4, 2025 • edited Loading

rasbt commented Jan 4, 2025

Emmanuel-Ibekwe commented Jan 7, 2025 • edited Loading

Emmanuel-Ibekwe commented Dec 24, 2024 •

edited

Loading

kostuyn commented Jan 4, 2025 •

edited

Loading

Emmanuel-Ibekwe commented Jan 7, 2025 •

edited

Loading