I have a dataset hosted on Hugging Face with a custom script here. I'm loading the dataset as below.

from datasets import load_dataset, IterableDatasetDict

dataset = IterableDatasetDict()
dataset["train"] = load_dataset("RitchieP/VerbaLex_voice", "ar", split="train", use_auth_token=True, streaming=True)
dataset["test"] = load_dataset("RitchieP/VerbaLex_voice", "ar", split="test", use_auth_token=True, streaming=True)

When I try to inspect the loaded data with

list(dataset["train"].take(1))

it fails with a stack trace.

After looking into the stack trace and referring to the source code, it looks like it's trying to access a directory in the notebook's environment, and I don't understand why. I'm not sure if it's a bug in the Datasets library, so I'm opening a discussion first. Feel free to ask for more information if needed. Appreciate any help in advance!
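One thing that may help when reading a trace like this: with streaming=True, examples are produced lazily, so a path problem inside the example generator typically only shows up on the first read (for example at take(1)), not when load_dataset returns. The snippet below is a minimal pure-Python sketch of that behaviour. It does not use the datasets library; `generate_examples` and the directory path are hypothetical stand-ins, and it is not a reproduction of the actual cause in this dataset's script.

```python
import os
from itertools import islice


def generate_examples(audio_dir):
    """Hypothetical stand-in for a loading script's example generator."""
    # Nothing in this body runs until the generator is iterated.
    for name in sorted(os.listdir(audio_dir)):
        yield {"path": os.path.join(audio_dir, name)}


# Creating the generator succeeds even though the directory does not exist,
# similar to how a streaming load can return without touching the data.
examples = generate_examples("/hypothetical/missing/dir")

first = None
error_message = None
try:
    # Analogous to list(dataset["train"].take(1)): the first read triggers the work.
    first = list(islice(examples, 1))
except FileNotFoundError as err:
    error_message = f"lazy failure surfaced on first read: {err}"
    print(error_message)
```

If the real trace points at a local directory, the equivalent place to look would be wherever the dataset script builds its file paths.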
Replies: 1 comment
This issue has been solved. The solution is in a PR for the dataset on the Hugging Face Hub, linked below.
https://huggingface.co/datasets/RitchieP/VerbaLex_voice/discussions/6