You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The dataset 'timit_asr' only includes one text phrase. It only includes the transcription "Would such an act of refusal be useful?" multiple times rather than different phrases.
Write a short function to display some random samples of the dataset.
fromdatasetsimportClassLabelimportrandomimportpandasaspdfromIPython.displayimportdisplay, HTMLdefshow_random_elements(dataset, num_examples=10):
assertnum_examples<=len(dataset), "Can't pick more elements than there are in the dataset."picks= []
for_inrange(num_examples):
pick=random.randint(0, len(dataset)-1)
whilepickinpicks:
pick=random.randint(0, len(dataset)-1)
picks.append(pick)
df=pd.DataFrame(dataset[picks])
display(HTML(df.to_html()))
show_random_elements(timit["train"].remove_columns(["file"]))
Expected results
10 random different transcription phrases.
Actual results
10 of the same transcription phrase "Would such an act of refusal be useful?"
Environment info
datasets version: 1.4.1
Platform: macOS-10.15.7-x86_64-i386-64bit
Python version: 3.8.5
PyArrow version: not listed
The text was updated successfully, but these errors were encountered:
Describe the bug
The dataset 'timit_asr' only includes one text phrase. It only includes the transcription "Would such an act of refusal be useful?" multiple times rather than different phrases.
Steps to reproduce the bug
Note: I am following the tutorial https://huggingface.co/blog/fine-tune-wav2vec2-english
Expected results
10 random different transcription phrases.
Actual results
10 of the same transcription phrase "Would such an act of refusal be useful?"
Environment info
datasets
version: 1.4.1The text was updated successfully, but these errors were encountered: