Proposal for Providing Ground Truth Data for Stuttering Speech Dataset #15

oven521 · 2024-03-13T16:48:13Z

Hi @seblemaguer , @colincsl , I am currently engaged in stuttering-related speech research, and have been utilizing your dataset for our studies. During our utilization, we have observed that the stuttering dataset lacks accurate ground truth annotations. We understand that manually annotating ground truth for stuttering audio is a highly tedious and time-consuming task.

Therefore, we propose to offer our method to provide the necessary ground truth annotations for the dataset. Our technology primarily transforms stuttering audios into clear, stutter-free versions: starting by identifying stutter components within the audio and converting them into tokens, then using advanced large models for precise corrections, and finally reconstructing the audio with voice cloning technology to closely match the original recordings. In this process, repaired texts are also generated.

We believe these highly accurate texts could serve as the ground truth annotations for the dataset. We have processed the entire dataset and generated corresponding repaired texts. As an example, we have provided content from the beginning of the dataset segment “male-episode-4-with-joseph”.

Hi everybody, this is Pam coming to you from the very occasional mail series that I do called He Stutters, She Asks Him. That can be found on the blog, Make Room for the Stuttering, and it's also freely available as a download on iTunes. Today I have the opportunity and pleasure to chat with actually a very good friend of mine. So I want to say hello to Joseph. Hey, Joseph. How's it going? 
Hey, Pam. Thanks for having me. 
Oh, I'm really excited about it. I think this is going to be a really insightful conversation, and thanks for agreeing to share with us. So to get us started, could you introduce yourself and tell us who you are and where you're from and what you do to keep yourself busy? 
Well, you begin with the easiest question in the world by asking me or us to say our name. We all love doing that. But before I do that, I'm sure glad you don't have a meter on my heart because it's pounding 100 miles an hour. 
Why? 
It's exciting to be on this call and talk about us, to talk about me, who's better, and it's a big part of my life. 
Well, we're really interested in hearing your story, so tell that heart to just slow down and just take a deep breath and just be yourself. 
Okay. So my introduction, my name is Joseph Diaz. I live in Dallas, born and raised in Dallas, Texas, longtime member of the NSA, and I've had a long career with the local power light company. And then the past 10 years I've had like three different kinds of jobs, different kinds of work. And right now I work for a wine company. We go to grocery stores and take orders of wine for the grocery stores. 
Do you do that in person with people in the grocery stores, or do you do that over the phone? 
No, no, in person. We go inside the stores, and we look on the shelves. We need three bottles of this. We need seven bottles of that in terms of our wine that we deal with. And so, is there a lot of communication involved with your job? It sounds like with placing orders and stuff, it sounds like there's a lot of talking involved. 
Well, you know, there really isn't. I work with my co-worker, and he does more of the ordering, and we, you know, again, we take the order, and then he goes online in our computer system and just clicks in what we need on the computer. So there's not too much talking. Occasionally with the management to kind of see, to ask them, okay, we're having three cases of this type of wine coming in. We're going to put it on this aisle and just kind of go over some things with them. So a lot of the discussion, if needed, is with these store directors. 
So working in the wine industry, this time of year must have been like crazy busy, huh? 
Okay, yeah, crazy busy is the right word. It was, you know, since Thanksgiving or before Thanksgiving, it's when, you know, all the holidays, you know, bump one after another. So we were very busy and just lots going on, lots of long hours. But hopefully last night it's all over with and now things should come down for the better of all of us. Now, a lot of people who stutter, myself included, when things are really much more stressful than usual, sometimes that has an impact on my stuttering. Do you find that's the case with you, like during these really busy times at work? Do you find that your speech is impacted at all?
 You know, I think my speech is pretty well under somewhat stress with my coworkers and stuff. It seems because if I have to explain something or have to answer a question, it's more to the point. You know, sometimes with our speech, answering a question has to be to the point. And a lot of times my speech is more fluent answering the question, not giving a whole three-minute explanation, just a, you know, 10-second explanation, to the point. 
Ah, so you are a man of few words then, basically. 
Well, no, no, just if needed, you know, if needed. And then behind everything, I talk to my coworker and let out my frustration. How come the boss wants to know this and wants to know that and stuff like that.

The text was updated successfully, but these errors were encountered:

Jinming00 · 2024-09-30T05:10:07Z

Hello，i'm pretty interested in your work , is it open sourced? How can i get access to it?

Aagam-ML · 2024-10-09T07:52:26Z

Hi @seblemaguer , @colincsl , I am currently engaged in stuttering-related speech research, and have been utilizing your dataset for our studies. During our utilization, we have observed that the stuttering dataset lacks accurate ground truth annotations. We understand that manually annotating ground truth for stuttering audio is a highly tedious and time-consuming task.

Therefore, we propose to offer our method to provide the necessary ground truth annotations for the dataset. Our technology primarily transforms stuttering audios into clear, stutter-free versions: starting by identifying stutter components within the audio and converting them into tokens, then using advanced large models for precise corrections, and finally reconstructing the audio with voice cloning technology to closely match the original recordings. In this process, repaired texts are also generated.

We believe these highly accurate texts could serve as the ground truth annotations for the dataset. We have processed the entire dataset and generated corresponding repaired texts. As an example, we have provided content from the beginning of the dataset segment “male-episode-4-with-joseph”.

Hi everybody, this is Pam coming to you from the very occasional mail series that I do called He Stutters, She Asks Him. That can be found on the blog, Make Room for the Stuttering, and it's also freely available as a download on iTunes. Today I have the opportunity and pleasure to chat with actually a very good friend of mine. So I want to say hello to Joseph. Hey, Joseph. How's it going? 
Hey, Pam. Thanks for having me. 
Oh, I'm really excited about it. I think this is going to be a really insightful conversation, and thanks for agreeing to share with us. So to get us started, could you introduce yourself and tell us who you are and where you're from and what you do to keep yourself busy? 
Well, you begin with the easiest question in the world by asking me or us to say our name. We all love doing that. But before I do that, I'm sure glad you don't have a meter on my heart because it's pounding 100 miles an hour. 
Why? 
It's exciting to be on this call and talk about us, to talk about me, who's better, and it's a big part of my life. 
Well, we're really interested in hearing your story, so tell that heart to just slow down and just take a deep breath and just be yourself. 
Okay. So my introduction, my name is Joseph Diaz. I live in Dallas, born and raised in Dallas, Texas, longtime member of the NSA, and I've had a long career with the local power light company. And then the past 10 years I've had like three different kinds of jobs, different kinds of work. And right now I work for a wine company. We go to grocery stores and take orders of wine for the grocery stores. 
Do you do that in person with people in the grocery stores, or do you do that over the phone? 
No, no, in person. We go inside the stores, and we look on the shelves. We need three bottles of this. We need seven bottles of that in terms of our wine that we deal with. And so, is there a lot of communication involved with your job? It sounds like with placing orders and stuff, it sounds like there's a lot of talking involved. 
Well, you know, there really isn't. I work with my co-worker, and he does more of the ordering, and we, you know, again, we take the order, and then he goes online in our computer system and just clicks in what we need on the computer. So there's not too much talking. Occasionally with the management to kind of see, to ask them, okay, we're having three cases of this type of wine coming in. We're going to put it on this aisle and just kind of go over some things with them. So a lot of the discussion, if needed, is with these store directors. 
So working in the wine industry, this time of year must have been like crazy busy, huh? 
Okay, yeah, crazy busy is the right word. It was, you know, since Thanksgiving or before Thanksgiving, it's when, you know, all the holidays, you know, bump one after another. So we were very busy and just lots going on, lots of long hours. But hopefully last night it's all over with and now things should come down for the better of all of us. Now, a lot of people who stutter, myself included, when things are really much more stressful than usual, sometimes that has an impact on my stuttering. Do you find that's the case with you, like during these really busy times at work? Do you find that your speech is impacted at all?
 You know, I think my speech is pretty well under somewhat stress with my coworkers and stuff. It seems because if I have to explain something or have to answer a question, it's more to the point. You know, sometimes with our speech, answering a question has to be to the point. And a lot of times my speech is more fluent answering the question, not giving a whole three-minute explanation, just a, you know, 10-second explanation, to the point. 
Ah, so you are a man of few words then, basically. 
Well, no, no, just if needed, you know, if needed. And then behind everything, I talk to my coworker and let out my frustration. How come the boss wants to know this and wants to know that and stuff like that.

Heyy i hope you are doing amazing , have you found the ground truth data for sep28k?? . If yes can you tell mw how you got it?.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Proposal for Providing Ground Truth Data for Stuttering Speech Dataset #15

Proposal for Providing Ground Truth Data for Stuttering Speech Dataset #15

oven521 commented Mar 13, 2024 •

edited

Loading

Jinming00 commented Sep 30, 2024

Uh oh!

Aagam-ML commented Oct 9, 2024

Uh oh!

Proposal for Providing Ground Truth Data for Stuttering Speech Dataset #15

Proposal for Providing Ground Truth Data for Stuttering Speech Dataset #15

Comments

oven521 commented Mar 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Jinming00 commented Sep 30, 2024

Uh oh!

Aagam-ML commented Oct 9, 2024

Uh oh!

oven521 commented Mar 13, 2024 •

edited

Loading