You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As with other combinations, some of the generated data seem to not be parsed properly.
While it is possible to synthesize REALISED_DURATIONS from TEXT or EMOTIONML,
generating audio from these is not possible (server gets stuck with 100% cpu).
An example input is
0.080000006 125 h
0.14000002 125 a
0.19500002 125 l
0.30500004 125 o:
0.64500004 125 _
It would be very helpful to have this fine grained control over the output.
The text was updated successfully, but these errors were encountered:
I believe that the source of this issue is that REALISED_DURATIONS and REALISED_ACOUSTPARAMS were intended only as output types, but when they were originally introduced (in bd582c8 and eb8807b, respectively), their isInputType parameter was set to true.
As with other combinations, some of the generated data seem to not be parsed properly.
While it is possible to synthesize REALISED_DURATIONS from TEXT or EMOTIONML,
generating audio from these is not possible (server gets stuck with 100% cpu).
An example input is
0.080000006 125 h
0.14000002 125 a
0.19500002 125 l
0.30500004 125 o:
0.64500004 125 _
It would be very helpful to have this fine grained control over the output.
The text was updated successfully, but these errors were encountered: