You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
/* init the kokoro model for tts */
self.kokoro =sherpaOnnxOfflineTtsKokoroModelConfig(
model: model.path(),
voices: voices.path(),
tokens: tokens.path(),
dataDir: dataDir
)
/* setup the serpa onnx offline tts wrapper */
letmodelConfig=sherpaOnnxOfflineTtsModelConfig(kokoro:self.kokoro!, debug:0)varttsConfig=sherpaOnnxOfflineTtsConfig(model: modelConfig)self.tts =SherpaOnnxOfflineTtsWrapper(config:&ttsConfig)
/* generate the audio file in the temp folder */
lettext="Insane in the membrane"letspeed:Float=1.0letaudioFileName="/tmp/blabla.wav"letaudio=self.tts?.generate(text: text, sid:Int(voiceId), speed: speed)
_ = audio?.save(filename: tempAudioFile.path())
Is there anything wrong with my implementation or is this just the model? I tried Kokoro 0.19 online at Huggingface and couldn't identify a similar issue. Therefore I am not sure if this is just the model, my code or a bug. Hence, reporting it here for potential investigation.
The text was updated successfully, but these errors were encountered:
OS: iOS 18.2, 18.3
Device: iPhone 14, iPad 7th Gen, iPhone simulators
The following generated audio outlines the issue, specifically noticable at 03m 04s:
SoundCloud: sherpa-onnx (iOS) sample
This isolated single generated voice output also shows an "f" being cut off at 0:13:
SoundCloud: sherpa-onnx (iOS) isolated sample
The same sample as above, but generated on HuggingFace with ZeroGPU:
SoundCloud: ZeroGPU sample
The implementation is as follows.
Is there anything wrong with my implementation or is this just the model? I tried Kokoro 0.19 online at Huggingface and couldn't identify a similar issue. Therefore I am not sure if this is just the model, my code or a bug. Hence, reporting it here for potential investigation.
The text was updated successfully, but these errors were encountered: