You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have thoroughly reviewed the project documentation but couldn't find any relevant information that meets my needs.
I have searched for existing issues, including closed ones, and found not discussion yet.
I confirm that I am using English to submit this report in order to facilitate communication.
1. Is this request related to a challenge you're experiencing? Tell us your story.
I have worked few days of the project f5-tts and i am grateful to author because they are active and give response with short time. My question: here i have found chunk stream of f5-tts but is it possible real time stream like as xtts v2 stream inference byte label stream or have any possibility add stream inference instead of chunk stream
2. What is your suggested solution?
i have found some inference code like stream but it merge and used cross fade,
` # inference
with torch.inference_mode():
generated, _ = model_obj.sample(
cond=audio,
text=final_text_list,
duration=duration,
steps=nfe_step,
cfg_strength=cfg_strength,
sway_sampling_coef=sway_sampling_coef,
)
Checks
1. Is this request related to a challenge you're experiencing? Tell us your story.
I have worked few days of the project f5-tts and i am grateful to author because they are active and give response with short time. My question: here i have found chunk stream of f5-tts but is it possible real time stream like as xtts v2 stream inference byte label stream or have any possibility add stream inference instead of chunk stream
2. What is your suggested solution?
i have found some inference code like stream but it merge and used cross fade,
` # inference
with torch.inference_mode():
generated, _ = model_obj.sample(
cond=audio,
text=final_text_list,
duration=duration,
steps=nfe_step,
cfg_strength=cfg_strength,
sway_sampling_coef=sway_sampling_coef,
)
https://github.com/SWivid/F5-TTS/blob/main/src/f5_tts/infer/utils_infer.py line number 455. is it possible generated wav yield as a stream
3. Additional context or comments
Details already share into above section
4. Can you help us with this feature?
The text was updated successfully, but these errors were encountered: