Long-text speech synthesis optimization #3365

Open
5 tasks done
zhanghx0905 opened this issue Dec 11, 2024 · 1 comment

Comments

@zhanghx0905
Contributor

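Routine checks

  • I have confirmed that there is currently no similar feature request
  • I have confirmed that I have upgraded to the latest version
  • I have read the project README in full and confirmed that the current version cannot meet this need
  • I understand and am willing to follow up on this feature, help with testing, and provide feedback
  • I understand and accept the above, and I understand that the maintainers have limited time, so feature requests that do not follow the rules may be ignored or closed directly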

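Feature description

OpenAI's default TTS (Text-to-Speech) API limits input text to at most 4096 characters. When the input exceeds this limit, the API returns an error. To improve the user experience, I would like the project to handle text longer than 4096 characters automatically: split it into multiple parts, call the TTS API for each part, and then merge the resulting audio segments into a single complete audio file.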
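To make the request concrete, here is a minimal sketch of the split-and-merge idea, not a proposal for how the project should implement it. The names `split_text`, `synthesize_long_text`, and the `synthesize` callback are hypothetical; the callback stands in for whatever function actually calls the TTS API and returns audio bytes. The sketch also assumes an audio format whose segments can be joined by plain byte concatenation (for example raw PCM); container formats would likely need a real audio merge step instead.

```python
import re
from typing import Callable

MAX_CHARS = 4096  # per-request input limit described in this issue

# Zero-width split points right after sentence-ending punctuation or a newline.
_SENTENCE_END = re.compile(r"(?<=[。！？；!?.;\n])")


def split_text(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters,
    preferring sentence boundaries. Joining the chunks restores the text."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    chunks: list[str] = []
    current = ""
    for sentence in _SENTENCE_END.split(text):
        # A single sentence longer than the limit has to be cut hard.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        # Start a new chunk when this sentence would overflow the current one.
        if len(current) + len(sentence) > limit:
            chunks.append(current)
            current = ""
        current += sentence
    if current:
        chunks.append(current)
    return chunks


def synthesize_long_text(
    text: str,
    synthesize: Callable[[str], bytes],  # hypothetical: one TTS call per chunk
    limit: int = MAX_CHARS,
) -> bytes:
    """Synthesize each chunk in order and concatenate the audio bytes.
    Assumption: the audio format tolerates plain concatenation (e.g. raw PCM)."""
    return b"".join(synthesize(chunk) for chunk in split_text(text, limit))
```

Splitting at sentence boundaries rather than at a fixed offset is a design choice meant to avoid audible breaks in the middle of a sentence; the hard cut is only a fallback for a sentence that is itself longer than the limit.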

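Use cases

Long-text speech playback: in scenarios where speech content is generated automatically, this ensures the process does not fail because of the text length limit, making automated pipelines more robust.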

Related examples

@William-715

Agreed.
