Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

想问下实时语音转换和RVC比怎么样 #126

Open
skyCloud-CN opened this issue Feb 11, 2025 · 4 comments
Open

想问下实时语音转换和RVC比怎么样 #126

skyCloud-CN opened this issue Feb 11, 2025 · 4 comments

Comments

@skyCloud-CN
Copy link

尤其是中文, RVC经常吐字不清吞字什么的

@Plachtaa
Copy link
Owner

seed-vc在吐字清晰度上是绝对远远强于RVC的,但是缺点在于硬性要求有N卡并且延迟比RVC要高一点

@skyCloud-CN
Copy link
Author

seed-vc在吐字清晰度上是绝对远远强于RVC的,但是缺点在于硬性要求有N卡并且延迟比RVC要高一点

配置好了试了一下,除了实时转换不好用之外效果非常好,nice job!!!

@skyCloud-CN
Copy link
Author

好像实时转换参考音频如果放一个比较长的音频会只提取前几秒的声音? 电音很大, 效果比infer-web差很多, 是checkpoint不一样的原因吗

@Plachtaa
Copy link
Owner

实时转换为了减少每个chunk的合成速度,刻意减少了diffusion steps导致音质会差一些。prompt音频长度被限制也是为了减少每个step的推理时长。以后会尽量想办法优化这一块

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants