New TTS Model! (collecting feedback before merging) #179
Replies: 13 comments 1 reply
-
Overall, it definitely sounds like an improvement over the older TTS model. At its worst, it can get a bit tinny, shakey, robotic, or hollow in some places, but not more than the older TTS. At its best, it sounds more natural, varied, and emotive (albeit a little too emotive in some places as well). That all being said, I like it. |
Beta Was this translation helpful? Give feedback.
-
Sample 3: podcastfy README |
Beta Was this translation helpful? Give feedback.
-
Sample 4: 5h+ Lex Fridman Podcast with Dario Amodei - Anthropic's CEO |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Sample 6: The Autobiography of Benjamin Franklin |
Beta Was this translation helpful? Give feedback.
-
that's awesome! worked very nicely! have to dig in and see how you did it! |
Beta Was this translation helpful? Give feedback.
-
Thank you so much for the feedback! This has been released: v0.4.0.
https://github.com/souzatharsis/podcastfy/blob/main/podcastfy.ipynb |
Beta Was this translation helpful? Give feedback.
-
I think overshot requested length which was 10000 chars, says it's calling openai tts, maybe I'm doing it wrong, config is below? sounds pretty good !
|
Beta Was this translation helpful? Give feedback.
-
there is a bit of repetition in the qa transcript, I need to probably step through in the debugger and see if it is duplicated in the prompts that are sent ... search for : propublica wrote about summary.txt |
Beta Was this translation helpful? Give feedback.
-
Thank you @druce ! Sounds like it's working, right? However, in order to use the new TTS model you need to https://github.com/souzatharsis/podcastfy/blob/main/usage/config.md#setting-up-google-tts-model I am still setting openai as the default model since it does not require extra setup steps as in the Google TTS Models. |
Beta Was this translation helpful? Give feedback.
-
Oh and thanks for letting me know about |
Beta Was this translation helpful? Give feedback.
-
wow that's so much better!
We should create a markdown doc with side-by-side comparison of the TTS
models available - such a huge impact in output quality and engagement!
<http://linkedin.com/in/tharsissouza>
…On Sat, Nov 16, 2024 at 5:28 PM Druce Vertes ***@***.***> wrote:
that worked, sounds great! YouTube <https://youtu.be/0LjESoSexXc>
—
Reply to this email directly, view it on GitHub
<#179 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADTMY3OV42U2QE66CUAVR532A6TFTAVCNFSM6AAAAABR3VISAWVHI2DSMVQWIX3LMV43URDJONRXK43TNFXW4Q3PNVWWK3TUHMYTCMRXHE4DANA>
.
You are receiving this because you modified the open/close state.Message
ID: ***@***.***
com>
|
Beta Was this translation helpful? Give feedback.
-
Here's a sample from the famous Newsletter AI News: Source: https://buttondown.com/ainews/archive/ainews-stripe-lets-agents-spend-money-with/ Audio: |
Beta Was this translation helpful? Give feedback.
-
Hi, I think we have a break-through... The audio sample below uses what I believe is NotebookLM's TTS model!
Would love your feedback - I'll merge into main once I collect enough feedback:
Sample 1: open-notebook.ai website
(longterm param set to half the default)
https://soundcloud.com/manoel-tobias-here/open-notebook?in=manoel-tobias-here/sets/new-tts-model&si=90e10620ed10402d9cfb1a666c298c82
Sample 2: Google Handbook on Prompt Engineering
(longterm param set to default)
https://soundcloud.com/manoel-tobias-here/prompt-eng-handbook?in=manoel-tobias-here/sets/new-tts-model&si=6ea2fc3783cd445fa8362322c8aa6555
Beta Was this translation helpful? Give feedback.
All reactions