#SpeechSynthesis

2025-05-23

🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬

Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬

follow hem here: @thorstenvoice
or on YouTube: youtube.com/@ThorstenMueller YouTube channel!

#Accessibility #FLOSS #TTS #ParlerTTS #OpenSource #VoiceTech #TextToSpeech #AI #CoquiAI #VoiceAssistant #Sprachassistent #MachineLearning #AccessibilityMatters #FLOSS #TTS #OpenSource #Inclusivity #FOSS #Coqui #AI #CoquiAI #VoiceAssistant #Sprachassistent #VoiceTechnology #KünstlicheStimme #MachineLearning #Python #Rhasspy #TextToSpeech #VoiceTech #STT #SpeechSynthesis #SpeechRecognition #Sprachsynthese #ArtificialVoice #VoiceCloning #Spracherkennung #CoquiTTS #voice #a11y #ScreenReader

N-gated Hacker Newsngate
2025-05-01

🚀🎤 Behold the future of speech synthesis: LLama-powered nonsense! In a world where tech jargon reigns supreme, why not strap a llama to your neural network and call it innovation? 🤔💡 Perfect for those who think "simple" means adding to complexity. 🦙💬
llasatts.github.io/llasatts/

Farooq | فاروقfarooqkz@cr8r.gg
2025-04-27

After my #wake_word_detection #research has delievered fruits, I have plans to continue works in the voice domain. I would love if I could train a #TTS model which has #British accent so I would use it to practice.

I was wondering if I could do the inference on #A311D #NPU. However, as I am skimming papers of different models, having inference on A311D with reasonable performance seems unlikely. Even training of these models on my entry level #IntelArc #GPU would be painful.

Maybe I could just finetune an already existing models. I am also thinking about using #GeneticProgramming for some components of these TTS models to see if there will be better inference performance.

There are #FastSpeech2 and #SpeedySpeech which look promising. I wonder how much natural their accents will be. But they would be good starting points.

BTW, if anyone needs opensource models, I would love to work as a freelancer and have an #opensource job. Even if someone can just provide access to computation resources, that would be good.

#forhire #opensourcejob #job #hiring

#AI #VoiceAI #opensourceai #ml #speechrecognition #speechsynthesis #texttospeech #machinelearning #artificialintelligence #getfedihired #FediHire #hireme #wakeworddetection

N-gated Hacker Newsngate
2025-04-07

🤖🚀 "AI-prepped engineer nails interview... until they hit the mute button and let the bots do the talking. Who knew their speech synthesis module was on the fritz? 🤷‍♂️🎙️ "
kapwing.com/blog/what-its-like

Em-squaredemsquared
2025-03-31

The Voder speech synthesis machine (1939). Kraftwerk would've loved this. Hand operated keys and foot pedal control.
youtube.com/watch?v=0rAyrmm7vv

Farooq | فاروقfarooqkz@cr8r.gg
2025-02-07

For learning languages, do you think it's a good idea to practice with an AI Speech Recognition and an AI Speech Synthesis engine?

I'm specifically interesting in British English and German.

#AI #ML #LanguageLearning #Learning #SprachenLernen #British #English #DeutchLernen #EnglishLearning #speechrecognition #speechtotext #speechrecognitionsoftware #speechsynthesis #SpeechSynthesizer

2025-02-05

@MXC48 @niavy Oh, wow ! Je viens de tester #Sherpa #TTS, c'est une dinguerie. Les progrès sont spectaculaires entre il y a quelques années, où je trouvais limite que des voix hyper robotiques, en dehors de la #SynthèseVocale de #Google, et maintenant ^^

#TextToSpeech #SherpaTTS #SpeechSynthesis

#text_to_speech #google_text_to_speech #texttospeech #speechtotext #text_to_speech #google_text_to_speech #opensource #open_source #android #androidapp #FDroid #fdroidstore #fdroidrepo #SpeechToText

Die Forschende Hochschule HofHomeOfResearch@wisskomm.social
2025-01-09

🗣 At #HofUniversity’s Institute for Information Systems, researchers delve into #VoiceCloning—a specialized form of #SpeechSynthesis replicating voices from short samples. This tech fuels #innovation (e.g., #Gaming NPCs) but raises #Security concerns (phone #scams, identity theft). Learn more from Prof. René Peinl in this interview: t1p.de/yul8r

#AI #EUAIAct #GDPR #Innovation
t1p.de/yul8r

Nick's world 🌎 👨‍🦯gocu54@caneandable.social
2024-12-22

This is a video of speech synthesis development through the years with examples.
youtu.be/huq2TSV99hI?si=iLN397
#Blind, #SpeechSynthesis

Nick's world 🌎 👨‍🦯gocu54@caneandable.social
2024-12-10

I wish there was a comprehensive multi-hour documentary on the history of screenreaders and voice synthesis. This is something I would binge non-stop, because I've been using this software for so many years and it would be great to know the entire history of these sorts of things. #Blind, #ScreenReaders, #SpeechSynthesis

Nick's world 🌎 👨‍🦯gocu54@caneandable.social
2024-12-10

Last boost, i've been waiting for something like this. I've been looking for a real documentary on the history of speech synthesis. It's time this history be told. #blind, #SpeechSynthesis

2024-11-12

Setting up an #Ubuntu #VM since #ArchLinux wouldn't work. Using #VMware Workstation Pro. Does anyone know what could cause speech during installation to be stuttery and choppy sounding? I can barely understand what it's saying. This is on #Windows11.
#speechSynthesis #screenReader

Also, because I accidentally installed Workstation Player at first and then installed Workstation Pro, I now have both on my system. Only one entry shows up in my uninstaller, so I removed it and reinstalled Pro, but I still see both of them. Any ideas what might cause this?
#softwareInstallation #uninstaller

#Windows #tech #techSupport #blind #accessibility
@mastoblind @main

Resolviendo la incógnita 🌐RLIBlog
2024-11-08

En 1961, John L. Kelly Jr. y Carol Lockbaum programaron en una IBM 7094 de Bell Labs la primera síntesis de voz cantando Daisy Bell de Harry Dacre con música de Max Mathews. Se podría considerar el abuelo de Hatsune Miku.

2024-10-21

Сьогодні дивився на Open-Source Speech Synthesis, і все дуже цікаво.

Ну, спочатку, існують речі такі як `espeak-ng`, які можна встановити з репозиторію і вони наче як ... стандартні.

Але господи, яке воно страшне, найжахливіший синтезований голос шо я чув.

Далі я поліз гуглити, спочатку знайшов Mozilla TTS: github.com/mozilla/TTS/ але воно схоже давно мертве. У Mozilla схоже є звичка шось починати і закидать.

Потім, знайшов github.com/coqui-ai/TTS ... В якому дуже цікаво виглядає те шо структура README дуже схожа з попереднім, команда інсталяції через pip така сама...

Вдалось його запустити, генерує непоганий голос, але така купа залежностей, тягте CUDA навіть коли воно мені не треба, але працює.

Далі цікавіше, Tortoise TTS:

huggingface.co/spaces/Manmay/t

Ось тут воно працює і непогано, але якшо спробувати запустити локально, то як мінімум на ноутбуці все настільки повільно шо я не дочекався поки згенерується одна фраза. Мабуть правду писали в README шо треба NVIDIA GPU.

Потім я знайшов ось цей реддіт тред, reddit.com/r/MachineLearning/c

Пішов дивитись на Mimic, і десь там на форумі побачив шо вони out of business, зате подивіть на `piper-tts`.

І ось тут починаєтья найцікавіше: github.com/rhasspy/piper

> A fast, local neural text to speech system

Є варіанти встановити як модуль python, є бінарник. Я спочатку думав шо якийсь з python, але ні. І воно генерує дуже непогану мову, дуже швидко, і без 10 гігабайт dependencies.

Дуже прикольна штука. Буду копати далі. Є навіть українські голоси, якість правда так собі, але є.

rhasspy.github.io/piper-sample

Єдина проблема, воно чомусь не сприймає newlines в тексті, доводиться робити отак:

```
echo $text | tr "\n\r" " " | ./piper -m ~/src/speak/en_US-lessac-medium.onnx -f - | paplay
```

Але то вже таке, шось придумаємо!

#tts #SpeechSynthesis #PiperTTS

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst