#STT

2025-07-03

@thelinuxEXP I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by @mkiol

It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.

I primarily use #WhisperAI for transcription and Piper for voice, but many other models are available as well.

It is available as flatpak and github.com/mkiol/dsnote

#TTS #transcription #TextToSpeech #translator translation #offline #machinetranslation #sailfishos #SpeechSynthesis #SpeechRecognition #speechtotext #nmt #linux-desktop #stt #asr #flatpak-applications #SpeechNote

The image shows a screenshot of the "About" page for Speech Note 4.8.1. The page is structured with a dark gray header and a light gray body. The header includes a title "About" and a version number "4.8.1" with a subtitle "Note taking, reading and translating with Speech to Text, Text to Speech and Machine Translation." Below this, there is a section titled "Changes," followed by "About," which includes links to the project website and bug reporting pages on GitHub and GitLab, along with a support email address. The page also states that Speech Note is developed as an open-source project under the Mozilla Public License 2.0. The "Authors" section lists Michal Kosciessa as the copyright holder for the years 2021-2025. The "Translators" section lists several names, including Heimen Stoffels, Béranger Arnaud, and others. The "Libraries in use" section lists various libraries such as Qt, Coqui STT, Vosk, and others. The page has a "Close" button in the bottom right corner.

Provided by @altbot, generated privately and locally using Ovis2-8B
2025-06-30

📢 Income Tax Compliance:
Return for Securities Transaction Tax (STT) for FY 2024-25 is due by 30th June 2025.
Applicable for those liable to report STT transactions.

Fossery Tech :debian: :gnome:fosserytech@social.linux.pizza
2025-06-29

(Linux news in original post)

FOSS NEWS

Proton Mail gets Newsletter view to manage all email subscriptions in one place:
proton.me/blog/proton-mail-new
(That's really cool. Now we can tell normies that Proton Mail has this feature and Gmail doesn't lol)

Proton Pass adds 14 new entry types, option to create custom types:
alternativeto.net/news/2025/6/
(Really tempting feature, but personally I would advise against storing every piece of sensitive data in one central database in the cloud. Proton can get hacked any time, like any other company, and also the new Swiss law can force them to hand over all that personal data in plain text, so you can mess up your privacy really badly. I'm not pointing fingers at Proton, but I think this update wasn't quite a good idea, it puts too much responsibility on them.)

Firefox 140 ESR released with unload tab feature, support for adding custom search engines in Search settings, support for keeping more or fewer pinned vertical tabs in view, "Select All" option for bookmarks on Android:
9to5linux.com/firefox-140-esr-

Firefox 141 beta is available with less memory usage on Linux, ability to drag a tab to the pinned tabs tray and drag it out to unpin it, etc.:
9to5linux.com/firefox-141-prom

Mozilla discontinues DeepSpeech, an embedded/offline speech-to-text engine:
phoronix.com/news/Mozilla-Deep
(GNOME: *drops a feature every few releases*
Mozilla: Hold my beer. *drops a service each week*)

(more FOSS news in comment)

#WeeklyNews #OpenSource #FOSSNews #OpenSourceNews #FOSS #Proton #ProtonMail #ProtonPass #Firefox #Firefox140 #Firefox140ESR #FirefoxBeta #Mozilla #DeepSpeech #STT #SpeechToText #Browser #WebBrowser #Email #EmailService #EmailProvider #PasswordManager #Privacy #Security #FosseryTech

Jean-Jerome Levyjeanjeromelevy
2025-06-25

🔊 Whisper has a serious challenger: Moshi STT

Developed by the French research lab Kyutai, Moshi STT is a new open-source speech recognition system that’s blazingly fast, highly accurate, and optimized for Apple Silicon and CUDA — all designed with real-time performance in mind.

scalastic.io/en/moshi-stt-vs-w

2025-06-23

#Whisper #WebGPU by #Huggingface sounds very exciting!

Does this mean an #activitypub server could delegate translation-into-user's-language of all the posts to the user's device?

I'm too thick to have been able to find any system-requirements information for just the text-translation feature... Is this #translation feature likely to fly on mobile devices too?

Am I getting too excited too soon?

dev.to/proflead/real-time-audi

github.com/keatonkraiger/Whisp

#language #linguistics #AI #STT #SpeechToText #piefed #mastodon #edgeComputing

2025-06-23

System-Wide Dictation Tool with Vosk for Manjaro Linux

This project implements a powerful, system-wide dictation feature for Manjaro Linux (and other Linux distributions with minor adjustments). Once set up, you can press a hotkey in any text field (browser, editor, chat, etc.) to immediately start dictating. The spoken text will be automatically typed out for you.

github.com/sl5net/Vosk-System-
#FOSS #Offline #Vosk #STT

2025-06-22

völlig underrated:

#SpeechNote ist eine datenschutzfreundliche Linux-App, die Sprache in Text umwandelt (#STT), Text vorliest (auch Dateien) (#TTS) und übersetzt – alles lokal ohne Internetverbindung.
Viele Sprachen und Open-Source-Modelle stehen zum einbinden zur Verfügung!

2025-06-06

#AI Speech to Text for your Desktop 🎙️ 🗣️

superwhisper.com/ - 85$/year (lifetime: 250$) 💲💲
wisprflow.ai - 144$/year (no lifetime) 💲💲💲💲
#Windows only: Win-Key + H - free 😏

what else do you know or use ? 🤔

#ai #tools #mac #apple #STT

Shitshow nimeltään "hallituksen rasisminvastainen koulutus" jatkuu jatkumistaan.

#STT on tehnyt tietopyynnön koulutuksen sisällöstä. Sitä on kieltäydytty luovuttamasta.

Kun se oikeasti pitäisi luovuttaa, sanoo proffa.

yle.fi/a/74-20164739

EDIT: Hessussa asiasta Yleä pidemmin:

hs.fi/politiikka/art-200001126

#hallitus #sinimustahallitus #rasistihallitus #rasismi #politiikka #MePuhummeTeoin #tietopyyntö #avoimuus

2025-05-25
Bild von Markiertem textAnsicht der Vorlesesoftwear
2025-05-23

🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬

Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬

follow hem here: @thorstenvoice
or on YouTube: youtube.com/@ThorstenMueller YouTube channel!

#Accessibility #FLOSS #TTS #ParlerTTS #OpenSource #VoiceTech #TextToSpeech #AI #CoquiAI #VoiceAssistant #Sprachassistent #MachineLearning #AccessibilityMatters #FLOSS #TTS #OpenSource #Inclusivity #FOSS #Coqui #AI #CoquiAI #VoiceAssistant #Sprachassistent #VoiceTechnology #KünstlicheStimme #MachineLearning #Python #Rhasspy #TextToSpeech #VoiceTech #STT #SpeechSynthesis #SpeechRecognition #Sprachsynthese #ArtificialVoice #VoiceCloning #Spracherkennung #CoquiTTS #voice #a11y #ScreenReader

2025-05-14

KoljaB/RealtimeSTT: A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

github.com/KoljaB/RealtimeSTT

#stt

2025-05-07

Hello Masto,
Would you have any recommandation for a great opensource #CLI (or #Python lib) to perform some speech-to-text (audio-to-txt, #stt) magic?
Thanks 🙏

2025-05-06

With a parameter of only 0.6B, this audio transcription model from Nvidia is insanely fast and accurate! Hopefully, multi-language recognition will be supported soon!
Real-time speech-to-text on mobile devices is just around the corrner!

huggingface.co/nvidia/parakeet

#Nvidia #STT

Tao of Mactaoofmac
2025-05-01

AI Speech Technologies

This page is a collection of notes and links related to AI speech technologies, including Text-to-Speech (TTS), Speech-to-Text (STT), voice synthesis, voice cloning, and other rela(...)

taoofmac.com/space/ai/speech

AI Speech Technologies
athmane mokraoui [BoF] ⏚ꝃ⌁⁂ButterflyOfFire@mstdn.fr
2025-04-29
Dr James Ravenscroftjamesravey@fosstodon.org
2025-04-24

I saw @simon's note about this post and I thought it was quite fun. I am already running Whisper on my own hardware and I want to try the post-processing with a local model like Gemma3:27b interconnected.org/home/2025/0)

#slm #stt #whisper #TwinPeaks (brainsteam.co.uk/notes/2025/04)

athmane mokraoui [BoF] ⏚ꝃ⌁⁂ButterflyOfFire@mstdn.fr
2025-03-18

ibus-speech-to-text will provide voice dictation capabilities to any application supporting IBus input methods in #Fedora Linux 42, using VOSK for local voice recognition.

🔗 fedoraproject.org/wiki/Changes

#ibus #STT #SpeechToText #VOSK

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst