#AudioToText

Rohit Farmer, Ph.D.swatantra@fosstodon.org
2024-12-16

Thanks to everyone for suggesting #OpenAI #Whisper for my problem. I have now settled with the following workflow:

rohitfarmer.com/tinylog/#monda

#ai #audiototext #accessibility

Rohit Farmer, Ph.D.swatantra@fosstodon.org
2024-12-15

Is there a free to relatively cheap way to do audio-to-text transcription? Currently, I use otter.ai, but it has a limit of 30 minutes per recording in the free plan. Though I love it, my current usage is infrequent and doesn't justify a pro subscription.

It doesn't have to be a real-time transcription. I can record and batch transcribe using a standalone, web, or mobile app. Thanks!

#fedihelp #speachtotext #audiototext #foss #accessibility

2024-11-08

Hey folks :FediverseSymbol:

We've actually done an unwritten, off-the-cusp trans voice Friday recording today :TransHeart:

We've not listened back to it, because voice dysphoria, but we've added full alt text.

In case you're wondering how we've done that without listening back to it, we've once against used an amazing tool called Subtitle Edit, which has audio to text functionality via the Whisper speech recognition engine.

We used the large-v3 model, which is about 3.1 GB, but gives incredibly accurate transcription.

In case anyone can't access the alt text, we've added the full transcript below too.

#TransVoiceFriday #TransVoice #voice #VoiceFeminisation #VoiceFeminization #VoiceTraining #trans #transgender #TransFem #VoiceDysphoria #SubtitleEdit #PurfviewWhisper #AudioToText #SpeechToText #SpeechRecognition

Hey folks, I know that we haven't done a voice note in forever, and that's been for a multitude of reasons, some of which are related to mental health, some of which are related to work, stress, anxiety, depression, etc, things like that, which comes under mental health anyway, yeah, partly due to poor time management, yay for being AuDHD! But not gonna lie, some of it does come down to underlying voice dysphoria, because this is the best we've managed to get since December 2021. And just for anyone who hasn't heard roughly what we sounded like beforehand, we haven't exactly moved our voice up a lot. I mean, the base level would just be down here. So I can move my voice back up here easily now, and this is the comfortable, this is the default voice. But, um... It's not where I want it to be, it's not in the female range, and I can't easily push the pitch up higher without it sounding wrong. But yeah, there's been a lot of stuff going on recently, um, a lot of bad stuff for everyone, don't want to talk about all of that. But, um, let's just focus on supporting each other, helping each other, um, being kind to ourselves and others right now, and being compassionate and empathetic. That's all I've really got to say. I'm trying to do the same thing with ourselves, but yeah, it's hard sometimes. Anyway, ta-ta for now.

2024-02-06

Still amazed that I can convert audio to text on my PC for free with a local version of #OpenAI Whisper. Handles multiple audio & video formats and generates txt, srt & 3 more result files.

GitHub repo: github.com/openai/whisper - open-source MIT License!

Python library: pypi.org/project/openai-whispe

Easy-to-follow installation video by Kevin Stratvert: youtube.com/watch?v=ABFqbY_rmE

Even easier video for free cloud use via a Colab notebook youtube.com/watch?v=8SQV-B83tP

#AudioToText #AI #ddj

2023-07-27

Anyone on Masto have recommendations for uploading audio to somewhere that will then transcribe it for you?

Open Source or Free options preferred.

#audiototext #tranascription

2023-04-16

I created a #Python #automation for transcribing any #YouTube video to text files with language detection. You will get accurate, customizable results while saving time and free of cost. It is very easy to use and can be useful for #content #creators, #researchers, and #educators!

Learn more in my latest blogpost: javedali.net/post/2023-04-audi

#pythonprogramming #audio #transcription #audiototext

2020-07-29

Auriez-vous un #conseil pour un logiciel de #transcription de fichiers audios ? De préférence #floss évidemment !
#audio #audiototext

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst