#whispercpp

2024-12-31

Does anybody know of a better #speechToText alternative to this?

This feels like a terrible hack that keeps breaking. I decided to look for alternatives after I saw them using /dev/shm to store ML models.

QuantiusBenignus/BlahST
github.com/QuantiusBenignus/Bl

SpeechNote (aka dsnote) does not qualify since it doesn't integrate with the clipboard.

#STT #WhisperCPP

2024-12-10

🚀 #Whisperphp Makes Speech Recognition Accessible in #PHP

🔧 New #PHP binding for #Whispercpp brings powerful #AI speech recognition capabilities:
• Supports #Linux (x86_64/arm64) and #macOS platforms with both high and low-level APIs for maximum flexibility

github.com/CodeWithKyrian/whis

2024-10-30

@itsfoss Well, it's probably better to have #WhisperCpp integrated in #Shotcut than to wait until audio exports just to put it through AI externally again.

#Whisper

whispercpp can now be installed.

It's not on RubyGems.org yet.

blogs.kitaitimakoto.net/~/Apeh

2024-09-30

Russian talk radio:

[00:00:00.000 --> 00:00:06.140] Who should define these new notions of what is good and what is bad? The state?
[00:00:06.140 --> 00:00:16.060] I think, well, some kind of state commission, well, a real commission, a real one, that is ready to look into the future.
[00:00:16.060 --> 00:00:18.500] Whom do we want to raise now?
[00:00:18.500 --> 00:00:23.140] Whom do we want to raise? I don't really understand.

#whispercpp
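For anyone who wants to post-process output like the above, here is a minimal sketch of parsing the `[start --> end] text` lines into structured segments. The names `Segment` and `parse_transcript` are made up for illustration and are not part of whisper.cpp; the only assumption taken from the post is the timestamp layout shown in the transcript, and the sample text is a placeholder.

```python
import re
from dataclasses import dataclass

# Matches lines shaped like "[00:00:06.140 --> 00:00:16.060] some text".
LINE_RE = re.compile(
    r"^\[(\d{2}):(\d{2}):(\d{2})\.(\d{3}) --> "
    r"(\d{2}):(\d{2}):(\d{2})\.(\d{3})\]\s*(.*)$"
)


@dataclass
class Segment:
    """One transcript line (hypothetical helper type)."""
    start_ms: int
    end_ms: int
    text: str


def _to_ms(hours, minutes, seconds, millis):
    """Convert the four captured timestamp fields to milliseconds."""
    return ((int(hours) * 60 + int(minutes)) * 60 + int(seconds)) * 1000 + int(millis)


def parse_transcript(raw):
    """Return a Segment for every line that matches the timestamp layout.

    Lines that do not match (blank lines, hashtags, etc.) are skipped.
    """
    segments = []
    for line in raw.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue
        groups = match.groups()
        segments.append(
            Segment(_to_ms(*groups[0:4]), _to_ms(*groups[4:8]), groups[8])
        )
    return segments


if __name__ == "__main__":
    # Placeholder text; only the timestamps mirror the post above.
    sample = (
        "[00:00:00.000 --> 00:00:06.140] placeholder text one\n"
        "[00:00:06.140 --> 00:00:16.060] placeholder text two\n"
    )
    for seg in parse_transcript(sample):
        print(seg.start_ms, seg.end_ms, seg.text)
```

Storing times as integer milliseconds keeps comparisons exact, which may help when indexing or searching segments later.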

2024-02-20

@bert_hubert @hanno +1 #WhisperCpp is well-documented and straight-forwardly setup-able.

Frederik Elwert felwert@mstdn.social
2023-10-24

Experimented with #whispercpp today. Results are quite impressive. I let it transcribe a 3-minute snippet of an interview in German. What I noticed: the medium model works significantly better than the smaller models. But it smooths the text quite a bit, removing duplications, interjections etc., which might be undesirable for academic purposes. The OpenVINO version runs significantly faster even on an Intel GPU, but not by an order of magnitude. Having everything local is a huge plus for sensitive data.

Dr. Fortyseven 🥃 █▓▒░ fortyseven@defcon.social
2023-10-21

Seeing an epidemic of people using automatic captioning tools and not actually reviewing the output. Numerous obvious, easily fixed errors.

This does absolutely no favors to people actually _depending_ on those captions to be accurate.

#ai #whispercpp #accessibility

Harvey Sandstromcd0
2023-07-16

This actually happened a couple of weeks ago but I didn't see it until now. They've added experimental "speaker diarization" to #whispercpp. This allows it to mark separations when the person speaking changes. It appears that this uses both the sound of the voice and semantic clues simultaneously.
github.com/ggerganov/whisper.c

Harvey Sandstromcd0
2023-04-15

#whispercpp continues to evolve. Among many other changes, the ones I find most interesting are:
- Multiple transcriptions can run in parallel using a single copy of the model in memory (each using a private state structure but sharing access to the model).
- On many android devices the native 16-bit float (fp16) instructions can now be utilized.
- CoreML can now be used. This is neat but is inherently Apple-only.
github.com/ggerganov/whisper.c
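The first point above (one model in memory, a private state per transcription) can be sketched as a general pattern. This is not whisper.cpp's API: `Model`, `State`, `transcribe`, and `transcribe_all` are hypothetical stand-ins, and the "decoding" is a placeholder that only records which input was handled.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Model:
    """Stand-in for model weights: created once, read-only afterwards."""
    name: str


@dataclass
class State:
    """Per-transcription scratch data; never shared between workers."""
    segments: list = field(default_factory=list)


def transcribe(model, audio_id):
    """Run one transcription with its own private State."""
    state = State()  # private to this call, so workers cannot interfere
    # Placeholder for real decoding: only reads from the shared model.
    state.segments.append(f"{model.name}:{audio_id}")
    return state.segments


def transcribe_all(model, audio_ids, workers=2):
    """Process several inputs in parallel against a single shared Model."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as audio_ids.
        return list(pool.map(lambda audio_id: transcribe(model, audio_id), audio_ids))


if __name__ == "__main__":
    shared = Model("medium")  # a single copy for all workers
    print(transcribe_all(shared, ["a.wav", "b.wav", "c.wav"]))
```

Because the shared object is immutable and all mutable data lives in the per-call state, no locking is needed in this sketch.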

tobozotobozo
2023-03-02

what happens when you get whisper.cpp to listen to ?

both speak at 16 kHz so they should understand each other, right?

github.com/ggerganov/whisper.c

Track: Alpha by @lukhash

Harvey Sandstromcd0
2023-02-04

Whisper.cpp 1.2 is out with a very significant reduction in memory usage for the same models: roughly half.

github.com/ggerganov/whisper.c

2023-02-01

Wonder how much € would be saved if people only knew about free and/or open source solutions. #whisperai #whispercpp #subtitleedit

(((Jann Gobble)))🏳️‍🌈 jann@twit.social
2022-12-23

@brainwane @simon Have you looked at #whispercpp yet? I use it daily to catalog my #podcasts and #index the contents. It's WAY faster than normal #python-based whisper...even with #PyTorch. Then again, it's got a limitation on the #algorithms it uses.. it's #inference & #greedy-only right now.

github.com/ggerganov/whisper.c
