I’m not a fan of the “everything in the Cloud” dogma. I prefer local apps; otherwise, why would we need anything other than Chromebooks, if everything is hosted on Azure, AWS, or Google Cloud, with JS frontends in a browser?

This doesn’t apply to the big chatbots, which require resources so massive (and I refuse to own a “decent” graphics card) that running them locally is not feasible. But I hate it when everybody hosts something “in the Cloud” just because they can. Because this is how you do things today.

Given that the world decided otherwise over the last ~15 years, I tried to get a grasp of Azure more than once. Each time, I thought I’d have a stroke. The architectural complexity of their shit is abysmal! How can so many people use Azure and still appear mentally sane?!

It reminds me of another aspect of modern software development, typical of the last 20 years: you cannot just build an app and test for bugs. No, you first have to deal with missing dependencies, wrong paths, and a generally fucked-up development environment. Gone are the times when you installed an IDE, created a new project, wrote code, compiled and linked it, and it just worked!

But Azure seems to be the meta-meta-meta shit of this kind. You need advanced training just to figure out where to start, and how to make the necessary 2,000 clicks that would eventually let you do something useful. This is beyond insane.

The last time I tried to explore the Azure shit, I made sure not to subscribe to anything that would make me pay. So just the free tier. But Azure is not something that allows a quick “Hello, World!” proof of concept. It’s modern shit, so it’s fucking complex. So I gave up. It’s a veritable obstacle course.

Yesterday, while looking for an online voice recognition and transcription service that would accept a 60-minute MP3 and crunch it at no cost, I discovered that Azure AI Speech (part of Azure Cognitive Services Speech, which in turn is part of Azure AI Services) offers “5 audio hours free per month” in the free tier. And this can be used in the Azure Speech Playground, “typically accessed via Azure AI Studio or Speech Studio.” Azure this, Azure that.

Speaking of Azure AI Services, I wanted to try the Azure AI Foundry, only to discover that Romanian is not a supported language:

  • gpt-4o-mini: Supported languages: en, it, af, es, de, fr, id, ru, pl, uk, el, lv, zh, ar, tr, ja, sw, cy, ko, is, bn, ur, ne, th, pa, mr, te
  • o1: Supported languages: en, it, af, es, de, fr, id, ru, pl, uk, el, lv, zh, ar, tr, ja, sw, cy, ko, is, bn, ur, ne, th, pa, mr, te
  • gpt-4.5-preview: Supported languages: en, it, af, es, de, fr, id, ru, pl, uk, el, lv, zh, ar, tr, ja, sw, cy, ko, is, bn, ur, ne, th, pa, mr, te

WTF?! All these models support input and output in many more languages when accessed in an online chatbot!

So their offering is utterly useless.

But I reactivated my “subscription” (no payment, remember?) to try Azure AI Speech, since its Real-time transcription feature promises “Live transcription capabilities on your own audio without writing any code.”

Wow, no code.

For fuck’s sake, it didn’t work! It failed to accept my MP3. Then a 16-bit PCM file, allegedly the standard for Azure Speech Services, failed too!
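For reference, the PCM flavour most often cited for Azure Speech is 16 kHz, 16-bit, mono; whether that would have satisfied the Playground is anyone’s guess. Here is a stdlib-only Python sketch (hypothetical file name, a test tone instead of real speech) that writes such a file and verifies its parameters:

```python
import math
import struct
import wave

# Assumed target format: 16 kHz sample rate, 16-bit (2-byte) samples, mono.
RATE, WIDTH, CHANNELS = 16000, 2, 1

def write_test_tone(path, seconds=1, freq=440):
    """Write a sine-wave WAV file in 16 kHz / 16-bit / mono PCM."""
    with wave.open(path, "wb") as w:
        w.setnchannels(CHANNELS)
        w.setsampwidth(WIDTH)
        w.setframerate(RATE)
        frames = bytearray()
        for n in range(seconds * RATE):
            sample = int(32767 * math.sin(2 * math.pi * freq * n / RATE))
            frames += struct.pack("<h", sample)  # little-endian signed 16-bit
        w.writeframes(bytes(frames))

def pcm_params(path):
    """Return (channels, sample_width_bytes, frame_rate) of a WAV file."""
    with wave.open(path, "rb") as w:
        return w.getnchannels(), w.getsampwidth(), w.getframerate()

write_test_tone("tone.wav")
print(pcm_params("tone.wav"))  # (1, 2, 16000)
```

At least this way you can check locally that a file really is what the error message claims it isn’t.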

Is there anything by Microsoft that works? Oh, Copilot, right. But how are people using Azure AI Services if they fail more often than they work?

So I decided to try Whisper. It just worked, with minor adjustments (a local Python virtual environment was needed):

python3 -m venv whisper_env        # create an isolated Python environment
source whisper_env/bin/activate    # activate it
pip install -U openai-whisper      # the Whisper CLI and library
pip install setuptools-rust        # needed to build the tokenizer on some systems
whisper input.mp3 --model medium   # transcribe; --language ro skips auto-detection

Now, of course, I have to correct the resulting output (generated as txt, srt, vtt, tsv, and json), because it contains errors. But the result is surprisingly decent for a modest model running on a €400 laptop with Intel graphics!
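Since the .srt needs manual correction anyway, a small stdlib-only helper (hypothetical, not part of Whisper) can parse the cues, so errors can be chased cue by cue instead of in one wall of text:

```python
import re

# Matches one SRT cue: index, "HH:MM:SS,mmm --> HH:MM:SS,mmm", then the text.
CUE_RE = re.compile(
    r"(\d+)\s*\n(\d{2}:\d{2}:\d{2}),(\d{3}) --> (\d{2}:\d{2}:\d{2}),(\d{3})\s*\n(.*?)(?:\n\n|\Z)",
    re.S,
)

def parse_srt(text):
    """Return a list of (index, start, end, text) tuples from SRT content."""
    cues = []
    for m in CUE_RE.finditer(text):
        start = f"{m.group(2)},{m.group(3)}"
        end = f"{m.group(4)},{m.group(5)}"
        cues.append((int(m.group(1)), start, end, m.group(6).strip()))
    return cues

sample = """1
00:00:00,000 --> 00:00:04,500
First cue of the transcript.

2
00:00:04,500 --> 00:00:09,000
Second cue, to be corrected by hand.
"""
cues = parse_srt(sample)
print(len(cues))   # 2
print(cues[0][1])  # 00:00:00,000
```

From the parsed tuples, it’s trivial to grep for suspicious words or spot-check the timestamps before fixing the file by hand.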

Fuck you, Azure AI shit!