Azure AI: I’m too old for this shit, but Whisper works—locally
I’m not a fan of the “everything in the Cloud” dogma. I prefer local apps; otherwise, why would we need anything other than Chromebooks, if everything is hosted on Azure, AWS, or Google Cloud, with JS frontends in a browser?
This doesn’t apply to the big chatbots, which require such intensive resources (and I refuse to own a “decent” graphics card) that running them locally is not feasible. But I hate it when everybody hosts something “in the Cloud” just because they can. Because that’s how you do things today.
Given that the world decided otherwise over the last ~15 years, I’ve tried to get a grasp of Azure more than once. Each time, I thought I’d have a stroke. The architectural complexity of their shit is abysmal! How can so many people use Azure and still maintain a semblance of sanity?!
It reminds me of another aspect of modern software development that’s typical of the last 20 years: you can’t just build an app and test for bugs. No, you first have to deal with missing dependencies, wrong paths, and a generally fucked-up development environment. Gone are the days when you installed an IDE, created a new project, wrote code, compiled and linked it, and it just worked!
But Azure seems to be the meta-meta-meta version of this kind of shit. You need advanced training just to understand where to start and how to make the 2,000 clicks that would eventually allow you to do something useful. This is beyond insane.
The last time I tried to explore the Azure shit, I made sure not to subscribe to anything that would make me pay, so just the free tier. But Azure is not something where you can get a quick “Hello, World!” proof of concept. It’s modern shit, so it’s fucking complex. So I gave up. It’s an obstacle course (le parcours du combattant, as the French say).

Yesterday, while I was looking for an online voice recognition and transcription service that would accept a 60-minute MP3 and crunch it at no cost, I discovered that Azure AI Speech (formerly the Cognitive Services Speech service, now part of Azure AI Services) offers “5 audio hours free per month” in the free tier. And this can be used in the Azure Speech Playground, “typically accessed via Azure AI Studio or Speech Studio.” Azure this, Azure that.
Speaking of Azure AI Services, I wanted to try the Azure AI Foundry, only to discover that Romanian is not a supported language:
- gpt-4o-mini: Supported languages: en, it, af, es, de, fr, id, ru, pl, uk, el, lv, zh, ar, tr, ja, sw, cy, ko, is, bn, ur, ne, th, pa, mr, te
- o1: Supported languages: en, it, af, es, de, fr, id, ru, pl, uk, el, lv, zh, ar, tr, ja, sw, cy, ko, is, bn, ur, ne, th, pa, mr, te
- gpt-4.5-preview: Supported languages: en, it, af, es, de, fr, id, ru, pl, uk, el, lv, zh, ar, tr, ja, sw, cy, ko, is, bn, ur, ne, th, pa, mr, te
WTF?! All these models support input and output in many more languages when accessed in an online chatbot!
So their offering is utterly pointless and useless.
But I reactivated my “subscription” (no payment, remember?) to be able to try Azure AI Speech, since its Real-time transcription feature promises “Live transcription capabilities on your own audio without writing any code.”
Wow, no code.
For fuck’s sake, it didn’t work! It failed to accept my MP3. Then a 16-bit PCM WAV, allegedly the standard format for Azure Speech Services, failed too!
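And the “no code” promise aside, the with-code route doesn’t look any friendlier. Here’s a rough sketch of the Speech-to-text REST call for short audio, assuming a key and region from the free tier (westeurope, $SPEECH_KEY and input.wav are placeholders, and this endpoint only accepts clips of up to about 60 seconds anyway, so it would be useless for my 60-minute MP3):

# Speech-to-text REST API for short audio; ro-RO = Romanian
curl -X POST \
  "https://westeurope.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=ro-RO" \
  -H "Ocp-Apim-Subscription-Key: $SPEECH_KEY" \
  -H "Content-Type: audio/wav; codecs=audio/pcm; samplerate=16000" \
  -H "Accept: application/json" \
  --data-binary @input.wav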

Is there anything by Microsoft that works? Oh, Copilot, right. But how are people using Azure AI Services if they fail to work more often than not?

So I decided to try Whisper. It just worked, with minor adjustments (a local Python virtual environment was needed):
python3 -m venv whisper_env
source whisper_env/bin/activate
pip install -U openai-whisper
pip install setuptools-rust
whisper input.mp3 --model medium
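One catch: openai-whisper decodes the MP3 through ffmpeg, so ffmpeg has to be installed system-wide first, e.g. on a Debian/Ubuntu-style distro (adjust for yours):

# Whisper shells out to ffmpeg for audio decoding
sudo apt install ffmpeg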

Now, of course, I have to correct the resulting output (generated as txt, srt, vtt, tsv, and json), because there are errors. But the result is surprisingly decent for a modest model running on a €400 laptop with Intel video!
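If you only need one of those formats and want to skip the language auto-detection, the CLI takes a few extra flags (at least in the version I installed; check whisper --help):

# assuming Romanian audio; writes only the .srt into ./transcripts
whisper input.mp3 --model medium --language Romanian --output_format srt --output_dir transcripts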
Fuck you, Azure AI shit!
Oh, and Whisper now has a serious competitor: Mistral’s new Voxtral.
Apparently, Whisper has the worst error rate of all the models compared on the FLEURS (Few-shot Learning Evaluation of Universal Representations of Speech) benchmark!
Since the Voxtral family are speech understanding models, I don’t understand why anyone would want to use them as text-only models!
OK, so:
These idiots can’t even properly format the text on a fucking web page! (Bold was added by me.) OK, some links:
● Voxtral-Mini-3B-2507
● Voxtral-Small-24B-2507
As I only use simple laptops (and a mini-PC), all with Intel video, I can only use quantized models in GGUF format (the format used by llama.cpp and similar tools), and the only one currently on HF is this:
● Voxtral-3B-But-4B-Text-Only-GGUF
I can’t figure out what it could be used for! For fuck’s sake, it’s text-only! To use it as a dumbed-down Mistral Small 3.1?
OK, I tried it in GPT4All. It sucks. It’s small, so its limited understanding was something to be expected.
For the time being, I can’t see Voxtral as a practical transcription tool for everyone. Sigh.
Oh, but look how clueless the French idiots at Mistral are! They chose the name “Voxtral” (Vox + Mistral), but this name was already taken by the shady people behind Voxtral.org! Nobody knows who they are, but they do exist!
Anyway, Mistral is a huge French failure in IT. Their models were much better at the beginning; now they’re increasingly fucked up. Just a couple of hours ago, Mistral refused to answer a question of mine, both in standard and in Think mode, without giving any reason!
Whisper can also be used locally (with a small model) on Android and iOS through the app NotelyVoice.