I type fast. Around 90 WPM on a good day. But even at that speed, I am constantly falling behind in Slack.
Slack is a different kind of typing. It is not flowing prose — it is reactive, rapid-fire, context-switching every two minutes. By the time I have typed out a coherent response, three more messages have arrived and the thread has moved on without me.
So I started using my voice instead. Here is what I learned.
Why Voice Is Faster for Slack and Teams
The average person speaks at 130–150 words per minute. Even the fastest typists rarely exceed 100 WPM in real-world conditions (not speed-test conditions). But more importantly, speaking is thinking out loud — it bypasses the translation layer between brain and fingers.
For short reactive messages — "yeah sounds good, let us jump on a call at 2" or "can you share the doc again? I cannot find it" — voice is dramatically faster. You say it, it appears, you send it. No backspacing, no autocorrect disasters, no hunting for the right emoji.
For longer messages like project updates or async explanations, the advantage compounds. A 3-paragraph Slack message that would take 2 minutes to type takes about 40 seconds to dictate.
What Does Not Work: Browser Extensions
The first thing most people try is a Chrome extension. There are several voice-to-text extensions on the Chrome Web Store, and they work fine — for Gmail, Google Docs, and other browser-based text fields.
But Slack's desktop app is not a browser. It is an Electron app running in its own process, outside Chrome's reach. Browser extensions can only inject into web pages in the Chrome renderer. They have no access to the desktop application's text input fields.
Same goes for Teams. The desktop version is also Electron-based. Your Chrome extension will not see it.
Windows' built-in Speech Recognition (the one you enable in Settings > Time & Language > Speech) can technically dictate into any window, but it is slow to activate, requires training, and the accuracy is noticeably worse than modern AI transcription — especially for technical terms, names, or anything with punctuation.
What Actually Works on Windows
The approach that works is a dedicated Windows dictation tool that operates at the OS level — not inside a browser, but as a system-wide layer that can inject text into any focused application.
Here is the setup that has been working for me:
1. Press a hotkey anywhere
You are in Slack, Teams, VS Code, Notepad, whatever. You press a global shortcut (I use Ctrl+Shift+Space). A small overlay appears — nothing intrusive, just a mic indicator.
2. Speak naturally
You say your message. The audio is sent to Groq's Whisper API for transcription. This takes about 1–2 seconds for a sentence, less than a second for short phrases.
3. Text is injected directly
The transcribed text is typed into whatever window was active when you pressed the hotkey. In Slack, it lands in the message box. You review it, press Enter.
This works because the tool uses Windows accessibility APIs (specifically UI Automation) to interact with the active window — not browser injection. It can reach desktop apps, terminal windows, chat apps, anything with a text input.
Accuracy in Real Use
Groq's Whisper model is genuinely impressive. In my testing:
- Common Slack phrases: ~99% accuracy
- Technical terms (API, GitHub, Kubernetes): ~96% accuracy
- Names and proper nouns: ~92% accuracy (drops with unusual names)
- Punctuation: handled automatically based on speech patterns
I occasionally have to fix a word, but it is faster than typing the whole message.
The Tool That Does This
The app I have been using is Dictate for Windows. It is a lightweight Electron app that runs in the system tray — you do not even know it is there until you need it. Press the hotkey, speak, done.
It uses Groq's Whisper API under the hood, which means the transcription cost is almost nothing — fractions of a cent per message. You pay for what you use, no subscription required.
If you are spending more than 30% of your workday in Slack or Teams, this is worth trying. The setup takes about 5 minutes and the habit clicks within a day or two.
Your keyboard will thank you.
Top comments (0)