Features
Two ways to speak — depending on how you think.
Choose your preferred shortcut method.
[Push to Dictation]
For short, precise inputs.
Push to Dictation
Press & Release
Hands Free
Hands-free
Keyboard & Mouse Shortcuts
Keyboard
Hands-free
The same words. The right output — everywhere.
Better context engineering with intelligent output adaptation across different applications.
Automatic context adaptation with the ability to define work modes for different contexts.
Use different language models, different voice models, and configure different output styles. Intelligently adapt appropriate outputs across different applications. Switch between different modes instantly using keyboard shortcuts for better context engineering.
Different scenarios require different output requirements: writing emails, replying to client messages, vibe coding, interacting with AI tools. Each scenario has specific format requirements and professional terminology enhancements. Different applications need different output formats, and different recipients (boss, client, colleague, family) require different tones and styles.
You say:
"Can you take a look at this?"
In Slack
Short & casual
In Email
Polite & structured
In IDE
Technical & concise
Applications
Voice
Slack
Message
Word
IDE
Teams
Discord
Scenarios
Boss
Professional
Client
Formal
Colleague
Casual
Family
Friendly
Playful
Humorous
Creative
Expressive
100+ languages
Supports transcription and translation in over 100 languages, from widely spoken languages like English, Chinese, and Spanish to regional dialects and less common languages.
And you can switch languages on the fly.
Hybrid LLM Engine
Local when privacy matters.
Cloud when intelligence matters.
Your key when control matters.
OpenAI
Grok
Gemini
DeepSeek
Meta
Mistral AI
Qwen
Ollama
Voice models and large language models run entirely on-device
Complete data security, lower latency, lower cost.
You can customize and download different language models based on your needs. Some models guarantee low latency, ideal for scenarios requiring quick responses; others may have slightly higher latency but ensure higher accuracy, perfect for scenarios with high precision requirements. You can choose the model that best fits your needs for different scenarios.
Runs fully on-device
Biometric voice data never leaves your device
Millisecond-level latency
Biometric Data Protected
Latency vs Accuracy
Low Latency
High Accuracy
Biometric Voice Data
Highly Sensitive • Fully Protected
It adapts to you.
Why it gets better with use.
Dictionary
Vocabulary
→ Fewer correctionsDefine custom vocabulary to greatly improve speech recognition accuracy and resolve spelling errors. Perfect for personal names, specific professional terms, and domain-specific vocabulary.
Snippets
→ Faster repliesQuickly define frequently used content such as postal addresses, product descriptions, and common message phrases. All vocabulary and snippets can be used across different applications.
Audio
→ Works anywhereWorks seamlessly across diverse scenarios and adapts to different input devices: AirPods, Bluetooth headphones, built-in microphones, laptop lid closed mode, wired microphones, and recording devices.
When your laptop lid is closed, the system intelligently adapts to the different microphone configuration, ensuring optimal audio capture even in clamshell mode.
Advanced noise reduction algorithms automatically filter out background noise in noisy environments like cafes, while preserving clear speech in quiet offices.
Intelligent silence detection handles long pauses and blank spaces in speech, automatically identifying when you're thinking, pausing, or finished speaking. The system understands natural speech patterns with frequent pauses and extended silence periods.
Dictionary
Personal Names
John Smith, Olivia Zhang, Neo Ruan, Jim Sang
Professional Terms
API, Kubernetes, CyberWhisper, LLM, Transformer, GPT, Claude
Input Devices
Built-in
Clamshell
Wired
Bluetooth
USB
Loopback Input
BYOK LLM integration
Bring Your Own Key (BYOK) support for LLM services. Use your own API keys from OpenAI, Anthropic, or other providers to maintain full control over your AI costs, usage, and data routing.
Perfect for teams and power users who need full control.
Bring your own key
See the difference — instantly.
Compare modes before you commit.
A place to shape how your voice becomes text.
Switch between different models, microphones, and parameters, see results change in real-time.
Compare output styles and speeds of different Modes. Experiment and save your final mode combination.
Real-time output area
Ready to try CyberWhisper?
Speak your thoughts. Let the text follow.
Runs locally. No audio upload. No lock-in.