Features

CyberWhisper Features
Push to Dictation Hands-free Mode Modes & Dictionary
Quick Triggers

Two ways to speak — depending on how you think.

Choose your preferred shortcut method.

[Push to Dictation]

For short, precise inputs.

[Hands-free Mode]

For long thoughts and open-ended thinking.

Learn more →

Push to Dictation

Press & Release

Hands Free

Hands-free

Keyboard & Mouse Shortcuts

Fn

Keyboard

⌃⌥

Hands-free

Modes

The same words. The right output — everywhere.

Better context engineering with intelligent output adaptation across different applications.

Automatic context adaptation with the ability to define work modes for different contexts.

Use different language models, different voice models, and configure different output styles. Intelligently adapt appropriate outputs across different applications. Switch between different modes instantly using keyboard shortcuts for better context engineering.

Different scenarios require different output requirements: writing emails, replying to client messages, vibe coding, interacting with AI tools. Each scenario has specific format requirements and professional terminology enhancements. Different applications need different output formats, and different recipients (boss, client, colleague, family) require different tones and styles.

You say:

"Can you take a look at this?"

In Slack

Short & casual

In Email

Polite & structured

In IDE

Technical & concise

Applications

Voice

Slack

Message

Word

Email

IDE

Teams

Discord

Scenarios

Boss

Professional

Client

Formal

Colleague

Casual

Family

Friendly

Playful

Humorous

Creative

Expressive

100+ languages

Supports transcription and translation in over 100 languages, from widely spoken languages like English, Chinese, and Spanish to regional dialects and less common languages.

🇺🇸
🇬🇧
🇨🇳
🇯🇵
🇰🇷
🇩🇪
🇫🇷
🇪🇸
🇮🇹
🇷🇺
🇵🇹
🇳🇱
🇵🇱
🇹🇷
🇸🇦
🇦🇪
🇮🇳
🇹🇭
🇻🇳
🇮🇩
🇲🇾
🇸🇬
🇵🇭
🇧🇷
🇲🇽
🇦🇷
🇨🇴
🇨🇱
🇵🇪
🇻🇪
🇨🇦
🇦🇺
🇳🇿
🇿🇦
🇪🇬
🇳🇬
🇰🇪
🇸🇪
🇳🇴
🇩🇰
🇫🇮
🇮🇸
🇮🇪
🇨🇿
🇭🇺
🇷🇴
🇬🇷
🇺🇦

And you can switch languages on the fly.

Hybrid LLM Engine

Local when privacy matters.

Cloud when intelligence matters.

Your key when control matters.

OpenAI

OpenAI

Grok

Grok

Google

Google

Gemini

Gemini

DeepSeek

DeepSeek

Meta

Meta

Mistral AI

Mistral AI

Qwen

Qwen

Ollama

Ollama

On-Device Large Models

Voice models and large language models run entirely on-device

Complete data security, lower latency, lower cost.

You can customize and download different language models based on your needs. Some models guarantee low latency, ideal for scenarios requiring quick responses; others may have slightly higher latency but ensure higher accuracy, perfect for scenarios with high precision requirements. You can choose the model that best fits your needs for different scenarios.

Runs fully on-device

Biometric voice data never leaves your device

Millisecond-level latency

Biometric Data Protected

Latency vs Accuracy

Latency Low
Accuracy High

Low Latency

High Accuracy

Biometric Voice Data

Highly Sensitive • Fully Protected

Encrypted
On-Device
Adaptation

It adapts to you.

Why it gets better with use.

Dictionary

Vocabulary

→ Fewer corrections

Define custom vocabulary to greatly improve speech recognition accuracy and resolve spelling errors. Perfect for personal names, specific professional terms, and domain-specific vocabulary.

Snippets

→ Faster replies

Quickly define frequently used content such as postal addresses, product descriptions, and common message phrases. All vocabulary and snippets can be used across different applications.

Audio

→ Works anywhere

Works seamlessly across diverse scenarios and adapts to different input devices: AirPods, Bluetooth headphones, built-in microphones, laptop lid closed mode, wired microphones, and recording devices.

When your laptop lid is closed, the system intelligently adapts to the different microphone configuration, ensuring optimal audio capture even in clamshell mode.

Advanced noise reduction algorithms automatically filter out background noise in noisy environments like cafes, while preserving clear speech in quiet offices.

Intelligent silence detection handles long pauses and blank spaces in speech, automatically identifying when you're thinking, pausing, or finished speaking. The system understands natural speech patterns with frequent pauses and extended silence periods.

Dictionary

Personal Names

John Smith, Olivia Zhang, Neo Ruan, Jim Sang

Professional Terms

API, Kubernetes, CyberWhisper, LLM, Transformer, GPT, Claude

Input Devices

Built-in

Clamshell

Wired

Bluetooth

USB

Loopback Input

BYOK LLM integration

Bring Your Own Key (BYOK) support for LLM services. Use your own API keys from OpenAI, Anthropic, or other providers to maintain full control over your AI costs, usage, and data routing.

Perfect for teams and power users who need full control.

Bring your own key

OpenAI
OpenAI
Gemini
Gemini
Grok
Grok
DeepSeek
DeepSeek
Playground

See the difference — instantly.

Compare modes before you commit.

A place to shape how your voice becomes text.

Switch between different models, microphones, and parameters, see results change in real-time.

Compare output styles and speeds of different Modes. Experiment and save your final mode combination.

Mic
Whisper
LLM

Real-time output area

Ready to try CyberWhisper?

Speak your thoughts. Let the text follow.

Runs locally. No audio upload. No lock-in.