VOCIX RTF22

accessibility claude dictation llm ollama productivity speech-to-text voice whisper windows

Use this command to install VOCIX:

winget install --id=RTF22.VOCIX -e

VOCIX is a voice dictation application designed to capture speech, transcribe it offline, optionally transform it using large language models (LLMs), and insert the result at the cursor position in any application.

Key Features:

Push-to-talk via global hotkey: Use the Pause key as the default hotkey for hands-free operation across all applications.
Three processing modes: Choose between Clean mode for standard transcription, Business mode for professional language transformation, or Rage mode to de-escalate aggressive text.
Multi-provider LLM integration: Supports Anthropic Claude, OpenAI-compatible APIs, and local Ollama models, allowing customization per mode.
System-wide insertion: Automatically inserts processed text into any application at the cursor position.
User-friendly interface: Features a system tray with a color-coded microphone icon for quick access and mode switching.

Audience & Benefit: Ideal for professionals who require efficient transcription across various applications. VOCIX enhances productivity by automating dictation and smart text transformation, ensuring seamless integration into daily workflows in browsers, Word, Outlook, or IDEs.

README

🌐 English · Deutsch

VOCIX — Voice Capture & Intelligent eXpression

Release Downloads License

Local voice dictation app for Windows 11 with a global hotkey. Capture speech, transcribe it, transform it intelligently, and insert it system-wide at the cursor position — in any application (browser, Word, Outlook, IDEs, etc.).

Features

Push-to-Talk via global hotkey (default: Pause)
Three modes:
- A — Clean: Clean transcription; strips filler words (um, uh, like, ...) with light corrections
- B — Business: Rewrites speech into professional business language (LLM-powered)
- C — Rage: De-escalates aggressive language into polite phrasing (LLM-powered)
Multi-provider LLM for modes B and C — pick your backend in the settings dialog: Anthropic Claude, any OpenAI-compatible API (OpenAI, Groq, OpenRouter, LM Studio, llama.cpp-server, vLLM via base_url) or local Ollama models. Per-mode override (e.g. Business on cloud Claude, Rage on local Llama). Provider failures fall back to Clean mode and surface an orange toast — no more silent degradation.
Settings dialog in the tray menu — four tabs (Basics / Advanced / Expert / AI Provider) with Test buttons, hotkey capture and per-mode validation
System tray with a colour-coded microphone icon and mode switching
Status overlay with a live VU meter while recording — instant visual feedback that the mic is picking up signal
History of the last 20 dictations in the tray — click an entry to re-insert it (saves your text when the target window has changed)
Usage statistics — words per day/week/total, estimated typing time saved (200 keystrokes/min), distribution across modes
Snippet expansion — your own shortcuts (, , …) inside the dictation are replaced with full text before insertion; Whisper transcripts like "slash sig" are normalised automatically

Shortcut	Action
`Pause` (hold)	Push-to-talk — speak, release to process
`Ctrl+Shift+1`	Mode A: Clean transcription
`Ctrl+Shift+2`	Mode B: Business mode
`Ctrl+Shift+3`	Mode C: Rage mode

Problem	Solution
SmartScreen: "Windows protected your PC" on first launch	Click More info → Run anyway. VOCIX is open source and the release ZIP is reproducible from `main` via `build_exe.bat`. Code signing is tracked in #12.
Tray icon not visible	Check hidden icons in the taskbar (arrow pointing up)
"VOCIX requires a CPU with AVX support" on startup	Your CPU is older than ~2012 and cannot run CTranslate2. VOCIX will not work on this machine.
Hotkey doesn't respond	Run the app as administrator
Laptop without a `Pause` key	Set `VOCIX_HOTKEY_RECORD=scroll lock` (or `f7`) in `.env`
"Microphone unavailable"	Check microphone permissions in Windows settings
Modes B/C only return Clean results	Open Settings → AI Provider, configure at least one slot and hit Test
Whisper download fails	Check your internet connection; configure proxy/firewall if needed
Text contains wrong characters	Make sure the target app supports Ctrl+V / paste
RDP: text is not inserted	Set `VOCIX_RDP_MODE=true` in `.env`

VOCIX RTF22

README

VOCIX — Voice Capture & Intelligent eXpression

Features

Requirements

Installation

Option A: winget

Option B: Scoop

Option C: Portable .exe

Option D: From source

GPU acceleration (optional, NVIDIA only)

Build the .exe yourself

Configuration

Usage

Troubleshooting

Project structure

License