STT-CLI Mantej Singh Dhanjal

accessibility artificial-intelligence artificialintelligence cli command-line offline openai privacy productivity speech-recognition speech-to-text startup system-tray terminal voice-control voice-recognition whisper windows

Use this command to install STT-CLI:

winget install --id=Mantej-Singh.STT-CLI -e

STT-CLI is a speech-to-text tool designed for Windows users on corporate laptops where voice typing features are restricted by IT policies. It operates as a background system tray application, enabling hands-free text input through a global hotkey (double-tap Left Alt).

Key Features:

Global Hotkey Activation: Toggle recording with a quick double-tap of the Left Alt key.
Background Operation: Runs discreetly in the system tray without a visible window.
Balloon Notifications: Provides visual feedback for recording status updates.
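
The double-tap hotkey can be thought of as a small timing check: two presses of the same key count as a double-tap when they land close enough together. The sketch below is illustrative only and is not taken from the STT-CLI source; the `DoubleTapDetector` name, the 0.4-second window, and the injectable clock are all assumptions made for the example. Wiring it to a real global keyboard hook is left out.

```python
import time


class DoubleTapDetector:
    """Reports a double-tap when two key presses land within `window` seconds."""

    def __init__(self, window: float = 0.4, clock=time.monotonic):
        self.window = window    # assumed threshold, not taken from STT-CLI
        self._clock = clock     # injectable so the logic can be tested without waiting
        self._last_tap = None   # time of the previous unpaired tap, if any

    def tap(self) -> bool:
        """Register one key press; return True if it completes a double-tap."""
        now = self._clock()
        if self._last_tap is not None and now - self._last_tap <= self.window:
            self._last_tap = None  # consume the pair so a third tap starts fresh
            return True
        self._last_tap = now
        return False
```

A keyboard listener would call `tap()` on each Left Alt press and toggle recording whenever it returns `True`. Resetting the stored time after a successful pair prevents a rapid triple-tap from toggling twice.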

Audience & Benefit: Ideal for Windows users on restricted corporate laptops, STT-CLI offers hands-free text input in command-line interfaces like Windows Terminal and PowerShell without requiring admin rights. It can run as a portable .exe or be installed via winget.

README

STT CLI

This project is a simple command-line interface (CLI) tool for Windows that provides speech-to-text functionality. It runs in the background, listens for a global hotkey, and transcribes your speech into the active command-line window.
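
The flow described above (hotkey starts recording, hotkey again stops it, the transcript is typed into the active window) can be sketched as a two-state toggle. This is a minimal sketch under stated assumptions, not the project's actual code: `DictationToggle` and its four callables are hypothetical names, and the audio capture, transcription, and keystroke injection are passed in as plain functions so the control flow stays visible.

```python
from typing import Callable, Optional


class DictationToggle:
    """Two-state toggle: idle -> recording -> idle, typing the transcript on stop."""

    def __init__(
        self,
        start_recording: Callable[[], None],  # hypothetical: begins audio capture
        stop_recording: Callable[[], bytes],  # hypothetical: ends capture, returns audio
        transcribe: Callable[[bytes], str],   # hypothetical: audio -> text
        type_text: Callable[[str], None],     # hypothetical: sends text to the active window
    ):
        self._start = start_recording
        self._stop = stop_recording
        self._transcribe = transcribe
        self._type_text = type_text
        self.recording = False

    def on_hotkey(self) -> Optional[str]:
        """Handle one hotkey event; return the transcript when a recording ends."""
        if not self.recording:
            self._start()
            self.recording = True
            return None
        audio = self._stop()
        self.recording = False
        text = self._transcribe(audio).strip()
        if text:  # skip typing when nothing was recognized
            self._type_text(text)
        return text
```

Keeping the hardware-facing pieces behind callables means the toggle logic can be exercised with simple stand-ins before any microphone or keyboard code exists.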

🎯 Purpose and Motivation

I created this tool specifically for Windows users working on corporate laptops where Win+H is disabled by IT policies. After discovering that the built-in Windows voice typing was blocked on my work machine, I needed a solution that would:

Work without admin rights - No installation or system modifications required
Run portably - Just a single .exe file that runs from anywhere
Bypass corporate restrictions - Doesn't touch system settings or require permissions
Support CLI workflows - Specifically designed for command-line interfaces like Windows Terminal, PowerShell, and AI coding tools

I primarily use this with Claude Code and Gemini CLI, where voice input dramatically speeds up my workflow. Macs already have built-in speech-to-text features, and there are apps like SuperWhisper, Voicy, and Voice Mode that work nicely with these tools. But when I looked for a Windows alternative that worked around corporate restrictions, nothing existed. This tool fills that gap for Windows users who need voice control in the CLI but are blocked by enterprise policies.

The Problem: Win+H Disabled on Corporate Laptops

> [!IMPORTANT]
> Many corporate and enterprise Windows laptops have the built-in voice typing feature (Win+H) disabled by IT policies and group restrictions. This is a widespread issue affecting millions of Windows users in enterprise environments.

When you try to use it, you'll see this frustrating message:

"Voice typing is not available - Speech service are managed by your organization"

This leaves users without any voice-to-text capability for their command-line workflows, especially problematic when:

You can't install software requiring admin rights
Group policies prevent modifying system settings
You need hands-free typing for accessibility or efficiency
You're working with AI coding assistants like Claude Code or Gemini CLI

The Solution: STT-CLI Running in the Background

STT-CLI solves this by running quietly in the background without requiring admin rights or system modifications. Double-tap Left Alt to start speaking, and your words appear directly in your terminal.

| Component | Library | Why |
| --- | --- | --- |
| Whisper Engine | `faster-whisper` | 4x faster than OpenAI's vanilla Whisper |
| Speech Recognition | `SpeechRecognition` | Google Web Speech API wrapper |
| Audio Processing | `av` (PyAV) | FFmpeg bindings for Whisper |
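
With an offline Whisper engine and an online Google Web Speech wrapper both listed, one plausible arrangement is to try engines in order and fall through when one fails. The sketch below illustrates that pattern only; `transcribe_with_fallback` and the `(name, callable)` engine pairs are hypothetical, and how STT-CLI actually selects an engine is not shown in this section. In practice each callable might wrap `faster-whisper` or `SpeechRecognition`, but the example uses plain functions so it runs without either library.

```python
from typing import Callable, Sequence, Tuple

# An engine is a (name, function) pair; the function maps raw audio to text.
Engine = Tuple[str, Callable[[bytes], str]]


def transcribe_with_fallback(audio: bytes, engines: Sequence[Engine]) -> Tuple[str, str]:
    """Try each engine in order; return (engine_name, text) from the first that succeeds."""
    errors = []
    for name, engine in engines:
        try:
            text = engine(audio)
        except Exception as exc:  # e.g. model missing, or no network for an online engine
            errors.append(f"{name}: {exc}")
            continue
        if text.strip():
            return name, text.strip()
        errors.append(f"{name}: empty result")
    raise RuntimeError("all engines failed: " + "; ".join(errors))
```

Returning the engine name alongside the text makes it easy to log which path produced a transcript, which helps when diagnosing why the offline engine was skipped.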

🎯 Purpose and Motivation

The Problem: Win+H Disabled on Corporate Laptops

The Solution: STT-CLI Running in the Background

Features

Architecture

Requirements

Installation

🚀 Method 1: Windows Package Manager (Winget) - RECOMMENDED

🚀 First Time Setup (After Winget Installation)

Step 1: Launch the Application

Step 2: Enable Auto-Start (Optional but Recommended)

Step 3: Start Using It

Method 2: Direct Download

System Tray Icon

Running from Source

Usage

Command-Line Options

Logs and Troubleshooting

Building from Source

📝 Future Development

Speech-to-Text Engines

Primary Engine: OpenAI Whisper (Offline) 🎯

Fallback Engine: Google Web Speech API (Online)

🚀 What's New in v2.0.0