Detailed Feature Overview
Whispering Tiger unifies Speech Recognition, Real-time & Batch Translation, Text-to-Speech, Image / Screen OCR, In-Game Text Monitoring, and a rich Plugin & Automation system – all running locally for privacy and performance.
Depending on the selected models, Whispering Tiger can understand or translate between 100 and 200+ languages. Coverage varies per model family (see sections below). Many pipelines support automatic language detection.
The UI lets you choose model size versus accuracy, precision / quantization, the hardware device to use, and VAD (voice activity detection) options.
Extend core capabilities without modifying the base application.
This overview is not exhaustive. New plugins, models, and features appear frequently. For the latest additions, check the GitHub repositories and the plugin index.
Performance varies with configuration. Model size, hardware (CPU/GPU), quantization, and concurrent tasks all influence throughput and latency. Profiles help tune these.
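To make the size and quantization trade-off concrete, here is a rough sketch of how the memory needed for model weights scales with parameter count and precision. The `Profile` class, its field names, and the parameter counts are hypothetical illustrations and do not reflect Whispering Tiger's actual profile format or any specific model. Only the bytes-per-weight figures are standard.

```python
from dataclasses import dataclass

# Bytes needed to store one weight at each precision (standard sizes).
BYTES_PER_WEIGHT = {"float32": 4.0, "float16": 2.0, "int8": 1.0}


@dataclass(frozen=True)
class Profile:
    """Hypothetical settings bundle; not Whispering Tiger's real profile format."""
    name: str
    parameters_millions: float  # illustrative size, not a real model's count
    precision: str              # must be a key of BYTES_PER_WEIGHT
    device: str                 # "cpu" or "cuda", illustrative only

    def weight_memory_mb(self) -> float:
        """Approximate memory for the weights alone, in MB (10^6 bytes)."""
        # millions of parameters * bytes per parameter = millions of bytes
        return self.parameters_millions * BYTES_PER_WEIGHT[self.precision]


profiles = [
    Profile("low-latency", 250, "int8", "cpu"),
    Profile("high-accuracy", 1500, "float16", "cuda"),
]

for p in profiles:
    print(f"{p.name}: ~{p.weight_memory_mb():.0f} MB of weights on {p.device}")
```

This estimate covers weights only. Real memory use is higher because of activations, buffers, and any other models loaded at the same time, so treat the numbers as a lower bound when choosing a configuration.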
Open an issue on the main UI repository on GitHub, join the Discord server, or send a report using the in-app feedback tool.
Model names & capabilities are provided for orientation only. Refer to each model's own license & repository for authoritative details.