Hold the hotkey
Trigger dictation from any app with a push-to-talk shortcut instead of switching into a browser or cloud recorder.
TalaX turns Whisper into a dictation workflow for technical work: hotkey in, speak naturally, correct once, and reuse those corrections automatically the next time you talk.
Current install path: build from source. No binary releases are published yet.
TalaX should feel like a native keyboard shortcut, not a separate transcription ritual. Speak, release, review, and let the corrections compound.
Trigger dictation from any app with a push-to-talk shortcut instead of switching into a browser or cloud recorder.
Whisper captures the raw transcript locally while TalaX keeps the context of technical vocabulary close by.
Fix the bad term, keep typing, and let the 3-layer pipeline remember how you actually talk about your stack.
Corrections feed a persistent pipeline instead of vanishing at the end of the session.
Keep separate vocabularies for DevOps, research, and personal writing without cross-polluting terms.
Whisper, SQLite, and the correction engine stay on-device, so privacy is not a marketing afterthought.
Tauri + Rust keep the app lightweight while the correction pipeline stays interactive.
This is the product moment the page should prove immediately: raw speech gets corrected into something a developer would actually send to an editor or terminal.
A DevOps profile should learn cluster names and Terraform resources. A research profile should remember model families and tooling terms. TalaX keeps those correction histories separate so you do not have to retrain from zero every time your domain changes.
Technical depth matters here, but it comes after the user outcome. These are the pieces that make TalaX credible once the value is obvious.
Local speech-to-text with model sizes from ~75 MB to ~574 MB, downloaded on first use through the model manager.
Dictionary replacement, trigram context scoring, and heuristic recovery for accents, acronyms, compounds, and fuzzy matches.
Persistent correction history, sessions, and per-profile learning data without introducing a remote dependency.
Desktop integration for global hotkeys, audio capture, text injection, tray state, and a lightweight frontend.
TalaX is released under Business Source License 1.1 and converts to Apache 2.0 on 2030-03-28. The site uses the same wording everywhere so the trust model stays clear.
The current path is source build, with documented prerequisites and model choices on the install page.