Typeflux

Talk. We'll Type.

Hold Fn and speak naturally. Typeflux delivers lightning-fast, accurate voice-to-text directly into any macOS application. It is free and open source, and it supports local models, so your voice never has to leave your Mac.

Download

⬇ Download latest release (.dmg)

1. Download Typeflux.dmg from the latest release
2. Open the DMG and drag Typeflux.app to Applications
3. Launch Typeflux and grant Microphone and Accessibility permissions

macOS 13+ · Free · No subscription · Fully local inference supported

Why Typeflux

Most voice input tools force you to switch apps — dictating in one place, then copying and pasting into another. That context switch breaks flow.

Typeflux injects text directly into whichever app you're already using, at the cursor position, the moment you release the hotkey. It feels like typing, only about 4× faster (~200 WPM speaking vs. ~50 WPM typing).

And when you need more than dictation, Voice Agent turns your voice into an AI assistant for Q&A, rewriting, translation, and complex workflows.

How It Works

Hold Fn → Speak → Release → Text appears instantly

1. Press and hold Fn (the default hotkey)
2. Speak naturally
3. Release the key. Typeflux transcribes the audio and injects the text at your cursor
4. The result is also copied to the clipboard as a fallback
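
The hold, speak, release flow above can be sketched as a small state machine. This is a minimal illustration, not Typeflux's actual code: `PushToTalkSession`, `TextSink`, and `BufferSink` are hypothetical names, audio chunks are modeled as strings, and the clipboard is a plain property standing in for the system clipboard.

```swift
// Hypothetical sketch of the push-to-talk flow; none of these types come from Typeflux.
enum DictationState: Equatable {
    case idle
    case recording
}

protocol TextSink {
    /// Returns true if the text was delivered to the focused app.
    func deliver(_ text: String) -> Bool
}

final class PushToTalkSession {
    private(set) var state: DictationState = .idle
    private(set) var clipboard: String? = nil   // stand-in for the system clipboard
    private var capturedAudio: [String] = []    // stand-in for audio buffers
    private let transcribe: ([String]) -> String
    private let injector: TextSink

    init(transcribe: @escaping ([String]) -> String, injector: TextSink) {
        self.transcribe = transcribe
        self.injector = injector
    }

    /// Hotkey pressed: start a fresh recording.
    func hotkeyDown() {
        guard state == .idle else { return }
        capturedAudio.removeAll()
        state = .recording
    }

    /// Audio arrives only while the hotkey is held.
    func capture(_ chunk: String) {
        guard state == .recording else { return }
        capturedAudio.append(chunk)
    }

    /// Hotkey released: transcribe, inject at the cursor, and copy to the clipboard.
    @discardableResult
    func hotkeyUp() -> String? {
        guard state == .recording else { return nil }
        state = .idle
        let text = transcribe(capturedAudio)
        guard !text.isEmpty else { return nil }
        _ = injector.deliver(text)
        clipboard = text   // copied regardless, so the text survives a failed injection
        return text
    }
}

/// Test double that records what would have been typed.
final class BufferSink: TextSink {
    private(set) var received: [String] = []
    func deliver(_ text: String) -> Bool {
        received.append(text)
        return true
    }
}

let sink = BufferSink()
let session = PushToTalkSession(transcribe: { $0.joined(separator: " ") }, injector: sink)
session.hotkeyDown()
session.capture("hello")
session.capture("world")
let result = session.hotkeyUp()
print(result ?? "")   // prints "hello world"
```

Copying to the clipboard unconditionally, rather than only on failure, matches the "also copied" wording above and keeps the fallback path trivial.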

Features

One-Click Voice Input

Hold Fn to start, release to stop. No switching input methods, no clicking buttons — works in any text field across browsers, code editors, terminals, and native apps.

Voice Agent (`Fn + Space`)

More than just dictation. Press Fn + Space to chat with an AI agent using your voice:

- Voice Q&A: Ask questions and get instant answers
- Content Rewrite: Select text, then speak an instruction like "make this shorter" or "translate to English"
- Complex Workflows: Handle multi-step tasks through natural conversation
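
One way to picture the difference between Voice Q&A and Content Rewrite is as a routing decision on whether text is currently selected. The sketch below is an assumption about how such routing could work, not a description of Typeflux internals; `AgentRequest` and `makeAgentRequest` are hypothetical names.

```swift
import Foundation

// Hypothetical request shape; the real agent protocol is not documented here.
enum AgentRequest: Equatable {
    case question(String)
    case rewrite(selection: String, instruction: String)
}

/// Treats the spoken text as a rewrite instruction when there is a non-empty
/// selection, and as a standalone question otherwise.
func makeAgentRequest(spoken: String, selectedText: String?) -> AgentRequest {
    let selection = selectedText?.trimmingCharacters(in: .whitespacesAndNewlines) ?? ""
    if selection.isEmpty {
        return .question(spoken)
    }
    return .rewrite(selection: selection, instruction: spoken)
}

let rewriteRequest = makeAgentRequest(spoken: "make this shorter",
                                      selectedText: "A rather long sentence.")
let questionRequest = makeAgentRequest(spoken: "What is AGPL?", selectedText: nil)
print(rewriteRequest)
print(questionRequest)
```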

Local-First, Privacy-First

Run entirely on your Mac with on-device models: no API keys are needed, and no data leaves your machine. We don't collect, store, or analyze any of your voice or text data.

Custom Personas

Create named instruction sets for different scenarios — work emails, study notes, casual chat, code comments — and switch between them from the menu bar.
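
A persona can be thought of as a name plus an instruction string, with one persona active at a time. The sketch below assumes the active persona's instructions are prepended to the transcript before any post-processing; that behavior, and the names `Persona` and `PersonaStore`, are illustrative assumptions.

```swift
// Hypothetical data model for named instruction sets.
struct Persona: Equatable {
    let name: String
    let instructions: String
}

struct PersonaStore {
    private(set) var personas: [Persona] = []
    private(set) var activeName: String? = nil

    /// Adds a persona, replacing any existing one with the same name.
    mutating func add(_ persona: Persona) {
        if let index = personas.firstIndex(where: { $0.name == persona.name }) {
            personas[index] = persona
        } else {
            personas.append(persona)
        }
    }

    /// Switches the active persona; returns false if the name is unknown.
    @discardableResult
    mutating func activate(_ name: String) -> Bool {
        guard personas.contains(where: { $0.name == name }) else { return false }
        activeName = name
        return true
    }

    var active: Persona? {
        return personas.first(where: { $0.name == activeName })
    }

    /// Assumption: instructions are simply prepended to the transcript.
    func prompt(for transcript: String) -> String {
        guard let persona = active else { return transcript }
        return "\(persona.instructions)\n\n\(transcript)"
    }
}

var store = PersonaStore()
store.add(Persona(name: "Work email", instructions: "Rewrite as a concise, polite email."))
store.add(Persona(name: "Code comments", instructions: "Rewrite as a terse code comment."))
store.activate("Work email")
print(store.prompt(for: "tell bob the build is fixed"))
```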

Multiple Speech Backends

| Provider | Type | Best For |
| --- | --- | --- |
| Typeflux Cloud | Cloud | Zero-config, balanced accuracy |
| Local Model | Local | Privacy, offline use |
| Alibaba Cloud ASR | Cloud streaming | Low latency, Chinese |
| Doubao Realtime ASR | Cloud streaming | Chinese optimization |
| Google Speech-to-Text | Cloud | Multi-language, enterprise |
| OpenAI (Whisper API) | Cloud | High accuracy |
| Multimodal LLM | Cloud | Vision + audio tasks |
| Groq | Cloud | Fast inference, low cost |
| Free Models | Cloud | No API key, open-source endpoints |
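
Supporting several backends usually means hiding them behind one interface. The following is a minimal sketch under that assumption; `SpeechBackend`, `BackendKind`, `StubBackend`, and `usableBackends` are hypothetical and do not reflect Typeflux's real module layout. It also shows why only the local backend remains usable offline.

```swift
// Hypothetical provider abstraction; the kinds mirror the "Type" column above.
enum BackendKind {
    case local
    case cloud
    case cloudStreaming
}

protocol SpeechBackend {
    var name: String { get }
    var kind: BackendKind { get }
    func transcribe(_ audio: [Float]) throws -> String
}

/// Stub that returns canned text, so the sketch runs without any model or network.
struct StubBackend: SpeechBackend {
    let name: String
    let kind: BackendKind
    let cannedText: String

    func transcribe(_ audio: [Float]) throws -> String {
        return cannedText
    }
}

/// Without a network connection, only local backends can be used.
func usableBackends(_ all: [SpeechBackend], online: Bool) -> [SpeechBackend] {
    return online ? all : all.filter { $0.kind == .local }
}

let backends: [SpeechBackend] = [
    StubBackend(name: "Typeflux Cloud", kind: .cloud, cannedText: "cloud result"),
    StubBackend(name: "Local Model", kind: .local, cannedText: "local result"),
    StubBackend(name: "Alibaba Cloud ASR", kind: .cloudStreaming, cannedText: "streamed result"),
]

let offlineNames = usableBackends(backends, online: false).map { $0.name }
print(offlineNames)   // prints ["Local Model"]

if let first = usableBackends(backends, online: false).first,
   let text = try? first.transcribe([0.0]) {
    print(text)       // prints "local result"
}
```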

Local Models

When you choose Local Model, Typeflux downloads and runs the model entirely on your Mac:

| Model | Size | Params | Best For |
| --- | --- | --- | --- |
| SenseVoice Small | ~350 MB | 234M | Fast multilingual, strong Chinese/Japanese/Korean |
| WhisperKit Medium | ~1.5 GB | 769M | Balanced English and multilingual dictation |
| WhisperKit Large | ~3 GB | 1.55B | Highest accuracy offline transcription |
| Qwen3-ASR | ~1.3 GB | 0.6B | Strong context understanding, long-form recognition |
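
To make the size trade-off concrete, the sketch below encodes the table as data and lists the models that fit a given disk budget. The sizes are the approximate figures from the table, converted at 1 GB = 1000 MB; `LocalModel` and `modelsFitting` are hypothetical helpers, not part of Typeflux.

```swift
// Approximate download sizes taken from the table above (1 GB treated as 1000 MB).
struct LocalModel {
    let name: String
    let approxSizeMB: Int
    let params: String
}

let localModels = [
    LocalModel(name: "SenseVoice Small", approxSizeMB: 350, params: "234M"),
    LocalModel(name: "WhisperKit Medium", approxSizeMB: 1500, params: "769M"),
    LocalModel(name: "WhisperKit Large", approxSizeMB: 3000, params: "1.55B"),
    LocalModel(name: "Qwen3-ASR", approxSizeMB: 1300, params: "0.6B"),
]

/// Returns the models that fit within the budget, smallest first.
func modelsFitting(budgetMB: Int, in models: [LocalModel]) -> [LocalModel] {
    return models
        .filter { $0.approxSizeMB <= budgetMB }
        .sorted { $0.approxSizeMB < $1.approxSizeMB }
}

let fitting = modelsFitting(budgetMB: 1400, in: localModels).map { $0.name }
print(fitting)   // prints ["SenseVoice Small", "Qwen3-ASR"]

// Disk needed to keep every model installed at once.
let totalMB = localModels.reduce(0) { $0 + $1.approxSizeMB }
print(totalMB)   // prints 6150
```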

Streaming Preview

See partial transcription results while still speaking, so you get immediate feedback before you release.
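
A streaming preview typically shows a mix of finalized text and a still-changing partial hypothesis. The sketch below assumes the backend emits interim results that replace each other and completed segments that are appended; `TranscriptEvent` and `PreviewBuffer` are hypothetical names used only for illustration.

```swift
// Hypothetical event stream from a streaming speech backend.
enum TranscriptEvent {
    case interim(String)     // latest guess for the segment still being spoken
    case completed(String)   // finalized text for a finished segment
}

struct PreviewBuffer {
    private(set) var committed: String = ""
    private(set) var pending: String = ""

    mutating func apply(_ event: TranscriptEvent) {
        switch event {
        case .interim(let text):
            // Each interim result supersedes the previous one.
            pending = text
        case .completed(let text):
            // Finalized segments accumulate; the pending guess is cleared.
            committed = committed.isEmpty ? text : committed + " " + text
            pending = ""
        }
    }

    /// What the overlay would show while the user is still speaking.
    var displayText: String {
        return [committed, pending].filter { !$0.isEmpty }.joined(separator: " ")
    }
}

var preview = PreviewBuffer()
preview.apply(.interim("hel"))
preview.apply(.interim("hello wor"))
let midSpeech = preview.displayText
preview.apply(.completed("hello world"))
preview.apply(.interim("how"))
let afterSegment = preview.displayText
print(midSpeech)      // prints "hello wor"
print(afterSegment)   // prints "hello world how"
```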

History & Replay

Every session is saved locally. Review past sessions, replay audio, retry transcription with different settings, or export records to Markdown.
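
As an illustration of what a Markdown export could look like, the sketch below renders a list of records into a single document. The record fields and the output layout are assumptions; the real export format and storage schema may differ (see DATA_STORAGE.md in the repository for the actual storage details).

```swift
// Hypothetical history record; field names are illustrative only.
struct HistoryRecord {
    let date: String      // pre-formatted timestamp
    let backend: String   // which speech backend produced the text
    let text: String      // final transcription
}

/// Renders records as one Markdown document with a section per session.
func exportMarkdown(_ records: [HistoryRecord]) -> String {
    var lines = ["# Typeflux History", ""]
    for record in records {
        lines.append("## \(record.date)")
        lines.append("")
        lines.append("- Backend: \(record.backend)")
        lines.append("")
        lines.append(record.text)
        lines.append("")
    }
    return lines.joined(separator: "\n")
}

let exported = exportMarkdown([
    HistoryRecord(date: "2025-01-01 09:00", backend: "Local Model", text: "hello world"),
])
print(exported)
```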

Requirements

- macOS 13 or later
- Microphone permission
- Accessibility permission (for text injection)

For cloud providers: API keys and endpoint URLs.
For local inference: model files are downloaded automatically on first use.

Build from Source

git clone https://github.com/mylxsw/typeflux
cd typeflux
make run          # build + launch as .app bundle
make dev          # launch with terminal logs attached
make full-dev     # launch dev app with bundled SenseVoice resources
make full-release # build the full notarized production installer locally
swift test        # run tests

See CLAUDE.md for the full development guide.

Documentation

Contributing

Typeflux is a completely open-source project. We believe great tools should belong to everyone.

Contributions are welcome. Good starting points include STT provider integrations, overlay UX, settings views, text injection edge cases, and history/export features.

1. Read the module layout in CLAUDE.md
2. Run the app locally with `make dev`
3. Add or update tests for any logic changes
4. Open a PR with a description of user-visible impact

License

AGPL-3.0. See LICENSE.

