Skip to content

axot/OpenSuperMLX

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

434 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OpenSuperMLX

Native speech-to-text for macOS. Press a shortcut anywhere, speak, and OpenSuperMLX turns your voice into clean text on your Apple Silicon Mac.

It is built for people who write meeting notes, Slack replies, documents, prompts, and follow-ups faster by talking than by typing.

Install with Homebrew: brew tap axot/tap && brew install --cask opensupermlx. Or download from GitHub releases.

OpenSuperMLX recordings and stats views with synthetic transcript history and activity dashboard
Screenshots use synthetic sample data. Full-size: Recordings · Stats.

Why Use It

  • Works from any app: tap or hold a global shortcut, then paste the transcript back where you were writing.
  • Feels native on macOS: menu-bar app, keyboard-first flow, mic picker, searchable transcript history, and drag-and-drop audio import.
  • Runs locally with MLX: transcription runs on-device by default through MLX; optional LLM correction sends text only to the provider you configure.
  • Handles real multilingual work: automatic language detection, English/Chinese/Japanese/Korean support, and Asian language autocorrect.
  • Tracks the habit: a stats dashboard shows sessions, streaks, speaking time, time saved, and estimates against a generic typing-speed baseline.

Core Workflow

  1. Press + ` from any app.
  2. Speak naturally.
  3. Release or stop recording.
  4. OpenSuperMLX transcribes, cleans up the text, and pastes it into the frontmost app.

Two modes are built in:

Gesture Action
Tap + ` Start or stop recording
Hold + ` Record only while held
Tap + + ` Start or stop recording with LLM correction
Escape Cancel active recording

Shortcuts are customizable in Settings -> Shortcuts.

Features

  • Real-time streaming transcription so text appears while you speak
  • Searchable local transcript history
  • Drag-and-drop audio file transcription with queue processing
  • Built-in model picker and custom Hugging Face model IDs
  • Microphone switching for built-in, external, Bluetooth, and Apple Continuity devices
  • Optional AWS Bedrock LLM post-transcription correction
  • CLI harness for transcription, diagnostics, queues, models, and benchmarks
  • First-launch onboarding for permissions and model setup

Installation

Homebrew

brew tap axot/tap
brew install --cask opensupermlx

Manual

Download the latest build from the GitHub releases page.

macOS Security Approval

OpenSuperMLX is not signed with an Apple Developer ID, so macOS may block the first launch.

  1. Open the app.
  2. Go to System Settings -> Privacy & Security.
  3. Find the OpenSuperMLX security message.
  4. Click Open Anyway.
  5. Confirm the dialog.

You only need to do this once.

Requirements

  • macOS 14.0+
  • Apple Silicon / ARM64 Mac

Models

Models are downloaded automatically from Hugging Face when selected in the app.

Model Best For
Qwen3-ASR-0.6B-4bit Fastest, smallest local model
Qwen3-ASR-1.7B-8bit Recommended balance of quality and speed
Qwen3-ASR-1.7B-bf16 Highest quality

Custom models can be added with a Hugging Face repository ID.

CLI

The app binary also works as a headless CLI harness. It supports transcribe, stream-simulate, correct, config, recordings, queue, mic, model, benchmark, and diagnose.

BINARY=build/Build/Products/Debug/OpenSuperMLX.app/Contents/MacOS/OpenSuperMLX
$BINARY diagnose --json
$BINARY help transcribe

See docs/cli.md for the full command reference.

Building Locally

git clone git@github.com:axot/OpenSuperMLX.git
cd OpenSuperMLX
git submodule update --init --recursive
brew install cmake libomp rust ruby
gem install xcpretty
./run.sh build

For CI build details, see .github/workflows/build.yml.

Support

If you run into an issue:

  1. Search existing GitHub issues.
  2. Open a new issue with reproduction steps.
  3. Include system information and relevant logs.

Acknowledgments

OpenSuperMLX is forked from OpenSuperWhisper by @Starmel. Thanks to the original project for the foundation.

License

OpenSuperMLX is licensed under the MIT License. See LICENSE for details.

About

macOS app for real-time audio transcription powered by MLX on Apple Silicon

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors