sim186/AmicoScript

AmicoScript

AmicoScript is a privacy-focused, local-first transcription tool built on OpenAI's Whisper models. It turns audio recordings into structured, searchable transcripts without your data ever leaving your machine. Whether you need speaker identification (diarization), translation, or simple subtitles, AmicoScript provides a fast, free, and secure alternative to cloud services.

✨ Why AmicoScript

Most transcription tools:

  • require uploading your audio to the cloud
  • cost money or have limits
  • don’t give you control over your data

AmicoScript keeps everything local.

→ Your audio never leaves your machine.


🚀 Features

  • 🎧 Transcribe audio and video (MP3, WAV, M4A, OGG, FLAC, AAC, MP4, MOV, MKV)
  • 📚 Batch process multiple files at once
  • 🧠 Whisper models (tiny → large-v3)
  • 🗣️ Speaker diarization (who said what)
  • 🌍 Real-time translation to English
  • 🔍 Global search across transcripts
  • 🗂️ Organize with folders and tags
  • ✏️ Edit individual segments
  • 📤 Export to JSON, SRT, TXT, Markdown
  • ⌨️ Keyboard shortcuts for fast navigation
  • 🚀 Runs on macOS, Windows, Docker, or local Python

⚡ Example

Upload a meeting recording → get a structured, time-stamped transcript you can search, edit, and export.
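
To make "time-stamped transcript" concrete, here is a minimal sketch of how a list of segments could be rendered as SRT subtitles. The segment shape (`start`, `end`, `text`) and the function names are assumptions for illustration; AmicoScript's own export code and JSON schema may differ.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list[dict]) -> str:
    """Render segments (hypothetical keys: start, end, text) as numbered SRT cues."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(cues)


# Example data, made up for illustration.
segments = [
    {"start": 0.0, "end": 2.5, "text": "Welcome, everyone."},
    {"start": 2.5, "end": 6.0, "text": "Let's review the agenda."},
]
print(segments_to_srt(segments))
```

Each cue is an index line, a `start --> end` line, and the text, which is the layout subtitle players expect.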


🖥️ Quick Start

Docker (recommended)

docker compose up --build

Then open: http://localhost:8002
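
If the page does not load, it can help to check that something is actually listening on port 8002. A minimal sketch using only the Python standard library; the helper name `port_is_open` is hypothetical and not part of AmicoScript:

```python
import socket


def port_is_open(host: str = "localhost", port: int = 8002, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host not resolvable.
        return False


if __name__ == "__main__":
    print("AmicoScript port reachable:", port_is_open())
```

This only confirms that the port accepts connections; it does not verify that the web UI itself is healthy.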


Local

pip install -r backend/requirements.txt
python run.py

run.py will download ffmpeg automatically on first run.

macOS: Running the unsigned app (without disabling Gatekeeper)

  1. Download the latest release from the Releases page.
  2. Because the app is not signed by Apple, macOS will initially block it. Open System Settings → Privacy & Security and make sure "App Store and identified developers" is selected.
  3. Unzip the downloaded file and double-click the application (AmicoScript.app). macOS will refuse to open it because it comes from an unidentified developer.
  4. Go back to System Settings → Privacy & Security, click the "Open Anyway" button next to the blocked app, then confirm when prompted.
  5. The app will launch, and you're ready to start transcribing.


🧪 Performance

Performance depends on your hardware (CPU/GPU) and selected model size.

  • Larger models → better accuracy
  • Smaller models → faster processing

Feedback and benchmarks are welcome.
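
As a rough illustration of the accuracy/speed trade-off, the sketch below picks the largest Whisper model that fits a memory budget. The parameter counts and VRAM figures are the approximate values published in the openai/whisper README for the reference implementation; the backend AmicoScript uses may need more or less, and `pick_model` is a hypothetical helper, not part of the project.

```python
# Approximate figures from the openai/whisper README (reference implementation).
# Treat them as rough guidance, not as measurements of AmicoScript.
WHISPER_MODELS = [
    # (name, parameters in millions, approximate VRAM in GB)
    ("tiny", 39, 1),
    ("base", 74, 1),
    ("small", 244, 2),
    ("medium", 769, 5),
    ("large-v3", 1550, 10),
]


def pick_model(vram_gb: float) -> str:
    """Return the largest model whose approximate VRAM need fits the budget."""
    fitting = [name for name, _params, need in WHISPER_MODELS if need <= vram_gb]
    # Fall back to the smallest model when nothing fits.
    return fitting[-1] if fitting else "tiny"


for budget in (1, 4, 16):
    print(f"{budget} GB -> {pick_model(budget)}")
```

On CPU-only machines the same ordering applies, but the practical limit is usually processing time rather than memory.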


🧩 Optional: Speaker Diarization

Diarization uses pyannote and requires a Hugging Face token.

See the full setup instructions in the Documentation.
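
For intuition about "who said what": a common way to combine a transcript with diarization output is to give each transcript segment the speaker whose turn overlaps it the most. The sketch below shows that idea with plain dictionaries. The data shapes and function names are assumptions, and this is not necessarily how AmicoScript merges the two.

```python
def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments: list[dict], turns: list[dict]) -> list[dict]:
    """Label each segment with the speaker whose turn overlaps it most."""
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = None, 0.0
        for turn in turns:
            shared = overlap(seg["start"], seg["end"], turn["start"], turn["end"])
            if shared > best_overlap:
                best_speaker, best_overlap = turn["speaker"], shared
        # Segments with no overlapping turn keep speaker=None.
        labeled.append({**seg, "speaker": best_speaker})
    return labeled


# Example data, made up for illustration.
segments = [
    {"start": 0.0, "end": 3.0, "text": "Shall we start?"},
    {"start": 3.0, "end": 7.0, "text": "Yes, go ahead."},
]
turns = [
    {"start": 0.0, "end": 3.2, "speaker": "SPEAKER_00"},
    {"start": 3.2, "end": 7.0, "speaker": "SPEAKER_01"},
]
for seg in assign_speakers(segments, turns):
    print(seg["speaker"], seg["text"])
```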


📚 Documentation

Full documentation (API, setup, details):

Documentation


🏗️ Architecture (brief)

  • Backend: Python + FastAPI
  • Frontend: Single HTML file (no build step)
  • Processing: Background jobs
  • Storage: Temporary local files (auto-cleanup)
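
The background-job and temporary-storage ideas above can be sketched with the standard library alone. This is an illustration of the pattern, not AmicoScript's code: the names `submit_job`, `job_status`, and `fake_transcribe` are hypothetical, and the real backend runs behind FastAPI with an actual Whisper model.

```python
import tempfile
import uuid
from concurrent.futures import Future, ThreadPoolExecutor
from pathlib import Path

# One worker keeps heavy jobs from competing for the same CPU/GPU.
executor = ThreadPoolExecutor(max_workers=1)
jobs: dict[str, Future] = {}


def fake_transcribe(audio_bytes: bytes) -> dict:
    """Stand-in for the real work: stage the upload in a temp dir, then clean up."""
    with tempfile.TemporaryDirectory() as workdir:
        audio_path = Path(workdir) / "upload.bin"
        audio_path.write_bytes(audio_bytes)
        # A real job would run the model on audio_path here.
        result = {"bytes_processed": audio_path.stat().st_size}
    # The temporary directory and its contents have been deleted at this point.
    return result


def submit_job(audio_bytes: bytes) -> str:
    """Queue a job and return an id the client can poll."""
    job_id = uuid.uuid4().hex
    jobs[job_id] = executor.submit(fake_transcribe, audio_bytes)
    return job_id


def job_status(job_id: str) -> str:
    """Report 'running', 'done', or 'failed' for a queued job."""
    future = jobs[job_id]
    if not future.done():
        return "running"
    return "failed" if future.exception() else "done"


job_id = submit_job(b"\x00" * 1024)
print(jobs[job_id].result())  # blocks until the job finishes
print(job_status(job_id))
```

Returning an id immediately and letting the client poll keeps the request handler responsive while a long transcription runs.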

🤝 Contributing

Feedback, issues, and contributions are welcome.


⭐ If you find this useful

Give it a star. It helps a lot!


⚖️ License

This project is licensed under the MIT License. See the LICENSE file for more details.
