Star 历史趋势
数据来源: GitHub API · 生成自 Stargazers.cn
README.md

AutoSubs

Local-first AI subtitles. No cloud, no subscription, no data leaving your machine.

Use it as a standalone app, or connect to DaVinci Resolve, Adobe Premiere Pro, and After Effects.

  • 🎙️ Transcription: Whisper, Moonshine, and Parakeet models via whisper-rs and ONNX Runtime
  • 👥 Speaker Diarization: Identifies and labels different speakers in the transcript, enabling per-speaker styling
  • 🌍 100+ Languages: Transcription and translation across a wide range of languages
  • 💻 Cross-Platform: macOS (Apple Silicon/Intel), Windows (Vulkan/DirectML), Linux

Downloads Weekly App Opens New Downloads / Week Ask DeepWiki

AutoSubs UI

Download

PlatformInstaller
🪟 WindowsAutoSubs-windows-x86_64.exe
🍎 macOS (Apple Silicon)AutoSubs-Mac-ARM.pkg
🍎 macOS (Intel)AutoSubs-Mac-Intel.pkg
🐧 Linux (Debian/Ubuntu)AutoSubs-linux-x86_64.deb
🐧 Linux (Fedora/openSUSE)AutoSubs-linux-x86_64.rpm

macOS Homebrew

macOS users can also install AutoSubs with Homebrew:

brew install --cask auto-subs

Linux install

Debian/Ubuntu (.deb):

wget https://github.com/tmoroney/auto-subs/releases/latest/download/AutoSubs-linux-x86_64.deb
sudo apt install ./AutoSubs-linux-x86_64.deb

Fedora/openSUSE (.rpm): Download AutoSubs-linux-x86_64.rpm and open it with your package manager.

Buy Me A Coffee


Quick Start

Standalone Mode

  1. Launch AutoSubs and select an audio or video file.
  2. Pick your model and language/translation options.
  3. Click Transcribe. Edit speakers and subtitles as needed.
  4. Export as SRT, text, or copy to clipboard.

DaVinci Resolve Mode

  1. Open DaVinci Resolve → Workspace → Scripts → AutoSubs.
  2. Select your timeline/audio source and settings.
  3. Click Transcribe. Edit speakers and subtitles as needed.
  4. Send styled subtitles back to Resolve.

[!WARNING] Mac App Store version not supported - download DaVinci Resolve from blackmagicdesign.com instead.

Adobe Premiere Pro / After Effects Mode

  1. Launch AutoSubs and open Premiere Pro or After Effects (the CEP extension loads automatically).
  2. Select the Adobe integration from AutoSubs to export timeline audio for transcription, or import generated subtitles into your project.
  3. In Premiere Pro, subtitles are imported as caption tracks; in After Effects, SRT entries are created as text layers.

Command Line Interface

For command-line usage, see the CLI Guide with complete reference, examples, and troubleshooting.


Documentation

[!TIP] I highly recommend checking out DeepWiki for asking questions and understanding the codebase.

Ask DeepWiki


Integrations

AutoSubs can run as a standalone subtitle generator, connect directly to DaVinci Resolve, or communicate with Adobe Premiere Pro and After Effects through the bundled CEP extension.

Select a Preset StyleOr create your own
Transcription PageAdvanced Settings

What's New in v3.5

Transcription: Voice Activity Detection, multiple models (Whisper/Parakeet/Moonshine), improved speaker diarization, and built-in translation.

Editing & UI: Free-text subtitle editing with auto-timing, transcript history, 6 new UI languages, and custom titlebar.

DaVinci Resolve: Animated caption macro with per-word highlighting, preset system, marker-based word timing, and instant conflict detection.

Bug Fixes (v3.5.1): Formatting improvements, Resolve export corrections, Model Manager recovery, and Linux stability fixes.


Supported Models

AutoSubs ships with several local transcription model families. All run fully on-device — nothing is sent to the cloud. Models are downloaded on demand from the in-app Model Manager.

Accuracy is a relative 1–4 rating within AutoSubs (higher is better). Sizes and RAM figures are approximate.

Whisper

OpenAI's Whisper, via whisper-rs (GGML). Each size is available in a multilingual variant and an .en English-only variant (the .en models are slightly more accurate on English audio).

ModelSizeRAMLanguagesAccuracy
tiny / tiny.en80 MB1 GBMultilingual / English
base / base.en150 MB1 GBMultilingual / English
small / small.en480 MB2 GBMultilingual / English★★
medium / medium.en1.5 GB5 GBMultilingual / English★★★
large-v3-turbo1.6 GB6 GBMultilingual★★★
large-v33.1 GB10 GBMultilingual★★★★

Moonshine

Useful Sensors' Moonshine, via ONNX Runtime. The tiny English model is quantized; the language-specific tiny variants and the base model are float-precision.

ModelSizeRAMLanguageAccuracy
moonshine-tiny60 MB1 GBEnglish
moonshine-tiny-ar120 MB1 GBArabic★★★
moonshine-tiny-zh120 MB1 GBChinese★★★
moonshine-tiny-ja120 MB1 GBJapanese★★★
moonshine-tiny-ko120 MB1 GBKorean★★★
moonshine-tiny-uk120 MB1 GBUkrainian★★
moonshine-tiny-vi120 MB1 GBVietnamese★★★
moonshine-base200 MB1 GBEnglish★★

Parakeet

NVIDIA's Parakeet-TDT-0.6B-v3 (int8 ONNX). Fast and accurate, with support for 25 European languages plus Russian and Ukrainian.

ModelSizeRAMLanguagesAccuracy
parakeet700 MB2 GB25 languages (EU + RU + UK)★★★★

SenseVoice

Alibaba's SenseVoice (int8 ONNX). Compact and well-suited to CJK audio.

ModelSizeRAMLanguagesAccuracy
sense-voice230 MB1 GBChinese, English, Japanese, Korean, Cantonese★★★

Canary

NVIDIA's Canary-1B-v2 (int8 ONNX). A multilingual encoder-decoder model that also supports native translation.

ModelSizeRAMLanguagesAccuracy
canary1 GB3 GB25 languages (EU + RU + UK)★★★★

Cohere

Cohere Transcribe (int4 ONNX). The highest-accuracy option for a focused set of 14 widely-spoken languages.

ModelSizeRAMLanguagesAccuracy
cohere2 GB4 GBArabic, German, Greek, English, Spanish, French, Italian, Japanese, Korean, Dutch, Polish, Portuguese, Vietnamese, Chinese★★★★

Diarization & VAD

In addition to transcription models, AutoSubs downloads a speaker diarization model (~40 MB, user-selectable from the Model Manager) and a Silero VAD model (auto-downloaded for voice activity detection during transcription).


Contributing

PRs are welcome! See CONTRIBUTING.md for how to get started, including the dev setup and a full codebase walkthrough via AutoSubs DeepWiki.

For detailed information about the DaVinci Resolve integration architecture, Lua server, Fusion macro system, and development workflow, see Resolve-Integration/README.md.


Acknowledgments

AutoSubs is built on top of excellent open-source projects:

  • whisper-rs - Rust bindings for Whisper C++ library
  • transcribe-rs - ONNX Runtime transcription with Moonshine and Parakeet models
  • pyannote-rs - Rust implementation of Pyannote for speaker diarization (integrated into app code for improvements)

关于 About

On-device subtitle generation that connects directly to DaVinci Resolve, Premiere, and After Effects.
aicross-platformdavincidavinci-resolvepremiereresolverustspeech-to-textsubtitlestranscribetranscriptionvideo-editing

语言 Languages

TypeScript47.0%
Rust22.8%
C++11.8%
Lua9.4%
CMake2.9%
Python1.9%
NSIS1.9%
JavaScript0.9%
Shell0.5%
CSS0.5%
SCSS0.3%
Batchfile0.1%
HTML0.1%
PowerShell0.0%
C0.0%

提交活跃度 Commit Activity

代码提交热力图
过去 52 周的开发活跃度
551
Total Commits
峰值: 92次/周
Less
More

核心贡献者 Contributors