Show HN: Local audio transcription and speaker ID for Apple Silicon

1 vadiml 0 8/2/2025, 2:49:55 PM github.com ↗

  Built a tool combining MLX Whisper + pyannote for fast local audio transcription with speaker diarization on Apple Silicon.

  Key benefits: privacy-first (fully local), hardware-accelerated, automatic speaker identification, multiple output formats (TXT/SRT/JSON).

  Main technical challenge was making MLX Whisper and pyannote work together despite different audio processing - solved with preprocessing pipeline.

  Perfect for interviews, meetings, podcasts. Handles HuggingFace gated models with proper error handling.

Comments (0)

No comments yet