[e925]
░ armory · video-clipping · compare

Deepgram Nova-3 vs fast-asd (Sieve)

Both in the video & clipping category. Side-by-side — pick the one that fits your stack tonight.

Deepgram Nova-3★★★★★
✓ loya-tested💰 paid🛠️ wire-up

The speech-to-text that actually gets word-level timestamps right.

rating
5
tested
✓ loya-tested
cost
paid
install
needs-wiring
stars
0
updated
4d ago
#transcription#speech-to-text#word-timestamps#diarization#paid#deepgram
avoid if

You only transcribe short voice notes — use free Whisper locally.

open the full entry →
fast-asd (Sieve)★★★★★
🆓 free🐍 sidecar

Tells your video which person is actually talking. Powers auto-cropping for clips.

rating
3
tested
cost
free
install
sidecar
stars
82
updated
1y ago
#video#active-speaker#python#sieve#clipping#open-source
avoid if

You aren't building your own video pipeline. Most creators should just pay OpusClip and skip the plumbing.

open the full entry →

why it matters · Deepgram Nova-3

Nova-3 is the transcription engine behind every podcast clipper that ships. You upload audio, get back text with per-word timestamps, speaker labels, and punctuation — the three things you need to cut a clip on a clean sentence boundary instead of mid-word. Costs about 26 cents an hour of audio. Free \$200 credit when you sign up, which gets you through your first 700+ hours before you pay anything. Way more accurate than Whisper on real podcast audio.

why it matters · fast-asd (Sieve)

If you want to take a multi-person podcast and auto-crop it to the vertical 9:16 format TikTok and Reels want, the video needs to know WHO is talking at any given second. fast-asd figures that out — audio + lip movement detection — so your crop follows the active speaker. Stale repo (last updated mid-2024) but still works, and the pattern is still how every podcast clipper does speaker tracking under the hood. Python sidecar, MIT, free.

more video & clipping to compare

derived live from the armory manifest · same-category only