Deepgram Nova-3 vs PySceneDetect
Both in the video & clipping category. Side-by-side — pick the one that fits your stack tonight.
The speech-to-text that actually gets word-level timestamps right.
- rating: 5★
- tested: ✓ loya-tested
- cost: paid
- install: needs-wiring
- stars: 0
- updated: 4d ago
Skip it if you only transcribe short voice notes; free Whisper running locally is enough for that.
Finds every camera cut in your video automatically. Powers smart cropping + transitions.
- rating: 4★
- tested: ✓ loya-tested
- cost: free
- install: sidecar
- stars: 4,736
- updated: 4d ago
Skip it if you only work with single-camera talking-head footage; scene detection isn't useful there.
why it matters · Deepgram Nova-3
Nova-3 is the transcription engine behind every podcast clipper that ships. You upload audio and get back text with per-word timestamps, speaker labels, and punctuation: the three things you need to cut a clip on a clean sentence boundary instead of mid-word. It costs about 26 cents per hour of audio, and the free $200 signup credit covers roughly 770 hours at that rate (200 / 0.26) before you pay anything. It is also way more accurate than Whisper on real podcast audio.
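To make the "clean sentence boundary" idea concrete, here is a minimal sketch of how per-word timestamps plus punctuation can be turned into cut points. It does not call Deepgram at all; it only assumes each word arrives as a dict with `word`, `punctuated_word`, `start`, and `end` (seconds), which is how we understand the transcription response to be shaped. The helper names `sentence_spans` and `snap_clip` are made up for this example.

```python
from typing import Dict, List, Tuple

# Assumed word shape (illustrative, not taken from the page above):
# {"word": "there", "punctuated_word": "there.", "start": 0.5, "end": 0.9}
Word = Dict[str, object]
Span = Tuple[float, float, str]

SENTENCE_END = (".", "?", "!")


def _token(word: Word) -> str:
    """Prefer the punctuated form, fall back to the raw word."""
    return str(word.get("punctuated_word", word["word"]))


def _close(chunk: List[Word]) -> Span:
    """Collapse a run of words into one (start, end, text) span."""
    text = " ".join(_token(w) for w in chunk)
    return (float(chunk[0]["start"]), float(chunk[-1]["end"]), text)


def sentence_spans(words: List[Word]) -> List[Span]:
    """Group per-word timestamps into sentence spans using end punctuation."""
    spans: List[Span] = []
    current: List[Word] = []
    for w in words:
        current.append(w)
        if _token(w).endswith(SENTENCE_END):
            spans.append(_close(current))
            current = []
    if current:  # trailing words with no closing punctuation
        spans.append(_close(current))
    return spans


def snap_clip(spans: List[Span], rough_start: float, rough_end: float) -> Tuple[float, float]:
    """Widen a rough clip range so it starts and ends on sentence boundaries."""
    touching = [s for s in spans if s[1] > rough_start and s[0] < rough_end]
    if not touching:
        return (rough_start, rough_end)  # nothing spoken here, leave as-is
    return (touching[0][0], touching[-1][1])
```

Feed `snap_clip` a rough in/out point (say, from a highlight picker) and it widens the range to the nearest sentence edges, so the clip never starts or ends mid-word.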
why it matters · PySceneDetect
PySceneDetect scans any video and returns the timestamp of every hard cut, the moment the camera switches. For multi-cam podcasts, that's the boundary you need so your 9:16 crop follows the active speaker without drifting on stale frames. It is used in podcast-clipper crop pipelines alongside face tracking, and it is the same library Loya's LYRC export pipeline relies on for scene work. It is free, written in Python, and actively maintained (commits this week).
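As a rough sketch of how cut timestamps feed a crop pipeline: the first function below asks PySceneDetect for scenes and keeps the start time of every scene after the first, and the second splits a clip at those cuts so each segment can get its own crop target. This assumes PySceneDetect 0.6 or later (the `detect` / `ContentDetector` API); `detect_cut_times` and `split_at_cuts` are hypothetical helper names, and nothing here reflects how Loya's pipeline is actually written.

```python
from typing import List, Sequence, Tuple


def detect_cut_times(video_path: str) -> List[float]:
    """Return the time in seconds of each hard cut found by PySceneDetect."""
    # Imported lazily so split_at_cuts stays usable without the library.
    from scenedetect import detect, ContentDetector

    scenes = detect(video_path, ContentDetector())
    # Each scene is a (start, end) pair of timecodes; a cut is the point
    # where any scene after the first one begins.
    return [start.get_seconds() for start, _end in scenes[1:]]


def split_at_cuts(
    clip_start: float, clip_end: float, cut_times: Sequence[float]
) -> List[Tuple[float, float]]:
    """Split a clip into segments so no segment spans a camera cut."""
    inside = sorted(t for t in cut_times if clip_start < t < clip_end)
    edges = [clip_start, *inside, clip_end]
    return list(zip(edges[:-1], edges[1:]))
```

For a clip from 10 s to 30 s with cuts at 12.5 s and 21 s, `split_at_cuts` yields three segments, and the crop position can be recomputed at the start of each one instead of sliding across the cut.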