LR-ASD — The Armory — Escape 9 to 5

why it matters

LR-ASD is a recent open-source active speaker detection model (paper published in Springer's IJCV, 2025). It tells your video pipeline which person in a multi-face frame is actually talking. Accuracy beats the older TalkNet approach and it's 23 times lighter, which makes it fast enough to run on every frame rather than on sampled frames only.

If you're building your own clipping or auto-crop pipeline and accuracy matters more to you than the convenience of a pre-built library, this is the one to drop in. MIT-licensed, free, written in Python.
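To make the auto-crop use case concrete, here is a minimal sketch of what a pipeline might do once it has a speaking score for each face in a frame. It is not LR-ASD's own API: the `FaceScore` record, the score convention (higher means more likely talking, with 0.0 as the cutoff) and both helper functions are assumptions made for illustration. You would fill `FaceScore` from whatever your detection step actually outputs.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class FaceScore:
    """Hypothetical per-frame record for one detected face (not an LR-ASD type)."""
    track_id: int   # identifier of the face track
    x: float        # face-box centre, x coordinate in pixels
    y: float        # face-box centre, y coordinate in pixels
    score: float    # assumed speaking score: higher = more likely talking


def pick_speaker(faces: List[FaceScore], threshold: float = 0.0) -> Optional[FaceScore]:
    """Return the highest-scoring face, or None if nobody clears the threshold."""
    if not faces:
        return None
    best = max(faces, key=lambda f: f.score)
    return best if best.score > threshold else None


def crop_window(center_x: float, frame_w: int, frame_h: int,
                aspect: float = 9 / 16) -> Tuple[int, int, int, int]:
    """Full-height crop (left, top, width, height) centred on center_x.

    The window is shifted back inside the frame when the speaker
    stands near the left or right edge.
    """
    crop_w = min(frame_w, int(frame_h * aspect))
    left = int(center_x - crop_w / 2)
    left = max(0, min(left, frame_w - crop_w))
    return left, 0, crop_w, frame_h


if __name__ == "__main__":
    faces = [
        FaceScore(track_id=0, x=400, y=500, score=-1.2),
        FaceScore(track_id=1, x=1500, y=520, score=2.3),
    ]
    speaker = pick_speaker(faces)
    if speaker is not None:
        print(speaker.track_id, crop_window(speaker.x, 1920, 1080))
```

In a real pipeline you would likely smooth the chosen speaker over several frames before moving the crop, so the window does not jump every time two people talk over each other.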

install

git clone https://github.com/Junhua-Liao/LR-ASD && cd LR-ASD && pip install -r requirements.txt

where to find it

github ★ 109 · updated 1y ago

repo health

possibly unmaintained

No commits in the past year. That doesn't mean it's broken (some small repos are simply "finished"), but if you hit an install issue, it may not get patched quickly.

avoid if

You're not building a pipeline yourself. This is a research model, not a product.

💰 money moves that use this tool

Podcast clipping as a service
Sell short-form clips to podcasters who hate editing, on retainers of $200 to $2K per month per show. Once set up, the pipeline runs while you sleep.

LR-ASD ★★★★★