LR-ASD ★★★★

🆓 free · 🐍 sidecar

The 2025 state-of-the-art for 'which face is actually talking.' Fast, tiny, accurate.

why it matters

LR-ASD is a recent open-source active speaker detection model, published in the International Journal of Computer Vision (Springer, 2025). It tells your video pipeline which person in a multi-face frame is actually talking. Accuracy beats the older TalkNet approach and the model is 23 times lighter, which makes it fast enough to run on every frame, not just samples.

If you're building your own clipping or auto-crop pipeline and accuracy matters more than a pre-built library, this is the one to drop in. MIT, free, Python.
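To make the "which face is talking" step concrete, here is a minimal sketch of the glue code a clipping or auto-crop pipeline might put after the model. It is not LR-ASD's own API: it assumes you have already turned the model's output into one score per tracked face per frame, where a positive score means "speaking" and None means the face is not visible. The function name pick_active_speakers, the window size, and the threshold are all hypothetical.

```python
from statistics import mean


def pick_active_speakers(scores, window=5, threshold=0.0):
    """Pick one active speaker per frame from per-face scores.

    scores: dict mapping a face-track id to a list of per-frame scores
            (assumed convention: positive = speaking, None = face absent).
    Returns a list with one track id (or None if nobody clears the
    threshold) per frame.
    """
    n_frames = max((len(s) for s in scores.values()), default=0)
    half = window // 2
    result = []
    for i in range(n_frames):
        best_id, best_score = None, threshold
        for track_id, s in scores.items():
            # Skip faces that are not on screen in this frame.
            if i >= len(s) or s[i] is None:
                continue
            # Average over a small window so a single noisy frame
            # does not make the crop jump to another face.
            vals = [v for v in s[max(0, i - half): i + half + 1] if v is not None]
            smoothed = mean(vals)
            if smoothed > best_score:
                best_id, best_score = track_id, smoothed
        result.append(best_id)
    return result


if __name__ == "__main__":
    demo = {
        "left": [1.0, 1.0, -1.0, 1.0, 1.0],
        "right": [-1.0, -1.0, 0.5, -1.0, -1.0],
    }
    print(pick_active_speakers(demo, window=3))
```

The smoothing window is the design choice that matters here: per-frame scores tend to flicker, and an auto-crop that follows every flicker is unwatchable. Check the repo for the actual output format before wiring this in.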

install

git clone https://github.com/Junhua-Liao/LR-ASD && cd LR-ASD && pip install -r requirements.txt

where to find it

repo health 109
possibly unmaintained

no commits in 1 year. this doesn't mean it's broken (some small repos are "finished"), but if you hit an install issue, it may not get patched quickly.

avoid if

You're not building a pipeline yourself. This is a research model, not a product.

last reviewed · 2026-04-22 · added 2026-04-22