AnalysisAI Models
8 days ago
AVTrack: New Audio-Visual Speaker Tracking Method
AVTrack is a novel method for tracking active speakers using both audio and visual cues in complex human-centric scenes. The approach aims to enable fine-grained scene understanding for real-world applications.
·
8 days ago