1 day ago
Two papers propose improvements to slot-based video object representation
TSA (Temporal Slot Activation) and Dual-State Slot Attention both aim to keep object identity persistent in unsupervised video decomposition. TSA uses a temporal activation mechanism to reallocate slots, while Dual-State Slot Attention decouples appearance from identity to handle occlusions and reappearances.
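For readers new to slot-based models, the sketch below shows the generic mechanism both papers build on, not either paper's method. It is a simplified slot attention step in NumPy, assuming identity projections and a plain weighted-mean update in place of the learned projections and recurrent update that full models typically use. The names `slot_attention_step` and `track_video` are hypothetical. Carrying slots from one frame to the next is the baseline way a slot keeps following the same object; the temporal activation and appearance/identity decoupling described above are not implemented here.

```python
import numpy as np

def slot_attention_step(slots, inputs, eps=1e-8):
    """One simplified slot attention update (illustrative assumption).

    slots:  (K, D) array of current slot vectors
    inputs: (N, D) array of per-location frame features
    """
    d = slots.shape[1]
    logits = inputs @ slots.T / np.sqrt(d)           # (N, K) similarity scores
    logits -= logits.max(axis=1, keepdims=True)      # numerical stability
    attn = np.exp(logits)
    attn /= attn.sum(axis=1, keepdims=True)          # softmax over slots: slots compete for each input
    # Normalize over inputs so each slot update is a weighted mean of features.
    weights = attn / (attn.sum(axis=0, keepdims=True) + eps)
    return weights.T @ inputs, attn

def track_video(frames, num_slots, iters=3, seed=0):
    """Carry slots across frames so a slot can keep following the same object."""
    rng = np.random.default_rng(seed)
    slots = rng.normal(size=(num_slots, frames[0].shape[1]))
    history = []
    for feats in frames:
        for _ in range(iters):
            slots, attn = slot_attention_step(slots, feats)
        # Record which slot claims each input location in this frame.
        history.append(attn.argmax(axis=1))
    return slots, history
```

In this baseline, nothing prevents a slot from drifting to a different object after an occlusion, which is the failure mode the two papers target.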