April. 30th, 2025: We have performed comparative experiments with mainstream tracking-by-detection methods on the OVIS datasets(see Fig.5). Task-Specific SpatioTemporal Context-Aware Decoupling for ...
Abstract: In complex visual scenarios, people can understand occluded objects through contextual reasoning and association. However, existing video object detection (VOD) methods could not work well.