Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
-
Updated
Aug 21, 2024 - Python
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
[AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
[ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), or Temporal Sentence Grounding in Videos (TSGV))
Pytorch implementation of the paper 'Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Supervised Temporal Video Grounding' (AAAI2024).
Official Implementation of Moment Alignment Transformer
[NCA] Official implementation of the paper Motion2Language, Unsupervised learning of synchronized semantic motion segmentation
[BMVC 2024] Official Implementation of the paper guided attention for interpretable motion captioning
Transformer with Controlled Attention for Synchronous Motion Captioning
This paper presents the VLMI framework to detect activities in complex videos. It combines Swin Transformer video features with language prompts and an EIoU-based similarity measure, enabling accurate, query-driven activity detection and timestamping, handling visual noise and temporal uncertainty without full manual labeling.
Add a description, image, and links to the temporal-grounding topic page so that developers can more easily learn about it.
To associate your repository with the temporal-grounding topic, visit your repo's landing page and select "manage topics."