| Main Authors: | Min, Sunah, Moon, Jinyoung |
|---|---|
| Format: | Preprint |
| Published: | 2021 |
| Online Access: | https://arxiv.org/abs/2109.13572 |
Similar Items
Text-driven Online Action Detection
by: Benavent-Lledo, Manuel, et al.
Published: (2025)
Holi-DETR: Holistic Fashion Item Detection Leveraging Contextual Information
by: Kwon, Youngchae, et al.
Published: (2025)
Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection
by: Kim, Jongha, et al.
Published: (2024)
HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning
by: Kim, Minkuk, et al.
Published: (2024)
Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
by: Kim, Minkuk, et al.
Published: (2024)
Context-Enhanced Memory-Refined Transformer for Online Action Detection
by: Pang, Zhanzhong, et al.
Published: (2025)
Online Action Representation using Change Detection and Symbolic Programming
by: Nair, Vishnu S, et al.
Published: (2024)
Weakly Supervised Video Scene Graph Generation via Natural Language Supervision
by: Kim, Kibum, et al.
Published: (2025)
Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement
by: Jun, Jinyoung, et al.
Published: (2024)
MALT: Multi-scale Action Learning Transformer for Online Action Detection
by: Yang, Zhipeng, et al.
Published: (2024)
Object Aware Egocentric Online Action Detection
by: An, Joungbin, et al.
Published: (2024)
CNG-SFDA: Clean-and-Noisy Region Guided Online-Offline Source-Free Domain Adaptation
by: Cho, Hyeonwoo, et al.
Published: (2024)
Adaptive Self-training Framework for Fine-grained Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2024)
Boundary-Recovering Network for Temporal Action Detection
by: Kim, Jihwan, et al.
Published: (2024)
OnlineTAS: An Online Baseline for Temporal Action Segmentation
by: Zhong, Qing, et al.
Published: (2024)
Ultra-Fast Adaptive Track Detection Network
by: Ni, Hai, et al.
Published: (2024)
Probabilistic Temporal Masked Attention for Cross-view Online Action Detection
by: Xie, Liping, et al.
Published: (2025)
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
by: Yang, Min, et al.
Published: (2023)
Action-Dynamics Modeling and Cross-Temporal Interaction for Online Action Understanding
by: Yang, Xinyu, et al.
Published: (2025)
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO
by: Park, Jinyoung, et al.
Published: (2025)
Item Region-based Style Classification Network (IRSN): A Fashion Style Classifier Based on Domain Knowledge of Fashion Experts
by: Choi, Jinyoung, et al.
Published: (2025)
RegFormer: Transferable Relational Grounding for Efficient Weakly-Supervised Human-Object Interaction Detection
by: Park, Jihwan, et al.
Published: (2026)
CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning
by: Song, Jeonghyo, et al.
Published: (2025)
Online Continuous Generalized Category Discovery
by: Park, Keon-Hee, et al.
Published: (2024)
Towards Efficient Vision State Space Models via Token Merging
by: Park, Jinyoung, et al.
Published: (2025)
Boundary Discretization and Reliable Classification Network for Temporal Action Detection
by: Fang, Zhenying, et al.
Published: (2023)
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
by: Bao, Wentao, et al.
Published: (2024)
Detecting Informative Channels: ActionFormer
by: Zhao, Kunpeng, et al.
Published: (2025)
Test-time Sparsity for Extreme Fast Action Diffusion
by: Ji, Kangye, et al.
Published: (2026)
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition
by: Lee, Jongseo, et al.
Published: (2025)
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection
by: Korban, Matthew, et al.
Published: (2024)
Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for Action Detection
by: Luo, Zhe, et al.
Published: (2024)
Online Temporal Action Localization with Memory-Augmented Transformer
by: Song, Youngkil, et al.
Published: (2024)
Prompt Learning via Meta-Regularization
by: Park, Jinyoung, et al.
Published: (2024)
Motion-aware Memory Network for Fast Video Salient Object Detection
by: Zhao, Xing, et al.
Published: (2022)
Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection
by: Fang, Xiang, et al.
Published: (2024)
GCAgent: Long-Video Understanding via Schematic and Narrative Episodic Memory
by: Yeo, Jeong Hun, et al.
Published: (2025)
OZ-TAL: Online Zero-Shot Temporal Action Localization
by: Han, Chaolei, et al.
Published: (2026)
Disentangled Concepts Speak Louder Than Words: Explainable Video Action Recognition
by: Lee, Jongseo, et al.
Published: (2025)
Multi-dimensional Preference Alignment by Conditioning Reward Itself
by: Jang, Jiho, et al.
Published: (2025)