| Main Authors: | Zhu, Fangrui; Xi, Yunfeng; Ni, Jianmo; Cai, Mu; Gong, Boqing; Zhao, Long; Qu, Chen; Miao, Ian; Li, Yi; Zhong, Cheng; Jiang, Huaizu; Patel, Shwetak |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.06561 |
Similar Items
Towards Flexible Visual Relationship Segmentation
by: Zhu, Fangrui, et al.
Published: (2024)
Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions
by: Han, Zeyu, et al.
Published: (2023)
Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs
by: Zhu, Fangrui, et al.
Published: (2025)
EgoTL: Egocentric Think-Aloud Chains for Long-Horizon Tasks
by: Liu, Lulin, et al.
Published: (2026)
Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
by: Yu, Shoubin, et al.
Published: (2026)
EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning
by: Kulkarni, Yogesh, et al.
Published: (2025)
EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT
by: Pei, Baoqi, et al.
Published: (2025)
AStar: Boosting Multimodal Reasoning with Automated Structured Thinking
by: Wu, Jinyang, et al.
Published: (2025)
EgoIntrospect: An Egocentric Dataset and Benchmark for User-Centric Internal State Reasoning
by: Wang, Zeyu, et al.
Published: (2026)
EgoEsportsQA: An Egocentric Video Benchmark for Perception and Reasoning in Esports
by: Ma, Jianzhe, et al.
Published: (2026)
EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos
by: Li, Yuxuan, et al.
Published: (2025)
EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding
by: Wang, Ziyang, et al.
Published: (2026)
EgoMotion: Hierarchical Reasoning and Diffusion for Egocentric Vision-Language Motion Generation
by: Hou, Ruibing, et al.
Published: (2026)
EgoProx: Evaluating MLLMs on Egocentric 3D Proximity Reasoning Across a Cognitive Hierarchy
by: Li, Jinzhao, et al.
Published: (2026)
EgoPoseVR: Spatiotemporal Multi-Modal Reasoning for Egocentric Full-Body Pose in Virtual Reality
by: Cheng, Haojie, et al.
Published: (2026)
Reinforced Attention Learning
by: Li, Bangzheng, et al.
Published: (2026)
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
by: Tian, Shulin, et al.
Published: (2025)
To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks
by: Gong, Nanxu, et al.
Published: (2026)
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI
by: Cheng, Sijie, et al.
Published: (2024)
EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
by: Hazra, Rishi, et al.
Published: (2023)
EgoExoMem: Cross-View Memory Reasoning over Synchronized Egocentric and Exocentric Videos
by: Liu, Ruiping, et al.
Published: (2026)
TeleEgo: Benchmarking Egocentric AI Assistants in the Wild
by: Yan, Jiaqi, et al.
Published: (2025)
Intuitive and Ubiquitous Fever Monitoring Using Smartphones and Smartwatches
by: Breda, Joseph, et al.
Published: (2021)
ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos
by: Wu, Peiran, et al.
Published: (2025)
TwiSTAR: Think Fast, Think Slow, Then Act, Generative Recommendation with Adaptive Reasoning
by: Cao, Shiteng, et al.
Published: (2026)
EgoMimic: Scaling Imitation Learning via Egocentric Video
by: Kareer, Simar, et al.
Published: (2024)
EgoAVU: Egocentric Audio-Visual Understanding
by: Seth, Ashish, et al.
Published: (2026)
When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
by: Zhang, Xiaoyun, et al.
Published: (2025)
EgoGraph: Temporal Knowledge Graph for Egocentric Video Understanding
by: Sun, Shitong, et al.
Published: (2026)
EgoLife: Towards Egocentric Life Assistant
by: Yang, Jingkang, et al.
Published: (2025)
Nice Fold or Hero Call: Learning Budget-Efficient Thinking for Adaptive Reasoning
by: Zhou, Zhaomeng, et al.
Published: (2026)
Hier-EgoPack: Hierarchical Egocentric Video Understanding with Diverse Task Perspectives
by: Peirone, Simone Alberto, et al.
Published: (2025)
From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning
by: Yang, Xiaoda, et al.
Published: (2026)
EgoLCD: Egocentric Video Generation with Long Context Diffusion
by: Zhang, Liuzhou, et al.
Published: (2025)
EgoLive: A Large-Scale Egocentric Dataset from Real-World Human Tasks
by: Li, Yihang, et al.
Published: (2026)
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
by: Chowdhury, Sanjoy, et al.
Published: (2025)
Think Smart, Not Hard: Difficulty Adaptive Reasoning for Large Audio Language Models
by: Sheng, Zhichao, et al.
Published: (2025)
EgoBridge: Domain Adaptation for Generalizable Imitation from Egocentric Human Data
by: Punamiya, Ryan, et al.
Published: (2025)
UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation
by: Patel, Chaitanya, et al.
Published: (2025)
Lifting Data-Tracing Machine Unlearning to Knowledge-Tracing for Foundation Models
by: Tan, Yuwen, et al.
Published: (2025)