Saved in:
| Main Authors: | Guo, Hang, Zhang, Yuzhen, Gao, Tianci, Su, Junning, Lv, Pei, Xu, Mingliang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.02201 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
by: Yang, Jihan, et al.
Published: (2024)
by: Yang, Jihan, et al.
Published: (2024)
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
by: Long, Lin, et al.
Published: (2025)
by: Long, Lin, et al.
Published: (2025)
Teaching Video Generators to Remember: Eliciting Dynamic Memory for Out-of-Sight State Evolution
by: Xu, Tianshuo, et al.
Published: (2026)
by: Xu, Tianshuo, et al.
Published: (2026)
Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
by: Kim, Minkuk, et al.
Published: (2024)
by: Kim, Minkuk, et al.
Published: (2024)
Grounding by Remembering: Cross-Scene and In-Scene Memory for 3D Functional Affordances
by: Wang, Qirui, et al.
Published: (2026)
by: Wang, Qirui, et al.
Published: (2026)
Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos
by: Feng, Yuang, et al.
Published: (2025)
by: Feng, Yuang, et al.
Published: (2025)
Delving into Mapping Uncertainty for Mapless Trajectory Prediction
by: Zhang, Zongzheng, et al.
Published: (2025)
by: Zhang, Zongzheng, et al.
Published: (2025)
Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction
by: Yang, Yuxin, et al.
Published: (2024)
by: Yang, Yuxin, et al.
Published: (2024)
Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics
by: Pei, Muleilan, et al.
Published: (2025)
by: Pei, Muleilan, et al.
Published: (2025)
Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions
by: Sun, Xiaoxiao, et al.
Published: (2026)
by: Sun, Xiaoxiao, et al.
Published: (2026)
Recall to Predict: Grounding Motion Forecasting in Interpretable Motion Bank
by: Vivekanandan, Abhishek, et al.
Published: (2026)
by: Vivekanandan, Abhishek, et al.
Published: (2026)
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning
by: Xu, Ziwen, et al.
Published: (2026)
by: Xu, Ziwen, et al.
Published: (2026)
Trajectory Prediction Meets Large Language Models: A Survey
by: Xu, Yi, et al.
Published: (2025)
by: Xu, Yi, et al.
Published: (2025)
Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement
by: He, Yulin, et al.
Published: (2024)
by: He, Yulin, et al.
Published: (2024)
Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation
by: Xu, Yunzhe, et al.
Published: (2025)
by: Xu, Yunzhe, et al.
Published: (2025)
Remember Me: Bridging the Long-Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies
by: Gao, Peng, et al.
Published: (2025)
by: Gao, Peng, et al.
Published: (2025)
RemInD: Remembering Anatomical Variations for Interpretable Domain Adaptive Medical Image Segmentation
by: Wang, Xin, et al.
Published: (2025)
by: Wang, Xin, et al.
Published: (2025)
GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction
by: Pei, Muleilan, et al.
Published: (2025)
by: Pei, Muleilan, et al.
Published: (2025)
MDU-Net: Multi-scale Densely Connected U-Net for biomedical image segmentation
by: Zhang, Jiawei, et al.
Published: (2018)
by: Zhang, Jiawei, et al.
Published: (2018)
Dynamic Aware: Adaptive Multi-Mode Out-of-Distribution Detection for Trajectory Prediction in Autonomous Vehicles
by: Guo, Tongfei, et al.
Published: (2025)
by: Guo, Tongfei, et al.
Published: (2025)
Relative Position Matters: Trajectory Prediction and Planning with Polar Representation
by: Zhang, Bozhou, et al.
Published: (2025)
by: Zhang, Bozhou, et al.
Published: (2025)
MR-COSMO: Visual-Text Memory Recall and Direct CrOSs-MOdal Alignment Method for Query-Driven 3D Segmentation
by: Li, Chade, et al.
Published: (2025)
by: Li, Chade, et al.
Published: (2025)
Squeeze-and-Remember Block
by: Cakaj, Rinor, et al.
Published: (2024)
by: Cakaj, Rinor, et al.
Published: (2024)
MRAD: Zero-Shot Anomaly Detection with Memory-Driven Retrieval
by: Xu, Chaoran, et al.
Published: (2026)
by: Xu, Chaoran, et al.
Published: (2026)
Token Bottleneck: One Token to Remember Dynamics
by: Kim, Taekyung, et al.
Published: (2025)
by: Kim, Taekyung, et al.
Published: (2025)
PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion
by: He, Xuewan, et al.
Published: (2025)
by: He, Xuewan, et al.
Published: (2025)
Can VLMs Recall Factual Associations From Visual References?
by: Ashok, Dhananjay, et al.
Published: (2025)
by: Ashok, Dhananjay, et al.
Published: (2025)
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors
by: Ganj, Ashkan, et al.
Published: (2024)
by: Ganj, Ashkan, et al.
Published: (2024)
Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning
by: Huang, Yibin, et al.
Published: (2025)
by: Huang, Yibin, et al.
Published: (2025)
SMEMO: Social Memory for Trajectory Forecasting
by: Marchetti, Francesco, et al.
Published: (2022)
by: Marchetti, Francesco, et al.
Published: (2022)
Social-Pose: Enhancing Trajectory Prediction with Human Body Pose
by: Gao, Yang, et al.
Published: (2025)
by: Gao, Yang, et al.
Published: (2025)
Attend Locally, Remember Linearly: Linear Attention as Cross-Frame Memory for Autoregressive Video Diffusion
by: Li, Kunyang, et al.
Published: (2026)
by: Li, Kunyang, et al.
Published: (2026)
MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction
by: Dong, Jiacheng, et al.
Published: (2026)
by: Dong, Jiacheng, et al.
Published: (2026)
PBP: Path-based Trajectory Prediction for Autonomous Driving
by: Afshar, Sepideh, et al.
Published: (2023)
by: Afshar, Sepideh, et al.
Published: (2023)
Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation
by: Yang, Pinci, et al.
Published: (2025)
by: Yang, Pinci, et al.
Published: (2025)
UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations
by: Hu, Yuzhen, et al.
Published: (2025)
by: Hu, Yuzhen, et al.
Published: (2025)
Semi-Supervised Weed Detection in Vegetable Fields: In-domain and Cross-domain Experiments
by: Deng, Boyang, et al.
Published: (2025)
by: Deng, Boyang, et al.
Published: (2025)
Descriptor Distillation: a Teacher-Student-Regularized Framework for Learning Local Descriptors
by: Liu, Yuzhen, et al.
Published: (2022)
by: Liu, Yuzhen, et al.
Published: (2022)
EAR-Net: Pursuing End-to-End Absolute Rotations from Multi-View Images
by: Liu, Yuzhen, et al.
Published: (2023)
by: Liu, Yuzhen, et al.
Published: (2023)
PolarMAE: Efficient Fetal Ultrasound Pre-training via Semantic Screening and Polar-Guided Masking
by: Lv, Meng, et al.
Published: (2026)
by: Lv, Meng, et al.
Published: (2026)
Similar Items
-
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
by: Yang, Jihan, et al.
Published: (2024) -
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
by: Long, Lin, et al.
Published: (2025) -
Teaching Video Generators to Remember: Eliciting Dynamic Memory for Out-of-Sight State Evolution
by: Xu, Tianshuo, et al.
Published: (2026) -
Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
by: Kim, Minkuk, et al.
Published: (2024) -
Grounding by Remembering: Cross-Scene and In-Scene Memory for 3D Functional Affordances
by: Wang, Qirui, et al.
Published: (2026)