:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Guo, Hang, Zhang, Yuzhen, Gao, Tianci, Su, Junning, Lv, Pei, Xu, Mingliang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2410.02201
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
by: Yang, Jihan, et al.
Published: (2024)

Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
by: Long, Lin, et al.
Published: (2025)

Teaching Video Generators to Remember: Eliciting Dynamic Memory for Out-of-Sight State Evolution
by: Xu, Tianshuo, et al.
Published: (2026)

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
by: Kim, Minkuk, et al.
Published: (2024)

Grounding by Remembering: Cross-Scene and In-Scene Memory for 3D Functional Affordances
by: Wang, Qirui, et al.
Published: (2026)

Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos
by: Feng, Yuang, et al.
Published: (2025)

Delving into Mapping Uncertainty for Mapless Trajectory Prediction
by: Zhang, Zongzheng, et al.
Published: (2025)

Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction
by: Yang, Yuxin, et al.
Published: (2024)

Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics
by: Pei, Muleilan, et al.
Published: (2025)

Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions
by: Sun, Xiaoxiao, et al.
Published: (2026)

Recall to Predict: Grounding Motion Forecasting in Interpretable Motion Bank
by: Vivekanandan, Abhishek, et al.
Published: (2026)

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning
by: Xu, Ziwen, et al.
Published: (2026)

Trajectory Prediction Meets Large Language Models: A Survey
by: Xu, Yi, et al.
Published: (2025)

Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement
by: He, Yulin, et al.
Published: (2024)

Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Language Navigation
by: Xu, Yunzhe, et al.
Published: (2025)

Remember Me: Bridging the Long-Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies
by: Gao, Peng, et al.
Published: (2025)

RemInD: Remembering Anatomical Variations for Interpretable Domain Adaptive Medical Image Segmentation
by: Wang, Xin, et al.
Published: (2025)

GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction
by: Pei, Muleilan, et al.
Published: (2025)

MDU-Net: Multi-scale Densely Connected U-Net for biomedical image segmentation
by: Zhang, Jiawei, et al.
Published: (2018)

Dynamic Aware: Adaptive Multi-Mode Out-of-Distribution Detection for Trajectory Prediction in Autonomous Vehicles
by: Guo, Tongfei, et al.
Published: (2025)

Relative Position Matters: Trajectory Prediction and Planning with Polar Representation
by: Zhang, Bozhou, et al.
Published: (2025)

MR-COSMO: Visual-Text Memory Recall and Direct CrOSs-MOdal Alignment Method for Query-Driven 3D Segmentation
by: Li, Chade, et al.
Published: (2025)

Squeeze-and-Remember Block
by: Cakaj, Rinor, et al.
Published: (2024)

MRAD: Zero-Shot Anomaly Detection with Memory-Driven Retrieval
by: Xu, Chaoran, et al.
Published: (2026)

Token Bottleneck: One Token to Remember Dynamics
by: Kim, Taekyung, et al.
Published: (2025)

PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion
by: He, Xuewan, et al.
Published: (2025)

Can VLMs Recall Factual Associations From Visual References?
by: Ashok, Dhananjay, et al.
Published: (2025)

HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors
by: Ganj, Ashkan, et al.
Published: (2024)

Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning
by: Huang, Yibin, et al.
Published: (2025)

SMEMO: Social Memory for Trajectory Forecasting
by: Marchetti, Francesco, et al.
Published: (2022)

Social-Pose: Enhancing Trajectory Prediction with Human Body Pose
by: Gao, Yang, et al.
Published: (2025)

Attend Locally, Remember Linearly: Linear Attention as Cross-Frame Memory for Autoregressive Video Diffusion
by: Li, Kunyang, et al.
Published: (2026)

MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction
by: Dong, Jiacheng, et al.
Published: (2026)

PBP: Path-based Trajectory Prediction for Autonomous Driving
by: Afshar, Sepideh, et al.
Published: (2023)

Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation
by: Yang, Pinci, et al.
Published: (2025)

UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations
by: Hu, Yuzhen, et al.
Published: (2025)

Semi-Supervised Weed Detection in Vegetable Fields: In-domain and Cross-domain Experiments
by: Deng, Boyang, et al.
Published: (2025)

Descriptor Distillation: a Teacher-Student-Regularized Framework for Learning Local Descriptors
by: Liu, Yuzhen, et al.
Published: (2022)

EAR-Net: Pursuing End-to-End Absolute Rotations from Multi-View Images
by: Liu, Yuzhen, et al.
Published: (2023)

PolarMAE: Efficient Fetal Ultrasound Pre-training via Semantic Screening and Polar-Guided Masking
by: Lv, Meng, et al.
Published: (2026)