:: Library Catalog

Bibliographic Details
Main Authors:	Cheng, Jintao; Li, Weibin; Luo, Jiehao; Tang, Xiaoyu; He, Zhijian; Wu, Jin; Zou, Yao; Zhang, Wei
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2509.02129

Similar Items

OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition
by: Xiang, Qiuchi, et al.
Published: (2024)

Beyond First-Order: Learning Riemannian Geometries for Invariant Visual Place Recognition
by: Cheng, Jintao, et al.
Published: (2026)

A Pseudo Global Fusion Paradigm-Based Cross-View Network for LiDAR-Based Place Recognition
by: Cheng, Jintao, et al.
Published: (2025)

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
by: Yu, Zhuoran, et al.
Published: (2023)

Decoding Ambiguous Emotions with Test-Time Scaling in Audio-Language Models
by: Jia, Hong, et al.
Published: (2026)

Don't Just Fine-tune the Agent, Tune the Environment
by: Lu, Siyuan, et al.
Published: (2025)

Grow, Don't Overwrite: Fine-tuning Without Forgetting
by: Adila, Dyah, et al.
Published: (2026)

MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation
by: Luo, Jiehao, et al.
Published: (2025)

Surely Large Multimodal Models (Don't) Excel in Visual Species Recognition?
by: Liu, Tian, et al.
Published: (2025)

OptiSAR-Net++: A Large-Scale Benchmark and Transformer-Free Framework for Cross-Domain Remote Sensing Visual Grounding
by: Tang, Xiaoyu, et al.
Published: (2026)

Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty
by: Shi, Zeyu, et al.
Published: (2025)

VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization
by: Waheed, Sania, et al.
Published: (2025)

What LLMs Think When You Don't Tell Them What to Think About?
by: Kwon, Yongchan, et al.
Published: (2026)

Don't Forget the Nonlinearity: Unlocking Activation Functions in Efficient Fine-Tuning
by: Yin, Bo, et al.
Published: (2025)

Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs
by: Kuchibhotla, Hari Chandana, et al.
Published: (2025)

Don't Plan for Sick Time
by: Rebecca Weaver
Published: (2024)

Don't Mind Your Time
by: Rebecca Weaver
Published: (2025)

Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather
by: He, Zhijian, et al.
Published: (2025)

EDTformer: An Efficient Decoder Transformer for Visual Place Recognition
by: Jin, Tong, et al.
Published: (2024)

Don't Get Me Wrong: How to Apply Deep Visual Interpretations to Time Series
by: Loeffler, Christoffer, et al.
Published: (2022)

Sample, Don't Search: Rethinking Test-Time Alignment for Language Models
by: Faria, Gonçalo, et al.
Published: (2025)

Don't Use LLMs to Make Relevance Judgments
by: Soboroff, Ian
Published: (2024)

VLA-IAP: Training-Free Visual Token Pruning via Interaction Alignment for Vision-Language-Action Models
by: Cheng, Jintao, et al.
Published: (2026)

Don’t Test Twice, It’s All Right
by: Zoe Raglow, et al.
Published: (2025)

MAVEN: Multi-Agent Verification-Elaboration Network with In-Step Epistemic Auditing
by: Yao, Yinsheng, et al.
Published: (2026)

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
by: Chen, Yukang, et al.
Published: (2023)

Speech Emotion Recognition Via CNN-Transformer and Multidimensional Attention Mechanism
by: Tang, Xiaoyu, et al.
Published: (2024)

3rd Place Solution to Large-scale Fine-grained Food Recognition
by: Zhong, Yang, et al.
Published: (2025)

Don't be salesmen
Published: (1997)

You Sense Only Once Beneath: Ultra-Light Real-Time Underwater Object Detection
by: Dong, Jun, et al.
Published: (2025)

Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition
by: Kiu, Yu, et al.
Published: (2025)

Don't Get Too Excited -- Eliciting Emotions in LLMs
by: Fazzi, Gino Franco, et al.
Published: (2025)

Structured Pruning for Efficient Visual Place Recognition
by: Grainge, Oliver, et al.
Published: (2024)

Don't Blink: Evidence Collapse during Multimodal Reasoning
by: Raghu, Suresh, et al.
Published: (2026)

Start Making Sense: Libraries Don't Have To Be Confusing Places for Kids with Reading Disabilities.
by: Gorman, Audrey J.
Published: (1999)

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition
by: Xie, Weidong, et al.
Published: (2024)

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
by: Li, Juncheng, et al.
Published: (2023)

Distillation Improves Visual Place Recognition for Low Quality Images
by: Yang, Anbang, et al.
Published: (2023)

Don't Judge a Book by its Cover: Testing LLMs' Robustness Under Logical Obfuscation
by: Borah, Abhilekh, et al.
Published: (2026)

Don't double it: Efficient Agent Prediction in Occlusions
by: Rothenhäusler, Anna, et al.
Published: (2026)