:: Library Catalog

Bibliographic Details
Main Authors:	Han, Xu; Tang, Yuan; Xu, Jinfeng; Li, Xianzhi
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Machine Learning
Online Access:	https://arxiv.org/abs/2503.18368

Similar Items

Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model
by: Han, Xu, et al.
Published: (2024)

MoST: Multi-modality Scene Tokenization for Motion Prediction
by: Mu, Norman, et al.
Published: (2024)

MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
by: Tang, Yuan, et al.
Published: (2024)

MonarchRT: Efficient Attention for Real-Time Video Generation
by: Agarwal, Krish, et al.
Published: (2026)

MoST: Motion Style Transformer between Diverse Action Contents
by: Kim, Boeun, et al.
Published: (2024)

LumiX: Structured and Coherent Text-to-Intrinsic Generation
by: Han, Xu, et al.
Published: (2025)

PointDreamer: Zero-shot 3D Textured Mesh Reconstruction from Colored Point Cloud
by: Yu, Qiao, et al.
Published: (2024)

More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding
by: Tang, Yuan, et al.
Published: (2024)

TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
by: Guo, Wenxuan, et al.
Published: (2025)

GRPO-RM: Fine-Tuning Representation Models via GRPO-Driven Reinforcement Learning
by: Xu, Yanchen, et al.
Published: (2025)

CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling
by: Wang, Xinze, et al.
Published: (2025)

PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation
by: Xu, Jinfeng, et al.
Published: (2024)

SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds
by: Xu, Jinfeng, et al.
Published: (2025)

FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation
by: Ma, Yuting, et al.
Published: (2024)

MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering
by: Zhu, Jian, et al.
Published: (2025)

Unsupervised Representation Learning from Sparse Transformation Analysis
by: Song, Yue, et al.
Published: (2024)

Sparse Autoencoders for Interpretable Medical Image Representation Learning
by: Wesp, Philipp, et al.
Published: (2026)

RoPeSLR: 3D RoPE-driven Sparse-LowRank Attention for Efficient Diffusion Transformers
by: Liu, Yuxi, et al.
Published: (2026)

3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
by: Ze, Yanjie, et al.
Published: (2024)

Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
by: Zhang, Xinxi, et al.
Published: (2024)

Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
by: Tang, Yiwen, et al.
Published: (2023)

LoST: Level of Semantics Tokenization for 3D Shapes
by: Dutt, Niladri Shekhar, et al.
Published: (2026)

SRL-SOA: Self-Representation Learning with Sparse 1D-Operational Autoencoder for Hyperspectral Image Band Selection
by: Ahishali, Mete, et al.
Published: (2022)

Expanding Sparse Tuning for Low Memory Usage
by: Shen, Shufan, et al.
Published: (2024)

CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning
by: Yao, Zhenquan, et al.
Published: (2026)

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
by: Wang, Zedong, et al.
Published: (2025)

Towards Sparse Video Understanding and Reasoning
by: Xu, Chenwei, et al.
Published: (2026)

MOFI: Learning Image Representations from Noisy Entity Annotated Images
by: Wu, Wentao, et al.
Published: (2023)

Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
by: Jie, Shibo, et al.
Published: (2024)

Probing the Representational Power of Sparse Autoencoders in Vision Models
by: Olson, Matthew Lyle, et al.
Published: (2025)

When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing
by: Sun, Libo, et al.
Published: (2026)

Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference
by: Ahmed, Sk Miraj, et al.
Published: (2026)

GRPO-TTA: Test-Time Visual Tuning for Vision-Language Models via GRPO-Driven Reinforcement Learning
by: Li, Yujun, et al.
Published: (2026)

MoP-CLIP: A Mixture of Prompt-Tuned CLIP Models for Domain Incremental Learning
by: Nicolas, Julien, et al.
Published: (2023)

Neighbour-level Message Interaction Encoding for Improved Representation Learning on Graphs
by: Zhang, Haimin, et al.
Published: (2024)

LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes
by: Xu, Xiang, et al.
Published: (2025)

Trustworthy Personalized Bayesian Federated Learning via Posterior Fine-Tune
by: Luo, Mengen, et al.
Published: (2024)

Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images
by: Zhang, Yundi, et al.
Published: (2024)

RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting
by: Li, Zhan, et al.
Published: (2025)

Mixed Autoencoder for Self-supervised Visual Representation Learning
by: Chen, Kai, et al.
Published: (2023)