Saved in:
| Main Authors: | Han, Xu, Tang, Yuan, Xu, Jinfeng, Li, Xianzhi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.18368 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model
by: Han, Xu, et al.
Published: (2024)
by: Han, Xu, et al.
Published: (2024)
MoST: Multi-modality Scene Tokenization for Motion Prediction
by: Mu, Norman, et al.
Published: (2024)
by: Mu, Norman, et al.
Published: (2024)
MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
by: Tang, Yuan, et al.
Published: (2024)
by: Tang, Yuan, et al.
Published: (2024)
MonarchRT: Efficient Attention for Real-Time Video Generation
by: Agarwal, Krish, et al.
Published: (2026)
by: Agarwal, Krish, et al.
Published: (2026)
MoST: Motion Style Transformer between Diverse Action Contents
by: Kim, Boeun, et al.
Published: (2024)
by: Kim, Boeun, et al.
Published: (2024)
LumiX: Structured and Coherent Text-to-Intrinsic Generation
by: Han, Xu, et al.
Published: (2025)
by: Han, Xu, et al.
Published: (2025)
PointDreamer: Zero-shot 3D Textured Mesh Reconstruction from Colored Point Cloud
by: Yu, Qiao, et al.
Published: (2024)
by: Yu, Qiao, et al.
Published: (2024)
More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding
by: Tang, Yuan, et al.
Published: (2024)
by: Tang, Yuan, et al.
Published: (2024)
TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
by: Guo, Wenxuan, et al.
Published: (2025)
by: Guo, Wenxuan, et al.
Published: (2025)
GRPO-RM: Fine-Tuning Representation Models via GRPO-Driven Reinforcement Learning
by: Xu, Yanchen, et al.
Published: (2025)
by: Xu, Yanchen, et al.
Published: (2025)
CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling
by: Wang, Xinze, et al.
Published: (2025)
by: Wang, Xinze, et al.
Published: (2025)
PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation
by: Xu, Jinfeng, et al.
Published: (2024)
by: Xu, Jinfeng, et al.
Published: (2024)
SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds
by: Xu, Jinfeng, et al.
Published: (2025)
by: Xu, Jinfeng, et al.
Published: (2025)
FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation
by: Ma, Yuting, et al.
Published: (2024)
by: Ma, Yuting, et al.
Published: (2024)
MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering
by: Zhu, Jian, et al.
Published: (2025)
by: Zhu, Jian, et al.
Published: (2025)
Unsupervised Representation Learning from Sparse Transformation Analysis
by: Song, Yue, et al.
Published: (2024)
by: Song, Yue, et al.
Published: (2024)
Sparse Autoencoders for Interpretable Medical Image Representation Learning
by: Wesp, Philipp, et al.
Published: (2026)
by: Wesp, Philipp, et al.
Published: (2026)
RoPeSLR: 3D RoPE-driven Sparse-LowRank Attention for Efficient Diffusion Transformers
by: Liu, Yuxi, et al.
Published: (2026)
by: Liu, Yuxi, et al.
Published: (2026)
3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
by: Ze, Yanjie, et al.
Published: (2024)
by: Ze, Yanjie, et al.
Published: (2024)
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
by: Zhang, Xinxi, et al.
Published: (2024)
by: Zhang, Xinxi, et al.
Published: (2024)
Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
by: Tang, Yiwen, et al.
Published: (2023)
by: Tang, Yiwen, et al.
Published: (2023)
LoST: Level of Semantics Tokenization for 3D Shapes
by: Dutt, Niladri Shekhar, et al.
Published: (2026)
by: Dutt, Niladri Shekhar, et al.
Published: (2026)
SRL-SOA: Self-Representation Learning with Sparse 1D-Operational Autoencoder for Hyperspectral Image Band Selection
by: Ahishali, Mete, et al.
Published: (2022)
by: Ahishali, Mete, et al.
Published: (2022)
Expanding Sparse Tuning for Low Memory Usage
by: Shen, Shufan, et al.
Published: (2024)
by: Shen, Shufan, et al.
Published: (2024)
CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning
by: Yao, Zhenquan, et al.
Published: (2026)
by: Yao, Zhenquan, et al.
Published: (2026)
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
by: Wang, Zedong, et al.
Published: (2025)
by: Wang, Zedong, et al.
Published: (2025)
Towards Sparse Video Understanding and Reasoning
by: Xu, Chenwei, et al.
Published: (2026)
by: Xu, Chenwei, et al.
Published: (2026)
MOFI: Learning Image Representations from Noisy Entity Annotated Images
by: Wu, Wentao, et al.
Published: (2023)
by: Wu, Wentao, et al.
Published: (2023)
Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
by: Jie, Shibo, et al.
Published: (2024)
by: Jie, Shibo, et al.
Published: (2024)
Probing the Representational Power of Sparse Autoencoders in Vision Models
by: Olson, Matthew Lyle, et al.
Published: (2025)
by: Olson, Matthew Lyle, et al.
Published: (2025)
When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing
by: Sun, Libo, et al.
Published: (2026)
by: Sun, Libo, et al.
Published: (2026)
Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference
by: Ahmed, Sk Miraj, et al.
Published: (2026)
by: Ahmed, Sk Miraj, et al.
Published: (2026)
GRPO-TTA: Test-Time Visual Tuning for Vision-Language Models via GRPO-Driven Reinforcement Learning
by: Li, Yujun, et al.
Published: (2026)
by: Li, Yujun, et al.
Published: (2026)
MoP-CLIP: A Mixture of Prompt-Tuned CLIP Models for Domain Incremental Learning
by: Nicolas, Julien, et al.
Published: (2023)
by: Nicolas, Julien, et al.
Published: (2023)
Neighbour-level Message Interaction Encoding for Improved Representation Learning on Graphs
by: Zhang, Haimin, et al.
Published: (2024)
by: Zhang, Haimin, et al.
Published: (2024)
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes
by: Xu, Xiang, et al.
Published: (2025)
by: Xu, Xiang, et al.
Published: (2025)
Trustworthy Personalized Bayesian Federated Learning via Posterior Fine-Tune
by: Luo, Mengen, et al.
Published: (2024)
by: Luo, Mengen, et al.
Published: (2024)
Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images
by: Zhang, Yundi, et al.
Published: (2024)
by: Zhang, Yundi, et al.
Published: (2024)
RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting
by: Li, Zhan, et al.
Published: (2025)
by: Li, Zhan, et al.
Published: (2025)
Mixed Autoencoder for Self-supervised Visual Representation Learning
by: Chen, Kai, et al.
Published: (2023)
by: Chen, Kai, et al.
Published: (2023)
Similar Items
-
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model
by: Han, Xu, et al.
Published: (2024) -
MoST: Multi-modality Scene Tokenization for Motion Prediction
by: Mu, Norman, et al.
Published: (2024) -
MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
by: Tang, Yuan, et al.
Published: (2024) -
MonarchRT: Efficient Attention for Real-Time Video Generation
by: Agarwal, Krish, et al.
Published: (2026) -
MoST: Motion Style Transformer between Diverse Action Contents
by: Kim, Boeun, et al.
Published: (2024)