Saved in:
| Main Authors: | Han, Cheng, Lu, Yawen, Sun, Guohao, Liang, James C., Cao, Zhiwen, Wang, Qifan, Guan, Qiang, Dianat, Sohail A., Rao, Raghuveer M., Geng, Tong, Tao, Zhiqiang, Liu, Dongfang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.01559 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ProMotion: Prototypes As Motion Learners
by: Lu, Yawen, et al.
Published: (2024)
by: Lu, Yawen, et al.
Published: (2024)
Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
by: Wang, Jiamian, et al.
Published: (2024)
by: Wang, Jiamian, et al.
Published: (2024)
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
by: Han, Cheng, et al.
Published: (2024)
by: Han, Cheng, et al.
Published: (2024)
Image Translation as Diffusion Visual Programmers
by: Han, Cheng, et al.
Published: (2024)
by: Han, Cheng, et al.
Published: (2024)
MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper
by: Zeng, Runjia, et al.
Published: (2025)
by: Zeng, Runjia, et al.
Published: (2025)
Latent Chain-of-Thought for Visual Reasoning
by: Sun, Guohao, et al.
Published: (2025)
by: Sun, Guohao, et al.
Published: (2025)
X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)
Re-Imagining Multimodal Instruction Tuning: A Representation View
by: Liu, Yiyang, et al.
Published: (2025)
by: Liu, Yiyang, et al.
Published: (2025)
Visual Self-Refinement for Autoregressive Models
by: Wang, Jiamian, et al.
Published: (2025)
by: Wang, Jiamian, et al.
Published: (2025)
Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled Data
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)
Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)
Radiance Field Learners As UAV First-Person Viewers
by: Yan, Liqi, et al.
Published: (2024)
by: Yan, Liqi, et al.
Published: (2024)
Probabilistic Token Alignment for Large Language Model Fusion
by: Zeng, Runjia, et al.
Published: (2025)
by: Zeng, Runjia, et al.
Published: (2025)
Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
by: Cheng, Zhiyuan, et al.
Published: (2024)
by: Cheng, Zhiyuan, et al.
Published: (2024)
Visual Fourier Prompt Tuning
by: Zeng, Runjia, et al.
Published: (2024)
by: Zeng, Runjia, et al.
Published: (2024)
TokenMotion: Motion-Guided Vision Transformer for Video Camouflaged Object Detection Via Learnable Token Selection
by: Yu, Zifan, et al.
Published: (2023)
by: Yu, Zifan, et al.
Published: (2023)
TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching
by: Zeng, Runjia, et al.
Published: (2026)
by: Zeng, Runjia, et al.
Published: (2026)
A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation Learning
by: Liu, Changyu, et al.
Published: (2026)
by: Liu, Changyu, et al.
Published: (2026)
Comparing theApproaches of International Political Economy MacroStrategy of the UnitedStates of America and China Towards the Islamic Republic of Iran
by: Dianat, Hossein
Published: (2024)
by: Dianat, Hossein
Published: (2024)
Proto-Former: Unified Facial Landmark Detection by Prototype Transformer
by: Hu, Shengkai, et al.
Published: (2025)
by: Hu, Shengkai, et al.
Published: (2025)
Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation
by: Havaldar, Shreyas, et al.
Published: (2023)
by: Havaldar, Shreyas, et al.
Published: (2023)
Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?
by: Han, Cheng, et al.
Published: (2024)
by: Han, Cheng, et al.
Published: (2024)
Credibility in Second-Price Auctions: An Experimental Test
by: Dianat, Ahrash, et al.
Published: (2021)
by: Dianat, Ahrash, et al.
Published: (2021)
MotionRFT: Unified Reinforcement Fine-Tuning for Text-to-Motion Generation
by: Tan, Xiaofeng, et al.
Published: (2026)
by: Tan, Xiaofeng, et al.
Published: (2026)
Visual Agents as Fast and Slow Thinkers
by: Sun, Guangyan, et al.
Published: (2024)
by: Sun, Guangyan, et al.
Published: (2024)
Graph Positional Autoencoders as Self-supervised Learners
by: Liu, Yang, et al.
Published: (2025)
by: Liu, Yang, et al.
Published: (2025)
Addressing Skewed Heterogeneity via Federated Prototype Rectification with Personalization
by: Guo, Shunxin, et al.
Published: (2024)
by: Guo, Shunxin, et al.
Published: (2024)
PersonaGest: Personalized Co-Speech Gesture Generation with Semantic-Guided Hierarchical Motion Representation
by: Zhao, Junchuan, et al.
Published: (2026)
by: Zhao, Junchuan, et al.
Published: (2026)
Transformers as Multi-task Learners: Decoupling Features in Hidden Markov Models
by: Hao, Yifan, et al.
Published: (2025)
by: Hao, Yifan, et al.
Published: (2025)
RaCT: Ranking-aware Chain-of-Thought Optimization for LLMs
by: Liu, Haowei, et al.
Published: (2024)
by: Liu, Haowei, et al.
Published: (2024)
STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical Question-Answering
by: Sun, Guohao, et al.
Published: (2024)
by: Sun, Guohao, et al.
Published: (2024)
Movable Antenna for Wireless Communications:Prototyping and Experimental Results
by: Dong, Zhenjun, et al.
Published: (2024)
by: Dong, Zhenjun, et al.
Published: (2024)
All You Need is One: Capsule Prompt Tuning with a Single Vector
by: Liu, Yiyang, et al.
Published: (2025)
by: Liu, Yiyang, et al.
Published: (2025)
MLP Can Be A Good Transformer Learner
by: Lin, Sihao, et al.
Published: (2024)
by: Lin, Sihao, et al.
Published: (2024)
Inertial Confinement Fusion Forecasting via Large Language Models
by: Chen, Mingkai, et al.
Published: (2024)
by: Chen, Mingkai, et al.
Published: (2024)
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
by: Wang, Taowen, et al.
Published: (2024)
by: Wang, Taowen, et al.
Published: (2024)
TokenBlowUp: Resolving Representational Singularities in LLM Token Spaces via Monoidal Transformations
by: Zhao, Dongfang
Published: (2025)
by: Zhao, Dongfang
Published: (2025)
Q-Bridge: Code Translation for Quantum Machine Learning via LLMs
by: Zeng, Runjia, et al.
Published: (2026)
by: Zeng, Runjia, et al.
Published: (2026)
FRA-DiagSys: A Transformer Winding Fault Diagnosis System for Identifying Fault Types and degrees Using Frequency Response Analysis
by: Wang, Guohao
Published: (2024)
by: Wang, Guohao
Published: (2024)
Unified Framework for Direct Characterization of Kraus Operators, Observables, Density Matrices, and Weak Values Without Weak Interaction
by: Sahil, et al.
Published: (2025)
by: Sahil, et al.
Published: (2025)
Similar Items
-
ProMotion: Prototypes As Motion Learners
by: Lu, Yawen, et al.
Published: (2024) -
Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
by: Wang, Jiamian, et al.
Published: (2024) -
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
by: Han, Cheng, et al.
Published: (2024) -
Image Translation as Diffusion Visual Programmers
by: Han, Cheng, et al.
Published: (2024) -
MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper
by: Zeng, Runjia, et al.
Published: (2025)