:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Han, Cheng, Lu, Yawen, Sun, Guohao, Liang, James C., Cao, Zhiwen, Wang, Qifan, Guan, Qiang, Dianat, Sohail A., Rao, Raghuveer M., Geng, Tong, Tao, Zhiqiang, Liu, Dongfang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2406.01559
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ProMotion: Prototypes As Motion Learners
by: Lu, Yawen, et al.
Published: (2024)

Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
by: Wang, Jiamian, et al.
Published: (2024)

AMD: Automatic Multi-step Distillation of Large-scale Vision Models
by: Han, Cheng, et al.
Published: (2024)

Image Translation as Diffusion Visual Programmers
by: Han, Cheng, et al.
Published: (2024)

MEPT: Mixture of Expert Prompt Tuning as a Manifold Mapper
by: Zeng, Runjia, et al.
Published: (2025)

Latent Chain-of-Thought for Visual Reasoning
by: Sun, Guohao, et al.
Published: (2025)

X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)

Re-Imagining Multimodal Instruction Tuning: A Representation View
by: Liu, Yiyang, et al.
Published: (2025)

Visual Self-Refinement for Autoregressive Models
by: Wang, Jiamian, et al.
Published: (2025)

Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled Data
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)

Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation
by: Pulakurthi, Prasanna Reddy, et al.
Published: (2025)

Radiance Field Learners As UAV First-Person Viewers
by: Yan, Liqi, et al.
Published: (2024)

Probabilistic Token Alignment for Large Language Model Fusion
by: Zeng, Runjia, et al.
Published: (2025)

Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
by: Cheng, Zhiyuan, et al.
Published: (2024)

Visual Fourier Prompt Tuning
by: Zeng, Runjia, et al.
Published: (2024)

TokenMotion: Motion-Guided Vision Transformer for Video Camouflaged Object Detection Via Learnable Token Selection
by: Yu, Zifan, et al.
Published: (2023)

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching
by: Zeng, Runjia, et al.
Published: (2026)

A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation Learning
by: Liu, Changyu, et al.
Published: (2026)

Comparing theApproaches of International Political Economy MacroStrategy of the UnitedStates of America and China Towards the Islamic Republic of Iran
by: Dianat, Hossein
Published: (2024)

Proto-Former: Unified Facial Landmark Detection by Prototype Transformer
by: Hu, Shengkai, et al.
Published: (2025)

Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation
by: Havaldar, Shreyas, et al.
Published: (2023)

Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?
by: Han, Cheng, et al.
Published: (2024)

Credibility in Second-Price Auctions: An Experimental Test
by: Dianat, Ahrash, et al.
Published: (2021)

MotionRFT: Unified Reinforcement Fine-Tuning for Text-to-Motion Generation
by: Tan, Xiaofeng, et al.
Published: (2026)

Visual Agents as Fast and Slow Thinkers
by: Sun, Guangyan, et al.
Published: (2024)

Graph Positional Autoencoders as Self-supervised Learners
by: Liu, Yang, et al.
Published: (2025)

Addressing Skewed Heterogeneity via Federated Prototype Rectification with Personalization
by: Guo, Shunxin, et al.
Published: (2024)

PersonaGest: Personalized Co-Speech Gesture Generation with Semantic-Guided Hierarchical Motion Representation
by: Zhao, Junchuan, et al.
Published: (2026)

Transformers as Multi-task Learners: Decoupling Features in Hidden Markov Models
by: Hao, Yifan, et al.
Published: (2025)

RaCT: Ranking-aware Chain-of-Thought Optimization for LLMs
by: Liu, Haowei, et al.
Published: (2024)

STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical Question-Answering
by: Sun, Guohao, et al.
Published: (2024)

Movable Antenna for Wireless Communications:Prototyping and Experimental Results
by: Dong, Zhenjun, et al.
Published: (2024)

All You Need is One: Capsule Prompt Tuning with a Single Vector
by: Liu, Yiyang, et al.
Published: (2025)

MLP Can Be A Good Transformer Learner
by: Lin, Sihao, et al.
Published: (2024)

Inertial Confinement Fusion Forecasting via Large Language Models
by: Chen, Mingkai, et al.
Published: (2024)

Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
by: Wang, Taowen, et al.
Published: (2024)

TokenBlowUp: Resolving Representational Singularities in LLM Token Spaces via Monoidal Transformations
by: Zhao, Dongfang
Published: (2025)

Q-Bridge: Code Translation for Quantum Machine Learning via LLMs
by: Zeng, Runjia, et al.
Published: (2026)

FRA-DiagSys: A Transformer Winding Fault Diagnosis System for Identifying Fault Types and degrees Using Frequency Response Analysis
by: Wang, Guohao
Published: (2024)

Unified Framework for Direct Characterization of Kraus Operators, Observables, Density Matrices, and Weak Values Without Weak Interaction
by: Sahil, et al.
Published: (2025)