Saved in:
| Main Authors: | Chen, Zhi-Kai, Jiang, Jun-Peng, Tao, Jun-Jie, Zhan, De-Chuan, Ye, Han-Jia |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.01858 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation
by: Chen, Zhi-Kai, et al.
Published: (2025)
by: Chen, Zhi-Kai, et al.
Published: (2025)
Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing
by: Zhang, Yi-Kai, et al.
Published: (2025)
by: Zhang, Yi-Kai, et al.
Published: (2025)
TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering
by: Ji, An-Yang, et al.
Published: (2026)
by: Ji, An-Yang, et al.
Published: (2026)
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters
by: Lab, Mind, et al.
Published: (2026)
by: Lab, Mind, et al.
Published: (2026)
SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning
by: Xie, Zhen-Hao, et al.
Published: (2026)
by: Xie, Zhen-Hao, et al.
Published: (2026)
Task-Agnostic Guided Feature Expansion for Class-Incremental Learning
by: Zheng, Bowen, et al.
Published: (2025)
by: Zheng, Bowen, et al.
Published: (2025)
StyleBooth: Image Style Editing with Multimodal Instruction
by: Han, Zhen, et al.
Published: (2024)
by: Han, Zhen, et al.
Published: (2024)
Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need
by: Zhou, Da-Wei, et al.
Published: (2023)
by: Zhou, Da-Wei, et al.
Published: (2023)
TV100: A TV Series Dataset that Pre-Trained CLIP Has Not Seen
by: Zhou, Da-Wei, et al.
Published: (2024)
by: Zhou, Da-Wei, et al.
Published: (2024)
VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions
by: Zhan, Jun, et al.
Published: (2025)
by: Zhan, Jun, et al.
Published: (2025)
Consistency-Guided Temperature Scaling Using Style and Content Information for Out-of-Domain Calibration
by: Choi, Wonjeong, et al.
Published: (2024)
by: Choi, Wonjeong, et al.
Published: (2024)
StyleMark: A Robust Watermarking Method for Art Style Images Against Black-Box Arbitrary Style Transfer
by: Zhang, Yunming, et al.
Published: (2024)
by: Zhang, Yunming, et al.
Published: (2024)
BOFA: Bridge-Layer Orthogonal Low-Rank Fusion for CLIP-Based Class-Incremental Learning
by: Li, Lan, et al.
Published: (2025)
by: Li, Lan, et al.
Published: (2025)
SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion
by: Han, Lu, et al.
Published: (2024)
by: Han, Lu, et al.
Published: (2024)
MIETT: Multi-Instance Encrypted Traffic Transformer for Encrypted Traffic Classification
by: Chen, Xu-Yang, et al.
Published: (2024)
by: Chen, Xu-Yang, et al.
Published: (2024)
Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning
by: Qi, Zhi-Hong, et al.
Published: (2024)
by: Qi, Zhi-Hong, et al.
Published: (2024)
Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification
by: Yi, Chao, et al.
Published: (2024)
by: Yi, Chao, et al.
Published: (2024)
Model Assembly Learning with Heterogeneous Layer Weight Merging
by: Zhang, Yi-Kai, et al.
Published: (2025)
by: Zhang, Yi-Kai, et al.
Published: (2025)
Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual Attacks
by: Xie, Yong, et al.
Published: (2024)
by: Xie, Yong, et al.
Published: (2024)
Polaris
by: Bessels, Emil, et al.
Published: (2017)
by: Bessels, Emil, et al.
Published: (2017)
Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification
by: Zhu, Lanyun, et al.
Published: (2025)
by: Zhu, Lanyun, et al.
Published: (2025)
Addressing Imbalanced Domain-Incremental Learning through Dual-Balance Collaborative Experts
by: Li, Lan, et al.
Published: (2025)
by: Li, Lan, et al.
Published: (2025)
Bridge the Modality and Capability Gaps in Vision-Language Model Selection
by: Yi, Chao, et al.
Published: (2024)
by: Yi, Chao, et al.
Published: (2024)
Grassland: A Rapid Algebraic Modeling System for Million-variable Optimization
by: Li, Xihan, et al.
Published: (2021)
by: Li, Xihan, et al.
Published: (2021)
Multimodal Tabular Reasoning with Privileged Structured Information
by: Jiang, Jun-Peng, et al.
Published: (2025)
by: Jiang, Jun-Peng, et al.
Published: (2025)
Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?
by: Roberts, Jonathan, et al.
Published: (2024)
by: Roberts, Jonathan, et al.
Published: (2024)
AC-LoRA: Auto Component LoRA for Personalized Artistic Style Image Generation
by: Cui, Zhipu, et al.
Published: (2025)
by: Cui, Zhipu, et al.
Published: (2025)
SigStyle: Signature Style Transfer via Personalized Text-to-Image Models
by: Wang, Ye, et al.
Published: (2025)
by: Wang, Ye, et al.
Published: (2025)
Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens
by: Huang, Ting-Ji, et al.
Published: (2024)
by: Huang, Ting-Ji, et al.
Published: (2024)
SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets
by: Yang, Yuhang, et al.
Published: (2025)
by: Yang, Yuhang, et al.
Published: (2025)
SerialGen: Personalized Image Generation by First Standardization Then Personalization
by: Xie, Cong, et al.
Published: (2024)
by: Xie, Cong, et al.
Published: (2024)
Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning
by: Zhou, Da-Wei, et al.
Published: (2024)
by: Zhou, Da-Wei, et al.
Published: (2024)
PILOT: A Pre-Trained Model-Based Continual Learning Toolbox
by: Sun, Hai-Long, et al.
Published: (2023)
by: Sun, Hai-Long, et al.
Published: (2023)
Revisiting Nearest Neighbor for Tabular Data: A Deep Tabular Baseline Two Decades Later
by: Ye, Han-Jia, et al.
Published: (2024)
by: Ye, Han-Jia, et al.
Published: (2024)
Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation
by: He, Linda, et al.
Published: (2025)
by: He, Linda, et al.
Published: (2025)
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
by: Lee, Suhyeon, et al.
Published: (2023)
by: Lee, Suhyeon, et al.
Published: (2023)
Towards Scale-Aware Low-Light Enhancement via Structure-Guided Transformer Design
by: Dong, Wei, et al.
Published: (2025)
by: Dong, Wei, et al.
Published: (2025)
Bridging Text and Image for Artist Style Transfer via Contrastive Learning
by: Liu, Zhi-Song, et al.
Published: (2024)
by: Liu, Zhi-Song, et al.
Published: (2024)
External Knowledge Injection for CLIP-Based Class-Incremental Learning
by: Zhou, Da-Wei, et al.
Published: (2025)
by: Zhou, Da-Wei, et al.
Published: (2025)
Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging
by: Dai, Siyuan, et al.
Published: (2025)
by: Dai, Siyuan, et al.
Published: (2025)
Similar Items
-
Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation
by: Chen, Zhi-Kai, et al.
Published: (2025) -
Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing
by: Zhang, Yi-Kai, et al.
Published: (2025) -
TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering
by: Ji, An-Yang, et al.
Published: (2026) -
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters
by: Lab, Mind, et al.
Published: (2026) -
SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning
by: Xie, Zhen-Hao, et al.
Published: (2026)