Saved in:
| Main Authors: | Shu, Youwei, Zheng, Shaomian, Jin, Dingnan, Qu, Wenjie, Guo, Ziyao, Cui, Qing, Zhou, Jun, Zhang, Jiaheng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.13773 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging
by: Yang, Jinluan, et al.
Published: (2025)
by: Yang, Jinluan, et al.
Published: (2025)
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning
by: Zhang, Jia, et al.
Published: (2025)
by: Zhang, Jia, et al.
Published: (2025)
ProDS: Preference-oriented Data Selection for Instruction Tuning
by: Guo, Wenya, et al.
Published: (2025)
by: Guo, Wenya, et al.
Published: (2025)
Instruction Mining: Instruction Data Selection for Tuning Large Language Models
by: Cao, Yihan, et al.
Published: (2023)
by: Cao, Yihan, et al.
Published: (2023)
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
by: Li, Ming, et al.
Published: (2024)
by: Li, Ming, et al.
Published: (2024)
Self-Sovereign Agent
by: Qu, Wenjie, et al.
Published: (2026)
by: Qu, Wenjie, et al.
Published: (2026)
Large-Scale Data Selection for Instruction Tuning
by: Ivison, Hamish, et al.
Published: (2025)
by: Ivison, Hamish, et al.
Published: (2025)
Learning to Attack: Uncovering Privacy Risks in Sequential Data Releases
by: Cui, Ziyao, et al.
Published: (2025)
by: Cui, Ziyao, et al.
Published: (2025)
T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning
by: Fu, Yanjun, et al.
Published: (2025)
by: Fu, Yanjun, et al.
Published: (2025)
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection
by: Yan, Jun, et al.
Published: (2023)
by: Yan, Jun, et al.
Published: (2023)
LESS: Selecting Influential Data for Targeted Instruction Tuning
by: Xia, Mengzhou, et al.
Published: (2024)
by: Xia, Mengzhou, et al.
Published: (2024)
Generative Representational Instruction Tuning
by: Muennighoff, Niklas, et al.
Published: (2024)
by: Muennighoff, Niklas, et al.
Published: (2024)
TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
by: Zhang, Jipeng, et al.
Published: (2024)
by: Zhang, Jipeng, et al.
Published: (2024)
DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
by: Wu, Linyu, et al.
Published: (2025)
by: Wu, Linyu, et al.
Published: (2025)
Diversity Measurement and Subset Selection for Instruction Tuning Datasets
by: Wang, Peiqi, et al.
Published: (2024)
by: Wang, Peiqi, et al.
Published: (2024)
LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning
by: Lin, Xiaotian, et al.
Published: (2025)
by: Lin, Xiaotian, et al.
Published: (2025)
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
by: You, Zebin, et al.
Published: (2025)
by: You, Zebin, et al.
Published: (2025)
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
by: Tang, Zhengyang, et al.
Published: (2024)
by: Tang, Zhengyang, et al.
Published: (2024)
A Unified Graph Language Model for Multi-Domain Multi-Task Graph Alignment Instruction Tuning
by: Chen, Haibo, et al.
Published: (2026)
by: Chen, Haibo, et al.
Published: (2026)
Parameter-Efficient Tuning Large Language Models for Graph Representation Learning
by: Zhu, Qi, et al.
Published: (2024)
by: Zhu, Qi, et al.
Published: (2024)
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
by: Liu, Youwei, et al.
Published: (2026)
by: Liu, Youwei, et al.
Published: (2026)
Beyond the Laplacian: Interpolated Spectral Augmentation for Graph Neural Networks
by: Cui, Ziyao, et al.
Published: (2025)
by: Cui, Ziyao, et al.
Published: (2025)
How Is Uncertainty Propagated in Knowledge Distillation?
by: Cui, Ziyao, et al.
Published: (2026)
by: Cui, Ziyao, et al.
Published: (2026)
SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning
by: He, Yexiao, et al.
Published: (2024)
by: He, Yexiao, et al.
Published: (2024)
DiffScore: Text Evaluation Beyond Autoregressive Likelihood
by: Lai, Wen, et al.
Published: (2026)
by: Lai, Wen, et al.
Published: (2026)
SGPT: Few-Shot Prompt Tuning for Signed Graphs
by: Zhai, Zian, et al.
Published: (2024)
by: Zhai, Zian, et al.
Published: (2024)
A Bayesian Hybrid Parameter-Efficient Fine-Tuning Method for Large Language Models
by: Chai, Yidong, et al.
Published: (2025)
by: Chai, Yidong, et al.
Published: (2025)
Task-Specific Data Selection for Instruction Tuning via Monosemantic Neuronal Activations
by: Ma, Da, et al.
Published: (2025)
by: Ma, Da, et al.
Published: (2025)
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
by: Safaei, Bardia, et al.
Published: (2025)
by: Safaei, Bardia, et al.
Published: (2025)
Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning
by: Yuan, Zhihang, et al.
Published: (2026)
by: Yuan, Zhihang, et al.
Published: (2026)
Parrot: Multilingual Visual Instruction Tuning
by: Sun, Hai-Long, et al.
Published: (2024)
by: Sun, Hai-Long, et al.
Published: (2024)
Dynamic Bayesian Optimization Framework for Instruction Tuning in Partial Differential Equation Discovery
by: Qu, Junqi, et al.
Published: (2025)
by: Qu, Junqi, et al.
Published: (2025)
Reconstructive Visual Instruction Tuning
by: Wang, Haochen, et al.
Published: (2024)
by: Wang, Haochen, et al.
Published: (2024)
Sparsity-Aware Low-Rank Representation for Efficient Fine-Tuning of Large Language Models
by: Zhang, Longteng, et al.
Published: (2026)
by: Zhang, Longteng, et al.
Published: (2026)
Federated Data-Efficient Instruction Tuning for Large Language Models
by: Qin, Zhen, et al.
Published: (2024)
by: Qin, Zhen, et al.
Published: (2024)
Echoes within the Reasoning: Stealthy and Effective Watermarking via Chain of Thought
by: Lu, Jiacheng, et al.
Published: (2026)
by: Lu, Jiacheng, et al.
Published: (2026)
CGCOD: Class-Guided Camouflaged Object Detection
by: Zhang, Chenxi, et al.
Published: (2024)
by: Zhang, Chenxi, et al.
Published: (2024)
EDBench: Large-Scale Electron Density Data for Molecular Modeling
by: Xiang, Hongxin, et al.
Published: (2025)
by: Xiang, Hongxin, et al.
Published: (2025)
Diffusion Instruction Tuning
by: Jin, Chen, et al.
Published: (2025)
by: Jin, Chen, et al.
Published: (2025)
Scaling Laws are Redundancy Laws
by: Bi, Yuda, et al.
Published: (2025)
by: Bi, Yuda, et al.
Published: (2025)
Similar Items
-
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging
by: Yang, Jinluan, et al.
Published: (2025) -
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning
by: Zhang, Jia, et al.
Published: (2025) -
ProDS: Preference-oriented Data Selection for Instruction Tuning
by: Guo, Wenya, et al.
Published: (2025) -
Instruction Mining: Instruction Data Selection for Tuning Large Language Models
by: Cao, Yihan, et al.
Published: (2023) -
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
by: Li, Ming, et al.
Published: (2024)