Guardado en:
| Autores principales: | Li, Zhuo, Du, Guodong, Shi, Zesheng, Guo, Weiyang, Yao, Weijun, Zhou, Yuan, Zhang, Jiabo, Li, Jing |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2605.22205 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Knowledge Fusion of Large Language Models Via Modular SkillPacks
por: Du, Guodong, et al.
Publicado: (2025)
por: Du, Guodong, et al.
Publicado: (2025)
Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward
por: Guo, Weiyang, et al.
Publicado: (2026)
por: Guo, Weiyang, et al.
Publicado: (2026)
Joint Identifiability of Cross-Domain Recommendation via Hierarchical Subspace Disentanglement
por: Du, Jing, et al.
Publicado: (2024)
por: Du, Jing, et al.
Publicado: (2024)
Jailbreak-R1: Exploring the Jailbreak Capabilities of LLMs via Reinforcement Learning
por: Guo, Weiyang, et al.
Publicado: (2025)
por: Guo, Weiyang, et al.
Publicado: (2025)
Safety Alignment via Constrained Knowledge Unlearning
por: Shi, Zesheng, et al.
Publicado: (2025)
por: Shi, Zesheng, et al.
Publicado: (2025)
E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning
por: Guo, Weiyang, et al.
Publicado: (2026)
por: Guo, Weiyang, et al.
Publicado: (2026)
Multi-objective Large Language Model Alignment with Hierarchical Experts
por: Li, Zhuo, et al.
Publicado: (2025)
por: Li, Zhuo, et al.
Publicado: (2025)
Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning
por: Zhou, Zhi, et al.
Publicado: (2025)
por: Zhou, Zhi, et al.
Publicado: (2025)
Safeguarding LLM Fine-tuning via Push-Pull Distributional Alignment
por: Wang, Haozhong, et al.
Publicado: (2026)
por: Wang, Haozhong, et al.
Publicado: (2026)
D-QRELO: Training- and Data-Free Delta Compression for Large Language Models via Quantization and Residual Low-Rank Approximation
por: Li, Junlin, et al.
Publicado: (2026)
por: Li, Junlin, et al.
Publicado: (2026)
POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation
por: Qiu, Zeju, et al.
Publicado: (2026)
por: Qiu, Zeju, et al.
Publicado: (2026)
Efficient Skill Discovery via Regret-Aware Optimization
por: Zhang, He, et al.
Publicado: (2025)
por: Zhang, He, et al.
Publicado: (2025)
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
por: Zhou, Tianyi, et al.
Publicado: (2026)
por: Zhou, Tianyi, et al.
Publicado: (2026)
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
por: Ding, Yiwen, et al.
Publicado: (2024)
por: Ding, Yiwen, et al.
Publicado: (2024)
Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs
por: Wang, Ruoyu, et al.
Publicado: (2024)
por: Wang, Ruoyu, et al.
Publicado: (2024)
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning
por: Zhou, Zhi, et al.
Publicado: (2025)
por: Zhou, Zhi, et al.
Publicado: (2025)
FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference
por: Liu, Guangda, et al.
Publicado: (2025)
por: Liu, Guangda, et al.
Publicado: (2025)
Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion
por: Li, Minghan, et al.
Publicado: (2026)
por: Li, Minghan, et al.
Publicado: (2026)
Reparameterized LLM Training via Orthogonal Equivalence Transformation
por: Qiu, Zeju, et al.
Publicado: (2025)
por: Qiu, Zeju, et al.
Publicado: (2025)
ViReSkill: Vision-Grounded Replanning with Skill Memory for LLM-Based Planning in Lifelong Robot Learning
por: Kagaya, Tomoyuki, et al.
Publicado: (2025)
por: Kagaya, Tomoyuki, et al.
Publicado: (2025)
Agentic Web: Weaving the Next Web with AI Agents
por: Yang, Yingxuan, et al.
Publicado: (2025)
por: Yang, Yingxuan, et al.
Publicado: (2025)
Efficient Multi-objective Prompt Optimization via Pure-exploration Bandits
por: Li, Donghao, et al.
Publicado: (2026)
por: Li, Donghao, et al.
Publicado: (2026)
DOGMA: Weaving Structural Information into Data-centric Single-cell Transcriptomics Analysis
por: Zhang, Ru, et al.
Publicado: (2026)
por: Zhang, Ru, et al.
Publicado: (2026)
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
por: Chen, Guoxuan, et al.
Publicado: (2024)
por: Chen, Guoxuan, et al.
Publicado: (2024)
Reference-guided Policy Optimization for Molecular Optimization via LLM Reasoning
por: Li, Xuan, et al.
Publicado: (2026)
por: Li, Xuan, et al.
Publicado: (2026)
Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning
por: Yang, Zhicheng, et al.
Publicado: (2026)
por: Yang, Zhicheng, et al.
Publicado: (2026)
Personalized Federated Collaborative Filtering: A Variational AutoEncoder Approach
por: Li, Zhiwei, et al.
Publicado: (2024)
por: Li, Zhiwei, et al.
Publicado: (2024)
Amortized Reasoning Tree Search: Decoupling Proposal and Decision in Large Language Models
por: Hong, Zesheng, et al.
Publicado: (2026)
por: Hong, Zesheng, et al.
Publicado: (2026)
Efficient Traffic Forecasting on Large-Scale Road Network by Regularized Adaptive Graph Convolution
por: Wu, Kaiqi, et al.
Publicado: (2025)
por: Wu, Kaiqi, et al.
Publicado: (2025)
Dynamic Model Merging Made Slim
por: Du, Guodong, et al.
Publicado: (2026)
por: Du, Guodong, et al.
Publicado: (2026)
Ability Transfer and Recovery via Modularized Parameters Localization
por: Jin, Songyao, et al.
Publicado: (2026)
por: Jin, Songyao, et al.
Publicado: (2026)
Learning Versatile Skills with Curriculum Masking
por: Tang, Yao, et al.
Publicado: (2024)
por: Tang, Yao, et al.
Publicado: (2024)
Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles
por: Ni, Zhanghan, et al.
Publicado: (2026)
por: Ni, Zhanghan, et al.
Publicado: (2026)
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
por: Wen, Yongyan, et al.
Publicado: (2024)
por: Wen, Yongyan, et al.
Publicado: (2024)
Multi-Scale Dequant: Eliminating Dequantization Bottleneck via Activation Decomposition for Efficient LLM Inference
por: Zheng, Lingchao, et al.
Publicado: (2026)
por: Zheng, Lingchao, et al.
Publicado: (2026)
DualWeaver: Synergistic Feature Weaving Surrogates for Multivariate Forecasting with Univariate Time Series Foundation Models
por: Li, Jinpeng, et al.
Publicado: (2026)
por: Li, Jinpeng, et al.
Publicado: (2026)
Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs
por: Li, Wu, et al.
Publicado: (2026)
por: Li, Wu, et al.
Publicado: (2026)
Randomized Antipodal Search Done Right for Data Pareto Improvement of LLM Unlearning
por: Liu, Ziwen, et al.
Publicado: (2026)
por: Liu, Ziwen, et al.
Publicado: (2026)
SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories
por: Yu, Zhuoyun, et al.
Publicado: (2026)
por: Yu, Zhuoyun, et al.
Publicado: (2026)
Skill-R1: Agent Skill Evolution via Reinforcement Learning
por: Vishe, Yash, et al.
Publicado: (2026)
por: Vishe, Yash, et al.
Publicado: (2026)
Ejemplares similares
-
Knowledge Fusion of Large Language Models Via Modular SkillPacks
por: Du, Guodong, et al.
Publicado: (2025) -
Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward
por: Guo, Weiyang, et al.
Publicado: (2026) -
Joint Identifiability of Cross-Domain Recommendation via Hierarchical Subspace Disentanglement
por: Du, Jing, et al.
Publicado: (2024) -
Jailbreak-R1: Exploring the Jailbreak Capabilities of LLMs via Reinforcement Learning
por: Guo, Weiyang, et al.
Publicado: (2025) -
Safety Alignment via Constrained Knowledge Unlearning
por: Shi, Zesheng, et al.
Publicado: (2025)