:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Li, Zhuo, Du, Guodong, Shi, Zesheng, Guo, Weiyang, Yao, Weijun, Zhou, Yuan, Zhang, Jiabo, Li, Jing
Formato:	Preprint
Publicado:	2026
Materias:	Artificial Intelligence Machine Learning
Acceso en línea:	https://arxiv.org/abs/2605.22205
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Knowledge Fusion of Large Language Models Via Modular SkillPacks
por: Du, Guodong, et al.
Publicado: (2025)

Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward
por: Guo, Weiyang, et al.
Publicado: (2026)

Joint Identifiability of Cross-Domain Recommendation via Hierarchical Subspace Disentanglement
por: Du, Jing, et al.
Publicado: (2024)

Jailbreak-R1: Exploring the Jailbreak Capabilities of LLMs via Reinforcement Learning
por: Guo, Weiyang, et al.
Publicado: (2025)

Safety Alignment via Constrained Knowledge Unlearning
por: Shi, Zesheng, et al.
Publicado: (2025)

E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning
por: Guo, Weiyang, et al.
Publicado: (2026)

Multi-objective Large Language Model Alignment with Hierarchical Experts
por: Li, Zhuo, et al.
Publicado: (2025)

Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning
por: Zhou, Zhi, et al.
Publicado: (2025)

Safeguarding LLM Fine-tuning via Push-Pull Distributional Alignment
por: Wang, Haozhong, et al.
Publicado: (2026)

D-QRELO: Training- and Data-Free Delta Compression for Large Language Models via Quantization and Residual Low-Rank Approximation
por: Li, Junlin, et al.
Publicado: (2026)

POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation
por: Qiu, Zeju, et al.
Publicado: (2026)

Efficient Skill Discovery via Regret-Aware Optimization
por: Zhang, He, et al.
Publicado: (2025)

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
por: Zhou, Tianyi, et al.
Publicado: (2026)

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
por: Ding, Yiwen, et al.
Publicado: (2024)

Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs
por: Wang, Ruoyu, et al.
Publicado: (2024)

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning
por: Zhou, Zhi, et al.
Publicado: (2025)

FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference
por: Liu, Guangda, et al.
Publicado: (2025)

Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion
por: Li, Minghan, et al.
Publicado: (2026)

Reparameterized LLM Training via Orthogonal Equivalence Transformation
por: Qiu, Zeju, et al.
Publicado: (2025)

ViReSkill: Vision-Grounded Replanning with Skill Memory for LLM-Based Planning in Lifelong Robot Learning
por: Kagaya, Tomoyuki, et al.
Publicado: (2025)

Agentic Web: Weaving the Next Web with AI Agents
por: Yang, Yingxuan, et al.
Publicado: (2025)

Efficient Multi-objective Prompt Optimization via Pure-exploration Bandits
por: Li, Donghao, et al.
Publicado: (2026)

DOGMA: Weaving Structural Information into Data-centric Single-cell Transcriptomics Analysis
por: Zhang, Ru, et al.
Publicado: (2026)

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
por: Chen, Guoxuan, et al.
Publicado: (2024)

Reference-guided Policy Optimization for Molecular Optimization via LLM Reasoning
por: Li, Xuan, et al.
Publicado: (2026)

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning
por: Yang, Zhicheng, et al.
Publicado: (2026)

Personalized Federated Collaborative Filtering: A Variational AutoEncoder Approach
por: Li, Zhiwei, et al.
Publicado: (2024)

Amortized Reasoning Tree Search: Decoupling Proposal and Decision in Large Language Models
por: Hong, Zesheng, et al.
Publicado: (2026)

Efficient Traffic Forecasting on Large-Scale Road Network by Regularized Adaptive Graph Convolution
por: Wu, Kaiqi, et al.
Publicado: (2025)

Dynamic Model Merging Made Slim
por: Du, Guodong, et al.
Publicado: (2026)

Ability Transfer and Recovery via Modularized Parameters Localization
por: Jin, Songyao, et al.
Publicado: (2026)

Learning Versatile Skills with Curriculum Masking
por: Tang, Yao, et al.
Publicado: (2024)

Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles
por: Ni, Zhanghan, et al.
Publicado: (2026)

SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
por: Wen, Yongyan, et al.
Publicado: (2024)

Multi-Scale Dequant: Eliminating Dequantization Bottleneck via Activation Decomposition for Efficient LLM Inference
por: Zheng, Lingchao, et al.
Publicado: (2026)

DualWeaver: Synergistic Feature Weaving Surrogates for Multivariate Forecasting with Univariate Time Series Foundation Models
por: Li, Jinpeng, et al.
Publicado: (2026)

Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs
por: Li, Wu, et al.
Publicado: (2026)

Randomized Antipodal Search Done Right for Data Pareto Improvement of LLM Unlearning
por: Liu, Ziwen, et al.
Publicado: (2026)

SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories
por: Yu, Zhuoyun, et al.
Publicado: (2026)

Skill-R1: Agent Skill Evolution via Reinforcement Learning
por: Vishe, Yash, et al.
Publicado: (2026)