:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tan, Renxuan, Li, Rongpeng, Zhao, Zhifeng, Zhang, Honggang
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.05965
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LLM-Empowered Agentic MAC Protocols: A Dynamic Stackelberg Game Approach
by: Tan, Renxuan, et al.
Published: (2025)

Pareto Actor-Critic for Communication and Computation Co-Optimization in Non-Cooperative Federated Learning Services
by: Tan, Renxuan, et al.
Published: (2025)

Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game
by: Yuming, Xiang, et al.
Published: (2025)

Tool-Aided Evolutionary LLM for Generative Policy Toward Efficient Resource Management in Wireless Federated Learning
by: Tan, Chongyang, et al.
Published: (2025)

LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence
by: Tan, Renxuan, et al.
Published: (2025)

Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
by: Chen, Yuxuan, et al.
Published: (2024)

Topology-Assisted Spatio-Temporal Pattern Disentangling for Scalable MARL in Large-scale Autonomous Traffic Control
by: Li, Rongpeng, et al.
Published: (2025)

AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air
by: Yang, Shiyi, et al.
Published: (2025)

RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms
by: Wang, Ziyao, et al.
Published: (2025)

Snake Learning: A Communication- and Computation-Efficient Distributed Learning Framework for 6G
by: Yu, Xiaoxue, et al.
Published: (2024)

Select2Col: Leveraging Spatial-Temporal Importance of Semantic Information for Efficient Collaborative Perception
by: Liu, Yuntao, et al.
Published: (2023)

Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning
by: Geng, Wei, et al.
Published: (2023)

Beyond Preferences in AI Alignment
by: Zhi-Xuan, Tan, et al.
Published: (2024)

MERLOT: A Distilled LLM-based Mixture-of-Experts Framework for Scalable Encrypted Traffic Classification
by: Chen, Yuxuan, et al.
Published: (2024)

Reinforcement Learning-Based Heterogeneous Multi-Task Optimization in Semantic Broadcast Communications
by: Lu, Zhilin, et al.
Published: (2025)

Multi-Agent Conditional Diffusion Model with Mean Field Communication as Wireless Resource Allocation Planner
by: Meng, Kechen, et al.
Published: (2025)

Communications-Incentivized Collaborative Reasoning in NetGPT through Agentic Reinforcement Learning
by: Yu, Xiaoxue, et al.
Published: (2026)

Topology Data Analysis-based Error Detection for Semantic Image Transmission with Incremental Knowledge-based HARQ
by: Ni, Fei, et al.
Published: (2024)

Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation
by: Shin, Hyunjune, et al.
Published: (2024)

Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
by: Zhou, Zhanhui, et al.
Published: (2023)

Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
by: Kim, Dongyoung, et al.
Published: (2024)

NetWorld: Communication-Based Diffusion World Model for Multi-Agent Reinforcement Learning in Wireless Networks
by: Meng, Kechen, et al.
Published: (2026)

Pareto Multi-Objective Alignment for Language Models
by: He, Qiang, et al.
Published: (2025)

Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier
by: Badrinath, Anirudhan, et al.
Published: (2024)

Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
by: Zhang, Jianfei, et al.
Published: (2024)

Evolutionary Preference Sampling for Pareto Set Learning
by: Ye, Rongguang, et al.
Published: (2024)

Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion
by: Li, Minghan, et al.
Published: (2026)

Pareto-Optimal Learning from Preferences with Hidden Context
by: Bahlous-Boldi, Ryan, et al.
Published: (2024)

Adversarial Preference Learning for Robust LLM Alignment
by: Wang, Yuanfu, et al.
Published: (2025)

UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
by: Cheng, Zelei, et al.
Published: (2025)

Not All Preferences Are Created Equal: Stability-Aware and Gradient-Efficient Alignment for Reasoning Models
by: Wu, Hui, et al.
Published: (2026)

The Geometry of Compromise: Unlocking Generative Capabilities via Controllable Modality Alignment
by: Liu, Hongyuan, et al.
Published: (2026)

Beyond Single-Point Judgment: Distribution Alignment for LLM-as-a-Judge
by: Chen, Luyu, et al.
Published: (2025)

Beyond Consensus: Mitigating the Agreeableness Bias in LLM Judge Evaluations
by: Jain, Suryaansh, et al.
Published: (2025)

TODO: Enhancing LLM Alignment with Ternary Preferences
by: Guo, Yuxiang, et al.
Published: (2024)

HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents
by: Jin, Hongbo, et al.
Published: (2026)

Pareto Continual Learning: Preference-Conditioned Learning and Adaption for Dynamic Stability-Plasticity Trade-off
by: Lai, Song, et al.
Published: (2025)

Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback
by: Whitfill, Parker, et al.
Published: (2025)

In-Context Source and Channel Coding
by: Wang, Ziqiong, et al.
Published: (2026)

Select2Drive: Pragmatic Communications for Real-Time Collaborative Autonomous Driving
by: Huang, Jiahao, et al.
Published: (2025)