| Main Authors: | Tan, Renxuan, Li, Rongpeng, Zhao, Zhifeng, Zhang, Honggang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.05965 |
Similar Items
LLM-Empowered Agentic MAC Protocols: A Dynamic Stackelberg Game Approach
by: Tan, Renxuan, et al.
Published: (2025)
Pareto Actor-Critic for Communication and Computation Co-Optimization in Non-Cooperative Federated Learning Services
by: Tan, Renxuan, et al.
Published: (2025)
Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game
by: Yuming, Xiang, et al.
Published: (2025)
Tool-Aided Evolutionary LLM for Generative Policy Toward Efficient Resource Management in Wireless Federated Learning
by: Tan, Chongyang, et al.
Published: (2025)
LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence
by: Tan, Renxuan, et al.
Published: (2025)
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
by: Chen, Yuxuan, et al.
Published: (2024)
Topology-Assisted Spatio-Temporal Pattern Disentangling for Scalable MARL in Large-scale Autonomous Traffic Control
by: Li, Rongpeng, et al.
Published: (2025)
AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air
by: Yang, Shiyi, et al.
Published: (2025)
RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms
by: Wang, Ziyao, et al.
Published: (2025)
Snake Learning: A Communication- and Computation-Efficient Distributed Learning Framework for 6G
by: Yu, Xiaoxue, et al.
Published: (2024)
Select2Col: Leveraging Spatial-Temporal Importance of Semantic Information for Efficient Collaborative Perception
by: Liu, Yuntao, et al.
Published: (2023)
Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning
by: Geng, Wei, et al.
Published: (2023)
Beyond Preferences in AI Alignment
by: Zhi-Xuan, Tan, et al.
Published: (2024)
MERLOT: A Distilled LLM-based Mixture-of-Experts Framework for Scalable Encrypted Traffic Classification
by: Chen, Yuxuan, et al.
Published: (2024)
Reinforcement Learning-Based Heterogeneous Multi-Task Optimization in Semantic Broadcast Communications
by: Lu, Zhilin, et al.
Published: (2025)
Multi-Agent Conditional Diffusion Model with Mean Field Communication as Wireless Resource Allocation Planner
by: Meng, Kechen, et al.
Published: (2025)
Communications-Incentivized Collaborative Reasoning in NetGPT through Agentic Reinforcement Learning
by: Yu, Xiaoxue, et al.
Published: (2026)
Topology Data Analysis-based Error Detection for Semantic Image Transmission with Incremental Knowledge-based HARQ
by: Ni, Fei, et al.
Published: (2024)
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation
by: Shin, Hyunjune, et al.
Published: (2024)
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
by: Zhou, Zhanhui, et al.
Published: (2023)
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
by: Kim, Dongyoung, et al.
Published: (2024)
NetWorld: Communication-Based Diffusion World Model for Multi-Agent Reinforcement Learning in Wireless Networks
by: Meng, Kechen, et al.
Published: (2026)
Pareto Multi-Objective Alignment for Language Models
by: He, Qiang, et al.
Published: (2025)
Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier
by: Badrinath, Anirudhan, et al.
Published: (2024)
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
by: Zhang, Jianfei, et al.
Published: (2024)
Evolutionary Preference Sampling for Pareto Set Learning
by: Ye, Rongguang, et al.
Published: (2024)
Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion
by: Li, Minghan, et al.
Published: (2026)
Pareto-Optimal Learning from Preferences with Hidden Context
by: Bahlous-Boldi, Ryan, et al.
Published: (2024)
Adversarial Preference Learning for Robust LLM Alignment
by: Wang, Yuanfu, et al.
Published: (2025)
UC-MOA: Utility-Conditioned Multi-Objective Alignment for Distributional Pareto-Optimality
by: Cheng, Zelei, et al.
Published: (2025)
Not All Preferences Are Created Equal: Stability-Aware and Gradient-Efficient Alignment for Reasoning Models
by: Wu, Hui, et al.
Published: (2026)
The Geometry of Compromise: Unlocking Generative Capabilities via Controllable Modality Alignment
by: Liu, Hongyuan, et al.
Published: (2026)
Beyond Single-Point Judgment: Distribution Alignment for LLM-as-a-Judge
by: Chen, Luyu, et al.
Published: (2025)
Beyond Consensus: Mitigating the Agreeableness Bias in LLM Judge Evaluations
by: Jain, Suryaansh, et al.
Published: (2025)
TODO: Enhancing LLM Alignment with Ternary Preferences
by: Guo, Yuxiang, et al.
Published: (2024)
HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents
by: Jin, Hongbo, et al.
Published: (2026)
Pareto Continual Learning: Preference-Conditioned Learning and Adaption for Dynamic Stability-Plasticity Trade-off
by: Lai, Song, et al.
Published: (2025)
Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback
by: Whitfill, Parker, et al.
Published: (2025)
In-Context Source and Channel Coding
by: Wang, Ziqiong, et al.
Published: (2026)
Select2Drive: Pragmatic Communications for Real-Time Collaborative Autonomous Driving
by: Huang, Jiahao, et al.
Published: (2025)