Saved in:
| Main Authors: | Wu, Yusen, Liu, Yiran, Deng, Xiaotie |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.17694 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
HCAG: Hierarchical Abstraction and Retrieval-Augmented Generation on Theoretical Repositories with LLMs
by: Wu, Yusen, et al.
Published: (2026)
by: Wu, Yusen, et al.
Published: (2026)
Implementing Long Text Style Transfer with LLMs through Dual-Layered Sentence and Paragraph Structure Extraction and Mapping
by: Wu, Yusen, et al.
Published: (2025)
by: Wu, Yusen, et al.
Published: (2025)
How Social is It? A Benchmark for LLMs' Capabilities in Multi-user Multi-turn Social Agent Tasks
by: Wu, Yusen, et al.
Published: (2025)
by: Wu, Yusen, et al.
Published: (2025)
DeepRule: An Integrated Framework for Automated Business Rule Generation via Deep Predictive Modeling and Hybrid Search Optimization
by: Wu, Yusen, et al.
Published: (2025)
by: Wu, Yusen, et al.
Published: (2025)
Hummer: Towards Limited Competitive Preference Dataset
by: Jiang, Li, et al.
Published: (2024)
by: Jiang, Li, et al.
Published: (2024)
Game Theory Meets Large Language Models: A Systematic Survey with Taxonomy and New Frontiers
by: Sun, Haoran, et al.
Published: (2025)
by: Sun, Haoran, et al.
Published: (2025)
How Large Language Models Need Symbolism
by: Deng, Xiaotie, et al.
Published: (2025)
by: Deng, Xiaotie, et al.
Published: (2025)
Will AI Trade? A Computational Inversion of the No-Trade Theorem
by: Li, Hanyu, et al.
Published: (2025)
by: Li, Hanyu, et al.
Published: (2025)
An Information-Theoretic Criterion for Efficient Data Synthesis
by: Li, Hanyu, et al.
Published: (2026)
by: Li, Hanyu, et al.
Published: (2026)
Meta-Aligner: Bidirectional Preference-Policy Optimization for Multi-Objective LLMs Alignment
by: Xu, Wenzhe, et al.
Published: (2026)
by: Xu, Wenzhe, et al.
Published: (2026)
IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis
by: Li, Hanyu, et al.
Published: (2025)
by: Li, Hanyu, et al.
Published: (2025)
When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop
by: Zhang, Yang, et al.
Published: (2026)
by: Zhang, Yang, et al.
Published: (2026)
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
by: Zhang, Shenao, et al.
Published: (2024)
by: Zhang, Shenao, et al.
Published: (2024)
The Sandbox Configurator: A Framework to Support Technical Assessment in AI Regulatory Sandboxes
by: Buscemi, Alessio, et al.
Published: (2025)
by: Buscemi, Alessio, et al.
Published: (2025)
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
by: Wang, Haoxiang, et al.
Published: (2024)
by: Wang, Haoxiang, et al.
Published: (2024)
PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference
by: Ji, Jiaming, et al.
Published: (2024)
by: Ji, Jiaming, et al.
Published: (2024)
Grounding Natural Language for Multi-agent Decision-Making with Multi-agentic LLMs
by: Huh, Dom, et al.
Published: (2025)
by: Huh, Dom, et al.
Published: (2025)
Implementing Rational Choice Functions with LLMs and Measuring their Alignment with User Preferences
by: Karnysheva, Anna, et al.
Published: (2025)
by: Karnysheva, Anna, et al.
Published: (2025)
Multi-Value Alignment for LLMs via Value Decorrelation and Extrapolation
by: Xu, Hefei, et al.
Published: (2025)
by: Xu, Hefei, et al.
Published: (2025)
Coverage-based Fairness in Multi-document Summarization
by: Li, Haoyuan, et al.
Published: (2024)
by: Li, Haoyuan, et al.
Published: (2024)
More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
by: Wang, Yifan, et al.
Published: (2025)
by: Wang, Yifan, et al.
Published: (2025)
Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities
by: Ying, Shuangshuang, et al.
Published: (2026)
by: Ying, Shuangshuang, et al.
Published: (2026)
A Scalable Neural Network for DSIC Affine Maximizer Auction Design
by: Duan, Zhijian, et al.
Published: (2023)
by: Duan, Zhijian, et al.
Published: (2023)
Can LLMs Help You at Work? A Sandbox for Evaluating LLM Agents in Enterprise Environments
by: Vishwakarma, Harsh, et al.
Published: (2025)
by: Vishwakarma, Harsh, et al.
Published: (2025)
Sample Efficient Preference Alignment in LLMs via Active Exploration
by: Mehta, Viraj, et al.
Published: (2023)
by: Mehta, Viraj, et al.
Published: (2023)
Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
by: Zhang, Yichi, et al.
Published: (2023)
by: Zhang, Yichi, et al.
Published: (2023)
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
by: Zhou, Zhanhui, et al.
Published: (2023)
by: Zhou, Zhanhui, et al.
Published: (2023)
Discovering Expert-Level Nash Equilibrium Algorithms with Large Language Models
by: Li, Hanyu, et al.
Published: (2025)
by: Li, Hanyu, et al.
Published: (2025)
MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection
by: Liu, Ziyan, et al.
Published: (2025)
by: Liu, Ziyan, et al.
Published: (2025)
Improving VTE Identification through Language Models from Radiology Reports: A Comparative Study of Mamba, Phi-3 Mini, and BERT
by: Deng, Jamie, et al.
Published: (2024)
by: Deng, Jamie, et al.
Published: (2024)
Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs
by: Wachter, Jasmin, et al.
Published: (2025)
by: Wachter, Jasmin, et al.
Published: (2025)
Property-driven Protein Inverse Folding With Multi-Objective Preference Alignment
by: Hou, Xiaoyang, et al.
Published: (2026)
by: Hou, Xiaoyang, et al.
Published: (2026)
Aligning CodeLLMs with Direct Preference Optimization
by: Miao, Yibo, et al.
Published: (2024)
by: Miao, Yibo, et al.
Published: (2024)
Computational Challenges in Token Economics: Bridging Economic Theory and AI System Design
by: Wu, Ou, et al.
Published: (2026)
by: Wu, Ou, et al.
Published: (2026)
Crab: A Semantics-Aware Checkpoint/Restore Runtime for Agent Sandboxes
by: Wu, Tianyuan, et al.
Published: (2026)
by: Wu, Tianyuan, et al.
Published: (2026)
A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs
by: Srewa, Mahmoud, et al.
Published: (2025)
by: Srewa, Mahmoud, et al.
Published: (2025)
A General Benchmark Framework is Dynamic Graph Neural Network Need
by: Zhang, Yusen
Published: (2024)
by: Zhang, Yusen
Published: (2024)
TinyAlign: Boosting Lightweight Vision-Language Models by Mitigating Modal Alignment Bottlenecks
by: Hu, Yuanze, et al.
Published: (2025)
by: Hu, Yuanze, et al.
Published: (2025)
APPA: Adaptive Preference Pluralistic Alignment for Fair Federated RLHF of LLMs
by: Srewa, Mahmoud, et al.
Published: (2026)
by: Srewa, Mahmoud, et al.
Published: (2026)
When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger
by: Afzali, Amirabbas, et al.
Published: (2026)
by: Afzali, Amirabbas, et al.
Published: (2026)
Similar Items
-
HCAG: Hierarchical Abstraction and Retrieval-Augmented Generation on Theoretical Repositories with LLMs
by: Wu, Yusen, et al.
Published: (2026) -
Implementing Long Text Style Transfer with LLMs through Dual-Layered Sentence and Paragraph Structure Extraction and Mapping
by: Wu, Yusen, et al.
Published: (2025) -
How Social is It? A Benchmark for LLMs' Capabilities in Multi-user Multi-turn Social Agent Tasks
by: Wu, Yusen, et al.
Published: (2025) -
DeepRule: An Integrated Framework for Automated Business Rule Generation via Deep Predictive Modeling and Hybrid Search Optimization
by: Wu, Yusen, et al.
Published: (2025) -
Hummer: Towards Limited Competitive Preference Dataset
by: Jiang, Li, et al.
Published: (2024)