Saved in:
| Main Authors: | Wang, Wenqing, Ali, Muhammad Asif, Shoker, Ali, Yang, Ruohan, Chen, Junyang, Sha, Ying, Wang, Huan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.10077 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CRASH: Cognitive Reasoning Agent for Safety Hazards in Autonomous Driving
by: Silva, Erick, et al.
Published: (2026)
by: Silva, Erick, et al.
Published: (2026)
NILC: Discovering New Intents with LLM-assisted Clustering
by: Wang, Hongtao, et al.
Published: (2025)
by: Wang, Hongtao, et al.
Published: (2025)
IPO: Your Language Model is Secretly a Preference Classifier
by: Garg, Shivank, et al.
Published: (2025)
by: Garg, Shivank, et al.
Published: (2025)
Antonym vs Synonym Distinction using InterlaCed Encoder NETworks (ICE-NET)
by: Ali, Muhammad Asif, et al.
Published: (2024)
by: Ali, Muhammad Asif, et al.
Published: (2024)
MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions
by: Yang, Shu, et al.
Published: (2024)
by: Yang, Shu, et al.
Published: (2024)
MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning
by: Yang, Shu, et al.
Published: (2024)
by: Yang, Shu, et al.
Published: (2024)
MQA-KEAL: Multi-hop Question Answering under Knowledge Editing for Arabic Language
by: Ali, Muhammad Asif, et al.
Published: (2024)
by: Ali, Muhammad Asif, et al.
Published: (2024)
IPO: Interpretable Prompt Optimization for Vision-Language Models
by: Du, Yingjun, et al.
Published: (2024)
by: Du, Yingjun, et al.
Published: (2024)
IPO-Mine: A Toolkit and Dataset for Section-Structured Analysis of Long, Multimodal IPO Documents
by: Galarnyk, Michael, et al.
Published: (2026)
by: Galarnyk, Michael, et al.
Published: (2026)
VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization
by: Li, Yunxin, et al.
Published: (2025)
by: Li, Yunxin, et al.
Published: (2025)
Plug-and-Play Training Framework for Preference Optimization
by: Ma, Jingyuan, et al.
Published: (2024)
by: Ma, Jingyuan, et al.
Published: (2024)
Intent-driven In-context Learning for Few-shot Dialogue State Tracking
by: Yi, Zihao, et al.
Published: (2024)
by: Yi, Zihao, et al.
Published: (2024)
Self-Steering Optimization: Autonomous Preference Optimization for Large Language Models
by: Xiang, Hao, et al.
Published: (2024)
by: Xiang, Hao, et al.
Published: (2024)
AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
by: Wu, Junkang, et al.
Published: (2024)
Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework
by: Wei, Xiao, et al.
Published: (2025)
by: Wei, Xiao, et al.
Published: (2025)
Adaptive Preference Optimization with Uncertainty-aware Utility Anchor
by: Wang, Xiaobo, et al.
Published: (2025)
by: Wang, Xiaobo, et al.
Published: (2025)
From Multimodal Perception to Strategic Reasoning: A Survey on AI-Generated Game Commentary
by: Zheng, Qirui, et al.
Published: (2025)
by: Zheng, Qirui, et al.
Published: (2025)
AIPO: Improving Training Objective for Iterative Preference Optimization
by: Shen, Yaojie, et al.
Published: (2024)
by: Shen, Yaojie, et al.
Published: (2024)
Soft Adaptive Policy Optimization
by: Gao, Chang, et al.
Published: (2025)
by: Gao, Chang, et al.
Published: (2025)
ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning
by: Wang, Yeyuan, et al.
Published: (2025)
by: Wang, Yeyuan, et al.
Published: (2025)
Adaptive Decoding via Latent Preference Optimization
by: Dhuliawala, Shehzaad, et al.
Published: (2024)
by: Dhuliawala, Shehzaad, et al.
Published: (2024)
Evaluating and Aligning CodeLLMs on Human Preference
by: Yang, Jian, et al.
Published: (2024)
by: Yang, Jian, et al.
Published: (2024)
Understanding Reasoning in Chain-of-Thought from the Hopfieldian View
by: Hu, Lijie, et al.
Published: (2024)
by: Hu, Lijie, et al.
Published: (2024)
A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
by: Hu, Lijie, et al.
Published: (2024)
by: Hu, Lijie, et al.
Published: (2024)
Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs
by: Yang, Shu, et al.
Published: (2024)
by: Yang, Shu, et al.
Published: (2024)
Can LLMs Capture Human Preferences?
by: Goli, Ali, et al.
Published: (2023)
by: Goli, Ali, et al.
Published: (2023)
Multi-Granularity Open Intent Classification via Adaptive Granular-Ball Decision Boundary
by: Li, Yanhua, et al.
Published: (2024)
by: Li, Yanhua, et al.
Published: (2024)
Deep Learning Approaches for Multimodal Intent Recognition: A Survey
by: Zhao, Jingwei, et al.
Published: (2025)
by: Zhao, Jingwei, et al.
Published: (2025)
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
by: Abbasli, Toghrul, et al.
Published: (2025)
by: Abbasli, Toghrul, et al.
Published: (2025)
Just a Scratch: Enhancing LLM Capabilities for Self-harm Detection through Intent Differentiation and Emoji Interpretation
by: Ghosh, Soumitra, et al.
Published: (2025)
by: Ghosh, Soumitra, et al.
Published: (2025)
Interpersonal Memory Matters: A New Task for Proactive Dialogue Utilizing Conversational History
by: Wu, Bowen, et al.
Published: (2025)
by: Wu, Bowen, et al.
Published: (2025)
CODEMENV: Benchmarking Large Language Models on Code Migration
by: Cheng, Keyuan, et al.
Published: (2025)
by: Cheng, Keyuan, et al.
Published: (2025)
Preference Optimization with Multi-Sample Comparisons
by: Wang, Chaoqi, et al.
Published: (2024)
by: Wang, Chaoqi, et al.
Published: (2024)
No Preference Left Behind: Group Distributional Preference Optimization
by: Yao, Binwei, et al.
Published: (2024)
by: Yao, Binwei, et al.
Published: (2024)
Direct Judgement Preference Optimization
by: Wang, Peifeng, et al.
Published: (2024)
by: Wang, Peifeng, et al.
Published: (2024)
Dual-IPO: Dual-Iterative Preference Optimization for Text-to-Video Generation
by: Yang, Xiaomeng, et al.
Published: (2025)
by: Yang, Xiaomeng, et al.
Published: (2025)
GRAVITY: A Framework for Personalized Text Generation via Profile-Grounded Synthetic Preferences
by: Dey, Priyanka, et al.
Published: (2025)
by: Dey, Priyanka, et al.
Published: (2025)
Token-level Direct Preference Optimization
by: Zeng, Yongcheng, et al.
Published: (2024)
by: Zeng, Yongcheng, et al.
Published: (2024)
The Fellowship of the LLMs: Multi-Model Workflows for Synthetic Preference Optimization Dataset Generation
by: Arif, Samee, et al.
Published: (2024)
by: Arif, Samee, et al.
Published: (2024)
Segment-Level Coherence for Robust Harmful Intent Probing in LLMs
by: He, Xuanli, et al.
Published: (2026)
by: He, Xuanli, et al.
Published: (2026)
Similar Items
-
CRASH: Cognitive Reasoning Agent for Safety Hazards in Autonomous Driving
by: Silva, Erick, et al.
Published: (2026) -
NILC: Discovering New Intents with LLM-assisted Clustering
by: Wang, Hongtao, et al.
Published: (2025) -
IPO: Your Language Model is Secretly a Preference Classifier
by: Garg, Shivank, et al.
Published: (2025) -
Antonym vs Synonym Distinction using InterlaCed Encoder NETworks (ICE-NET)
by: Ali, Muhammad Asif, et al.
Published: (2024) -
MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions
by: Yang, Shu, et al.
Published: (2024)