Library Catalog

Bibliographic Details
Main Authors:	Wang, Wenqing; Ali, Muhammad Asif; Shoker, Ali; Yang, Ruohan; Chen, Junyang; Sha, Ying; Wang, Huan
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2510.10077

Similar Items

CRASH: Cognitive Reasoning Agent for Safety Hazards in Autonomous Driving
by: Silva, Erick, et al.
Published: (2026)

NILC: Discovering New Intents with LLM-assisted Clustering
by: Wang, Hongtao, et al.
Published: (2025)

IPO: Your Language Model is Secretly a Preference Classifier
by: Garg, Shivank, et al.
Published: (2025)

Antonym vs Synonym Distinction using InterlaCed Encoder NETworks (ICE-NET)
by: Ali, Muhammad Asif, et al.
Published: (2024)

MONAL: Model Autophagy Analysis for Modeling Human-AI Interactions
by: Yang, Shu, et al.
Published: (2024)

MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning
by: Yang, Shu, et al.
Published: (2024)

MQA-KEAL: Multi-hop Question Answering under Knowledge Editing for Arabic Language
by: Ali, Muhammad Asif, et al.
Published: (2024)

IPO: Interpretable Prompt Optimization for Vision-Language Models
by: Du, Yingjun, et al.
Published: (2024)

IPO-Mine: A Toolkit and Dataset for Section-Structured Analysis of Long, Multimodal IPO Documents
by: Galarnyk, Michael, et al.
Published: (2026)

VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Guided Iterative Policy Optimization
by: Li, Yunxin, et al.
Published: (2025)

Plug-and-Play Training Framework for Preference Optimization
by: Ma, Jingyuan, et al.
Published: (2024)

Intent-driven In-context Learning for Few-shot Dialogue State Tracking
by: Yi, Zihao, et al.
Published: (2024)

Self-Steering Optimization: Autonomous Preference Optimization for Large Language Models
by: Xiang, Hao, et al.
Published: (2024)

AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)

Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework
by: Wei, Xiao, et al.
Published: (2025)

Adaptive Preference Optimization with Uncertainty-aware Utility Anchor
by: Wang, Xiaobo, et al.
Published: (2025)

From Multimodal Perception to Strategic Reasoning: A Survey on AI-Generated Game Commentary
by: Zheng, Qirui, et al.
Published: (2025)

AIPO: Improving Training Objective for Iterative Preference Optimization
by: Shen, Yaojie, et al.
Published: (2024)

Soft Adaptive Policy Optimization
by: Gao, Chang, et al.
Published: (2025)

ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning
by: Wang, Yeyuan, et al.
Published: (2025)

Adaptive Decoding via Latent Preference Optimization
by: Dhuliawala, Shehzaad, et al.
Published: (2024)

Evaluating and Aligning CodeLLMs on Human Preference
by: Yang, Jian, et al.
Published: (2024)

Understanding Reasoning in Chain-of-Thought from the Hopfieldian View
by: Hu, Lijie, et al.
Published: (2024)

A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
by: Hu, Lijie, et al.
Published: (2024)

Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs
by: Yang, Shu, et al.
Published: (2024)

Can LLMs Capture Human Preferences?
by: Goli, Ali, et al.
Published: (2023)

Multi-Granularity Open Intent Classification via Adaptive Granular-Ball Decision Boundary
by: Li, Yanhua, et al.
Published: (2024)

Deep Learning Approaches for Multimodal Intent Recognition: A Survey
by: Zhao, Jingwei, et al.
Published: (2025)

Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
by: Abbasli, Toghrul, et al.
Published: (2025)

Just a Scratch: Enhancing LLM Capabilities for Self-harm Detection through Intent Differentiation and Emoji Interpretation
by: Ghosh, Soumitra, et al.
Published: (2025)

Interpersonal Memory Matters: A New Task for Proactive Dialogue Utilizing Conversational History
by: Wu, Bowen, et al.
Published: (2025)

CODEMENV: Benchmarking Large Language Models on Code Migration
by: Cheng, Keyuan, et al.
Published: (2025)

Preference Optimization with Multi-Sample Comparisons
by: Wang, Chaoqi, et al.
Published: (2024)

No Preference Left Behind: Group Distributional Preference Optimization
by: Yao, Binwei, et al.
Published: (2024)

Direct Judgement Preference Optimization
by: Wang, Peifeng, et al.
Published: (2024)

Dual-IPO: Dual-Iterative Preference Optimization for Text-to-Video Generation
by: Yang, Xiaomeng, et al.
Published: (2025)

GRAVITY: A Framework for Personalized Text Generation via Profile-Grounded Synthetic Preferences
by: Dey, Priyanka, et al.
Published: (2025)

Token-level Direct Preference Optimization
by: Zeng, Yongcheng, et al.
Published: (2024)

The Fellowship of the LLMs: Multi-Model Workflows for Synthetic Preference Optimization Dataset Generation
by: Arif, Samee, et al.
Published: (2024)

Segment-Level Coherence for Robust Harmful Intent Probing in LLMs
by: He, Xuanli, et al.
Published: (2026)