:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhao, Yunxiao, Wang, Zhiqiang, Yu, Xingtong, Li, Xiaoli, Liang, Jiye, Li, Ru
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.13393
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective
by: Zhao, Yunxiao, et al.
Published: (2025)

Event-Aware Prompt Learning for Dynamic Graphs
by: Yu, Xingtong, et al.
Published: (2025)

Consistency-Aware Editing for Entity-level Unlearning in Language Models
by: Han, Xiaoqi, et al.
Published: (2025)

Constrained Language Model Policy Optimization via Risk-aware Stepwise Alignment
by: Zhang, Lijun, et al.
Published: (2025)

MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging
by: Wang, Jiapeng, et al.
Published: (2026)

Style-Preserving Policy Optimization for Game Agents
by: Li, Lingfeng, et al.
Published: (2025)

Time-varying Interaction Graph ODE for Dynamic Graph Representation Learning
by: Wang, Xiaoyi, et al.
Published: (2026)

Improving Rationality in the Reasoning Process of Language Models through Self-playing Game
by: Wang, Pinzheng, et al.
Published: (2025)

DOGMA: Weaving Structural Information into Data-centric Single-cell Transcriptomics Analysis
by: Zhang, Ru, et al.
Published: (2026)

CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric
by: Guo, Yunxiao, et al.
Published: (2021)

RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library
by: Wang, Jiapeng, et al.
Published: (2025)

Human-centered explanation does not fit all: The interplay of sociotechnical, cognitive, and individual factors in the effect AI explanations in algorithmic decision-making
by: Ahn, Yongsu, et al.
Published: (2025)

PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
by: Guan, Lei, et al.
Published: (2023)

EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
by: Zhang, Yunxiao, et al.
Published: (2025)

Is Data Valuation Learnable and Interpretable?
by: Wu, Ou, et al.
Published: (2024)

Understanding Representation Learnability of Nonlinear Self-Supervised Learning
by: Yang, Ruofeng, et al.
Published: (2024)

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
by: Lou, Zhanzhi, et al.
Published: (2026)

Zero-Knowledge Proof Based Verifiable Inference of Models
by: Wang, Yunxiao
Published: (2025)

Learnable Chernoff Baselines for Inference-Time Alignment
by: Madhow, Sunil, et al.
Published: (2026)

The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations
by: Swamy, Vinitra, et al.
Published: (2023)

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
by: Xu, Zelai, et al.
Published: (2025)

Exploring the Reliability of Self-explanation and its Relationship with Classification in Language Model-driven Financial Analysis
by: Yuan, Han, et al.
Published: (2025)

MemPO: Self-Memory Policy Optimization for Long-Horizon Agents
by: Li, Ruoran, et al.
Published: (2026)

Apollonion: Profile-centric Dialog Agent
by: Chen, Shangyu, et al.
Published: (2024)

LR-CNN: Lightweight Row-centric Convolutional Neural Network Training for Memory Reduction
by: Wang, Zhigang, et al.
Published: (2024)

MirrorBench: Evaluating Self-centric Intelligence in MLLMs by Introducing a Mirror
by: Guo, Shengyu, et al.
Published: (2026)

WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models
by: Yin, Qiyue, et al.
Published: (2025)

Provable and Practical In-Context Policy Optimization for Self-Improvement
by: Yu, Tianrun, et al.
Published: (2026)

Toward Data-centric Directed Graph Learning: An Entropy-driven Approach
by: Li, Xunkai, et al.
Published: (2025)

Exploring Large Language Models for Feature Selection: A Data-centric Perspective
by: Li, Dawei, et al.
Published: (2024)

Finding Kissing Numbers with Game-theoretic Reinforcement Learning
by: Ma, Chengdong, et al.
Published: (2025)

Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization
by: Zhou, Huilin, et al.
Published: (2026)

Parameter Hierarchical Optimization for Visible-Infrared Person Re-Identification
by: YU, Zeng, et al.
Published: (2024)

SciIntegrity-Bench: A Benchmark for Evaluating Academic Integrity in AI Scientist Systems
by: Yang, Zonglin, et al.
Published: (2026)

SL-BiLEM: Structured Learnable Behavior-in-the-Loop Epidemic Modeling for Forecasting and Policy Evaluation
by: Wang, Haochun, et al.
Published: (2026)

Difficulty-Estimated Policy Optimization
by: Zhao, Yu, et al.
Published: (2026)

Review of Data-centric Time Series Analysis from Sample, Feature, and Period
by: Sun, Chenxi, et al.
Published: (2024)

Combining Cognitive and Generative AI for Self-explanation in Interactive AI Agents
by: Sushri, Shalini, et al.
Published: (2024)

Game-theoretic LLM: Agent Workflow for Negotiation Games
by: Hua, Wenyue, et al.
Published: (2024)

Game-Theoretic Modeling of Vehicle Unprotected Left Turns Considering Drivers' Bounded Rationality
by: Lian, Yuansheng, et al.
Published: (2025)