Saved in:
| Main Authors: | Zhao, Yunxiao, Wang, Zhiqiang, Yu, Xingtong, Li, Xiaoli, Liang, Jiye, Li, Ru |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.13393 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective
by: Zhao, Yunxiao, et al.
Published: (2025)
by: Zhao, Yunxiao, et al.
Published: (2025)
Event-Aware Prompt Learning for Dynamic Graphs
by: Yu, Xingtong, et al.
Published: (2025)
by: Yu, Xingtong, et al.
Published: (2025)
Consistency-Aware Editing for Entity-level Unlearning in Language Models
by: Han, Xiaoqi, et al.
Published: (2025)
by: Han, Xiaoqi, et al.
Published: (2025)
Constrained Language Model Policy Optimization via Risk-aware Stepwise Alignment
by: Zhang, Lijun, et al.
Published: (2025)
by: Zhang, Lijun, et al.
Published: (2025)
MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging
by: Wang, Jiapeng, et al.
Published: (2026)
by: Wang, Jiapeng, et al.
Published: (2026)
Style-Preserving Policy Optimization for Game Agents
by: Li, Lingfeng, et al.
Published: (2025)
by: Li, Lingfeng, et al.
Published: (2025)
Time-varying Interaction Graph ODE for Dynamic Graph Representation Learning
by: Wang, Xiaoyi, et al.
Published: (2026)
by: Wang, Xiaoyi, et al.
Published: (2026)
Improving Rationality in the Reasoning Process of Language Models through Self-playing Game
by: Wang, Pinzheng, et al.
Published: (2025)
by: Wang, Pinzheng, et al.
Published: (2025)
DOGMA: Weaving Structural Information into Data-centric Single-cell Transcriptomics Analysis
by: Zhang, Ru, et al.
Published: (2026)
by: Zhang, Ru, et al.
Published: (2026)
CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric
by: Guo, Yunxiao, et al.
Published: (2021)
by: Guo, Yunxiao, et al.
Published: (2021)
RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library
by: Wang, Jiapeng, et al.
Published: (2025)
by: Wang, Jiapeng, et al.
Published: (2025)
Human-centered explanation does not fit all: The interplay of sociotechnical, cognitive, and individual factors in the effect AI explanations in algorithmic decision-making
by: Ahn, Yongsu, et al.
Published: (2025)
by: Ahn, Yongsu, et al.
Published: (2025)
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
by: Guan, Lei, et al.
Published: (2023)
by: Guan, Lei, et al.
Published: (2023)
EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
by: Zhang, Yunxiao, et al.
Published: (2025)
by: Zhang, Yunxiao, et al.
Published: (2025)
Is Data Valuation Learnable and Interpretable?
by: Wu, Ou, et al.
Published: (2024)
by: Wu, Ou, et al.
Published: (2024)
Understanding Representation Learnability of Nonlinear Self-Supervised Learning
by: Yang, Ruofeng, et al.
Published: (2024)
by: Yang, Ruofeng, et al.
Published: (2024)
Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
by: Lou, Zhanzhi, et al.
Published: (2026)
by: Lou, Zhanzhi, et al.
Published: (2026)
Zero-Knowledge Proof Based Verifiable Inference of Models
by: Wang, Yunxiao
Published: (2025)
by: Wang, Yunxiao
Published: (2025)
Learnable Chernoff Baselines for Inference-Time Alignment
by: Madhow, Sunil, et al.
Published: (2026)
by: Madhow, Sunil, et al.
Published: (2026)
The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations
by: Swamy, Vinitra, et al.
Published: (2023)
by: Swamy, Vinitra, et al.
Published: (2023)
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
by: Xu, Zelai, et al.
Published: (2025)
by: Xu, Zelai, et al.
Published: (2025)
Exploring the Reliability of Self-explanation and its Relationship with Classification in Language Model-driven Financial Analysis
by: Yuan, Han, et al.
Published: (2025)
by: Yuan, Han, et al.
Published: (2025)
MemPO: Self-Memory Policy Optimization for Long-Horizon Agents
by: Li, Ruoran, et al.
Published: (2026)
by: Li, Ruoran, et al.
Published: (2026)
Apollonion: Profile-centric Dialog Agent
by: Chen, Shangyu, et al.
Published: (2024)
by: Chen, Shangyu, et al.
Published: (2024)
LR-CNN: Lightweight Row-centric Convolutional Neural Network Training for Memory Reduction
by: Wang, Zhigang, et al.
Published: (2024)
by: Wang, Zhigang, et al.
Published: (2024)
MirrorBench: Evaluating Self-centric Intelligence in MLLMs by Introducing a Mirror
by: Guo, Shengyu, et al.
Published: (2026)
by: Guo, Shengyu, et al.
Published: (2026)
WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models
by: Yin, Qiyue, et al.
Published: (2025)
by: Yin, Qiyue, et al.
Published: (2025)
Provable and Practical In-Context Policy Optimization for Self-Improvement
by: Yu, Tianrun, et al.
Published: (2026)
by: Yu, Tianrun, et al.
Published: (2026)
Toward Data-centric Directed Graph Learning: An Entropy-driven Approach
by: Li, Xunkai, et al.
Published: (2025)
by: Li, Xunkai, et al.
Published: (2025)
Exploring Large Language Models for Feature Selection: A Data-centric Perspective
by: Li, Dawei, et al.
Published: (2024)
by: Li, Dawei, et al.
Published: (2024)
Finding Kissing Numbers with Game-theoretic Reinforcement Learning
by: Ma, Chengdong, et al.
Published: (2025)
by: Ma, Chengdong, et al.
Published: (2025)
Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization
by: Zhou, Huilin, et al.
Published: (2026)
by: Zhou, Huilin, et al.
Published: (2026)
Parameter Hierarchical Optimization for Visible-Infrared Person Re-Identification
by: YU, Zeng, et al.
Published: (2024)
by: YU, Zeng, et al.
Published: (2024)
SciIntegrity-Bench: A Benchmark for Evaluating Academic Integrity in AI Scientist Systems
by: Yang, Zonglin, et al.
Published: (2026)
by: Yang, Zonglin, et al.
Published: (2026)
SL-BiLEM: Structured Learnable Behavior-in-the-Loop Epidemic Modeling for Forecasting and Policy Evaluation
by: Wang, Haochun, et al.
Published: (2026)
by: Wang, Haochun, et al.
Published: (2026)
Difficulty-Estimated Policy Optimization
by: Zhao, Yu, et al.
Published: (2026)
by: Zhao, Yu, et al.
Published: (2026)
Review of Data-centric Time Series Analysis from Sample, Feature, and Period
by: Sun, Chenxi, et al.
Published: (2024)
by: Sun, Chenxi, et al.
Published: (2024)
Combining Cognitive and Generative AI for Self-explanation in Interactive AI Agents
by: Sushri, Shalini, et al.
Published: (2024)
by: Sushri, Shalini, et al.
Published: (2024)
Game-theoretic LLM: Agent Workflow for Negotiation Games
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
Game-Theoretic Modeling of Vehicle Unprotected Left Turns Considering Drivers' Bounded Rationality
by: Lian, Yuansheng, et al.
Published: (2025)
by: Lian, Yuansheng, et al.
Published: (2025)
Similar Items
-
Explaining Black-box Language Models with Knowledge Probing Systems: A Post-hoc Explanation Perspective
by: Zhao, Yunxiao, et al.
Published: (2025) -
Event-Aware Prompt Learning for Dynamic Graphs
by: Yu, Xingtong, et al.
Published: (2025) -
Consistency-Aware Editing for Entity-level Unlearning in Language Models
by: Han, Xiaoqi, et al.
Published: (2025) -
Constrained Language Model Policy Optimization via Risk-aware Stepwise Alignment
by: Zhang, Lijun, et al.
Published: (2025) -
MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging
by: Wang, Jiapeng, et al.
Published: (2026)