:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Wei; Xu, Haomei; Liu, Bingqing; Deng, Zhiying; Wang, Haozhao; Wang, Jun; Li, Ruixuan; Teh, Yee Whye; Lee, Wee Sun
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.00625

Similar Items

Are We Evaluating the Edit Locality of LLM Model Editing Properly?
by: Liu, Wei, et al.
Published: (2026)

From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing
by: Liu, Wei, et al.
Published: (2026)

LinTree: Improving LLM Reasoning with Explicitly Structured Search Histories
by: Kang, Liwei, et al.
Published: (2026)

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
by: Zheng, Zhi, et al.
Published: (2025)

Extending Epistemic Uncertainty Beyond Parameters Would Assist in Designing Reliable LLMs
by: Nguyen-Hien, T. Duy, et al.
Published: (2025)

Breaking Free from MMI: A New Frontier in Rationalization by Probing Input Utilization
by: Liu, Wei, et al.
Published: (2025)

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets
by: Liu, Wei, et al.
Published: (2025)

Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization
by: Liu, Wei, et al.
Published: (2024)

NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
by: Li, Qinyu, et al.
Published: (2025)

Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables
by: Geng, Xuzhao, et al.
Published: (2025)

Verifier-Backed Hard Problem Generation for Mathematical Reasoning
by: Lai, Yuhang, et al.
Published: (2026)

The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
by: Sims, Anya, et al.
Published: (2024)

L3Ms -- Lagrange Large Language Models
by: Dhillon, Guneet S., et al.
Published: (2024)

Incorporating Unlabelled Data into Bayesian Neural Networks
by: Sharma, Mrinank, et al.
Published: (2023)

SymDiff: Equivariant Diffusion via Stochastic Symmetrisation
by: Zhang, Leo, et al.
Published: (2024)

Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey
by: Liu, Qiyuan, et al.
Published: (2025)

Manifold Aware Denoising Score Matching (MAD)
by: Levy-Jurgenson, Alona, et al.
Published: (2026)

Foundation of Intelligence: Review of Math Word Problems from Human Cognition Perspective
by: Huang, Zhenya, et al.
Published: (2025)

Amortized Probabilistic Detection of Communities in Graphs
by: Wang, Yueqi, et al.
Published: (2020)

Rao-Blackwellised Reparameterisation Gradients
by: Lam, Kevin H., et al.
Published: (2025)

Learning to Contextualize Web Pages for Enhanced Decision Making by LLM Agents
by: Lee, Dongjun, et al.
Published: (2025)

Adversarial Attack for Explanation Robustness of Rationalization Models
by: Zhang, Yuankai, et al.
Published: (2024)

Metropolis-Adjusted Diffusion Models
by: Lam, Kevin H., et al.
Published: (2026)

Selective Safety Steering via Value-Filtered Decoding
by: Einbinder, Bat-Sheva, et al.
Published: (2026)

Meta-Learning Objectives for Preference Optimization
by: Alfano, Carlo, et al.
Published: (2024)

EvIL: Evolution Strategies for Generalisable Imitation Learning
by: Sapora, Silvia, et al.
Published: (2024)

Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
by: Chen, Shengzhuang, et al.
Published: (2024)

BetaEdit: Null-Space Constrained Sequential Model Editing
by: Liu, Bingqing, et al.
Published: (2026)

SigmaDock: Untwisting Molecular Docking With Fragment-Based SE(3) Diffusion
by: Prat, Alvaro, et al.
Published: (2025)

Meta Flow Maps enable scalable reward alignment
by: Potaptchik, Peter, et al.
Published: (2026)

Prompting Strategies for Enabling Large Language Models to Infer Causation from Correlation
by: Sgouritsa, Eleni, et al.
Published: (2024)

Aortobronchial Fistula: A Case Report and Literature Review
by: Liu, Zhenghan, et al.
Published: (2025)

Online Adaptation of Language Models with a Memory of Amortized Contexts
by: Tack, Jihoon, et al.
Published: (2024)

When Is Enough Not Enough? Illusory Completion in Search Agents
by: Ko, Dayoon, et al.
Published: (2026)

Variational Flow Maps: Make Some Noise for One-Step Conditional Generation
by: Mammadov, Abbas, et al.
Published: (2026)

Kalman Filter for Online Classification of Non-Stationary Data
by: Titsias, Michalis K., et al.
Published: (2023)

Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design
by: Klarner, Leo, et al.
Published: (2024)

FedGIG: Graph Inversion from Gradient in Federated Learning
by: Xiao, Tianzhe, et al.
Published: (2024)

Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
by: Galashov, Alexandre, et al.
Published: (2024)

Information Science: A House Built on Sand
by: Vagianos, Louis
Published: (1972)