Saved in:
| Main Authors: | Liu, Wei, Xu, Haomei, Liu, Bingqing, Deng, Zhiying, Wang, Haozhao, Wang, Jun, Li, Ruixuan, Teh, Yee Whye, Lee, Wee Sun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.00625 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Are We Evaluating the Edit Locality of LLM Model Editing Properly?
by: Liu, Wei, et al.
Published: (2026)
by: Liu, Wei, et al.
Published: (2026)
From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing
by: Liu, Wei, et al.
Published: (2026)
by: Liu, Wei, et al.
Published: (2026)
LinTree: Improving LLM Reasoning with Explicitly Structured Search Histories
by: Kang, Liwei, et al.
Published: (2026)
by: Kang, Liwei, et al.
Published: (2026)
SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
by: Zheng, Zhi, et al.
Published: (2025)
by: Zheng, Zhi, et al.
Published: (2025)
Extending Epistemic Uncertainty Beyond Parameters Would Assist in Designing Reliable LLMs
by: Nguyen-Hien, T. Duy, et al.
Published: (2025)
by: Nguyen-Hien, T. Duy, et al.
Published: (2025)
Breaking Free from MMI: A New Frontier in Rationalization by Probing Input Utilization
by: Liu, Wei, et al.
Published: (2025)
by: Liu, Wei, et al.
Published: (2025)
Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets
by: Liu, Wei, et al.
Published: (2025)
by: Liu, Wei, et al.
Published: (2025)
Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization
by: Liu, Wei, et al.
Published: (2024)
by: Liu, Wei, et al.
Published: (2024)
NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
by: Li, Qinyu, et al.
Published: (2025)
by: Li, Qinyu, et al.
Published: (2025)
Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables
by: Geng, Xuzhao, et al.
Published: (2025)
by: Geng, Xuzhao, et al.
Published: (2025)
Verifier-Backed Hard Problem Generation for Mathematical Reasoning
by: Lai, Yuhang, et al.
Published: (2026)
by: Lai, Yuhang, et al.
Published: (2026)
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
by: Sims, Anya, et al.
Published: (2024)
by: Sims, Anya, et al.
Published: (2024)
L3Ms -- Lagrange Large Language Models
by: Dhillon, Guneet S., et al.
Published: (2024)
by: Dhillon, Guneet S., et al.
Published: (2024)
Incorporating Unlabelled Data into Bayesian Neural Networks
by: Sharma, Mrinank, et al.
Published: (2023)
by: Sharma, Mrinank, et al.
Published: (2023)
SymDiff: Equivariant Diffusion via Stochastic Symmetrisation
by: Zhang, Leo, et al.
Published: (2024)
by: Zhang, Leo, et al.
Published: (2024)
Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey
by: Liu, Qiyuan, et al.
Published: (2025)
by: Liu, Qiyuan, et al.
Published: (2025)
Manifold Aware Denoising Score Matching (MAD)
by: Levy-Jurgenson, Alona, et al.
Published: (2026)
by: Levy-Jurgenson, Alona, et al.
Published: (2026)
Foundation of Intelligence: Review of Math Word Problems from Human Cognition Perspective
by: Huang, Zhenya, et al.
Published: (2025)
by: Huang, Zhenya, et al.
Published: (2025)
Amortized Probabilistic Detection of Communities in Graphs
by: Wang, Yueqi, et al.
Published: (2020)
by: Wang, Yueqi, et al.
Published: (2020)
Rao-Blackwellised Reparameterisation Gradients
by: Lam, Kevin H., et al.
Published: (2025)
by: Lam, Kevin H., et al.
Published: (2025)
Learning to Contextualize Web Pages for Enhanced Decision Making by LLM Agents
by: Lee, Dongjun, et al.
Published: (2025)
by: Lee, Dongjun, et al.
Published: (2025)
Adversarial Attack for Explanation Robustness of Rationalization Models
by: Zhang, Yuankai, et al.
Published: (2024)
by: Zhang, Yuankai, et al.
Published: (2024)
Metropolis-Adjusted Diffusion Models
by: Lam, Kevin H., et al.
Published: (2026)
by: Lam, Kevin H., et al.
Published: (2026)
Selective Safety Steering via Value-Filtered Decoding
by: Einbinder, Bat-Sheva, et al.
Published: (2026)
by: Einbinder, Bat-Sheva, et al.
Published: (2026)
Meta-Learning Objectives for Preference Optimization
by: Alfano, Carlo, et al.
Published: (2024)
by: Alfano, Carlo, et al.
Published: (2024)
EvIL: Evolution Strategies for Generalisable Imitation Learning
by: Sapora, Silvia, et al.
Published: (2024)
by: Sapora, Silvia, et al.
Published: (2024)
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
by: Chen, Shengzhuang, et al.
Published: (2024)
by: Chen, Shengzhuang, et al.
Published: (2024)
BetaEdit: Null-Space Constrained Sequential Model Editing
by: Liu, Bingqing, et al.
Published: (2026)
by: Liu, Bingqing, et al.
Published: (2026)
SigmaDock: Untwisting Molecular Docking With Fragment-Based SE(3) Diffusion
by: Prat, Alvaro, et al.
Published: (2025)
by: Prat, Alvaro, et al.
Published: (2025)
Meta Flow Maps enable scalable reward alignment
by: Potaptchik, Peter, et al.
Published: (2026)
by: Potaptchik, Peter, et al.
Published: (2026)
Prompting Strategies for Enabling Large Language Models to Infer Causation from Correlation
by: Sgouritsa, Eleni, et al.
Published: (2024)
by: Sgouritsa, Eleni, et al.
Published: (2024)
Aortobronchial Fistula: A Case Report and Literature Review
by: Zhenghan Liu, et al.
Published: (2025)
by: Zhenghan Liu, et al.
Published: (2025)
Online Adaptation of Language Models with a Memory of Amortized Contexts
by: Tack, Jihoon, et al.
Published: (2024)
by: Tack, Jihoon, et al.
Published: (2024)
When Is Enough Not Enough? Illusory Completion in Search Agents
by: Ko, Dayoon, et al.
Published: (2026)
by: Ko, Dayoon, et al.
Published: (2026)
Variational Flow Maps: Make Some Noise for One-Step Conditional Generation
by: Mammadov, Abbas, et al.
Published: (2026)
by: Mammadov, Abbas, et al.
Published: (2026)
Kalman Filter for Online Classification of Non-Stationary Data
by: Titsias, Michalis K., et al.
Published: (2023)
by: Titsias, Michalis K., et al.
Published: (2023)
Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design
by: Klarner, Leo, et al.
Published: (2024)
by: Klarner, Leo, et al.
Published: (2024)
FedGIG: Graph Inversion from Gradient in Federated Learning
by: Xiao, Tianzhe, et al.
Published: (2024)
by: Xiao, Tianzhe, et al.
Published: (2024)
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
by: Galashov, Alexandre, et al.
Published: (2024)
by: Galashov, Alexandre, et al.
Published: (2024)
Information Science: A House Built on Sand
by: Vagianos, Louis
Published: (1972)
by: Vagianos, Louis
Published: (1972)
Similar Items
-
Are We Evaluating the Edit Locality of LLM Model Editing Properly?
by: Liu, Wei, et al.
Published: (2026) -
From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing
by: Liu, Wei, et al.
Published: (2026) -
LinTree: Improving LLM Reasoning with Explicitly Structured Search Histories
by: Kang, Liwei, et al.
Published: (2026) -
SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
by: Zheng, Zhi, et al.
Published: (2025) -
Extending Epistemic Uncertainty Beyond Parameters Would Assist in Designing Reliable LLMs
by: Nguyen-Hien, T. Duy, et al.
Published: (2025)