| Main Authors: | Chen, Jinkun, Cheng, Fengxiang, Han, Sijia, Keselj, Vlado |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02863 |
Similar Items
Fine, I'll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging
by: Su, Guinan, et al.
Published: (2025)
Equivariant Neural Simulators for Stochastic Spatiotemporal Dynamics
by: Minartz, Koen, et al.
Published: (2023)
Diagnosing Training Inference Mismatch in LLM Reinforcement Learning
by: Zhong, Tianle, et al.
Published: (2026)
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models
by: Sui, Yuan, et al.
Published: (2025)
Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights
by: Parashar, Shubham, et al.
Published: (2025)
Toward Adaptive Reasoning in Large Language Models with Thought Rollback
by: Chen, Sijia, et al.
Published: (2024)
Generalisation of RLHF under Reward Shift and Clipped KL Regularisation
by: Tang, Kenton, et al.
Published: (2026)
Ensuring Safety in an Uncertain Environment: Constrained MDPs via Stochastic Thresholds
by: Zuo, Qian, et al.
Published: (2025)
Adversarial Robustness Overestimation and Instability in TRADES
by: Li, Jonathan Weiping, et al.
Published: (2024)
Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs
by: Laine, Rudolf, et al.
Published: (2024)
When LLM Meets Time Series: Can LLMs Perform Multi-Step Time Series Reasoning and Inference
by: Ye, Wen, et al.
Published: (2025)
STFlow: Data-Coupled Flow Matching for Geometric Trajectory Simulation
by: Brinke, Kiet Bennema ten, et al.
Published: (2025)
DeXposure-FM: A Time-series, Graph Foundation Model for Credit Exposures and Stability on Decentralized Financial Networks
by: Shu, Aijie, et al.
Published: (2026)
Experience-Guided Adaptation of Inference-Time Reasoning Strategies
by: Stein, Adam, et al.
Published: (2025)
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
by: Pan, Rui, et al.
Published: (2025)
Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation
by: Zhong, Yisheng, et al.
Published: (2026)
Think Clearly: Improving Reasoning via Redundant Token Pruning
by: Choi, Daewon, et al.
Published: (2025)
Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification
by: Liang, Zhenwen, et al.
Published: (2024)
Unlearners Can Lie: Evaluating and Improving Honesty in LLM Unlearning
by: Gu, Renjie, et al.
Published: (2026)
TS-Reasoner: Domain-Oriented Time Series Inference Agents for Reasoning and Automated Analysis
by: Ye, Wen, et al.
Published: (2024)
From Observations to Parameters: Detecting Changepoint in Nonlinear Dynamics with Simulation-based Inference
by: Deng, Xiangbo, et al.
Published: (2025)
Decocted Experience Improves Test-Time Inference in LLM Agents
by: Shen, Maohao, et al.
Published: (2026)
I-LLM: Efficient Integer-Only Inference for Fully-Quantized Low-Bit Large Language Models
by: Hu, Xing, et al.
Published: (2024)
HEARTS: Benchmarking LLM Reasoning on Health Time Series
by: Li, Sirui, et al.
Published: (2026)
Do Transformers Have the Ability for Periodicity Generalization?
by: Liu, Huanyu, et al.
Published: (2026)
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks
by: Wang, Tianlong, et al.
Published: (2024)
Dynamic Search for Inference-Time Alignment in Diffusion Models
by: Li, Xiner, et al.
Published: (2025)
AgentKit: Structured LLM Reasoning with Dynamic Graphs
by: Wu, Yue, et al.
Published: (2024)
Latent-Space Contrastive Reinforcement Learning for Stable and Efficient LLM Reasoning
by: Shan, Lianlei, et al.
Published: (2026)
To Call or Not to Call: Diagnosing Intrinsic Over-Calling Bias in LLM Agents
by: Shi, Wei, et al.
Published: (2026)
Static Sandboxes Are Inadequate: Modeling Societal Complexity Requires Open-Ended Co-Evolution in LLM-Based Multi-Agent Simulations
by: Chen, Jinkun, et al.
Published: (2025)
FractalBench: Diagnosing Visual-Mathematical Reasoning Through Recursive Program Synthesis
by: Ondras, Jan, et al.
Published: (2025)
Can LLMs Score Medical Diagnoses and Clinical Reasoning as well as Expert Panels?
by: Rouillard, Amy, et al.
Published: (2026)
RetroReasoner: A Reasoning LLM for Strategic Retrosynthesis Prediction
by: Ko, Hanbum, et al.
Published: (2026)
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
by: Agarwal, Shivam, et al.
Published: (2025)
No Free Lunch: Rethinking Internal Feedback for LLM Reasoning
by: Zhang, Yanzhi, et al.
Published: (2025)
Mamba or Transformer for Time Series Forecasting? Mixture of Universals (MoU) Is All You Need
by: Peng, Sijia, et al.
Published: (2024)
Diagnosing Medical Datasets with Training Dynamics
by: Wenderoth, Laura
Published: (2024)
Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills
by: Wang, Changsheng, et al.
Published: (2025)
RePCS: Diagnosing Data Memorization in LLM-Powered Retrieval-Augmented Generation
by: Anh, Le Vu, et al.
Published: (2025)