Saved in:
| Main Author: | Advani, Laksh |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.00513 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Trajectory Guard -- A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI
by: Advani, Laksh
Published: (2026)
by: Advani, Laksh
Published: (2026)
Clever Materials: When Models Identify Good Materials for the Wrong Reasons
by: Jablonka, Kevin Maik
Published: (2026)
by: Jablonka, Kevin Maik
Published: (2026)
Data Cartography for Detecting Memorization Hotspots and Guiding Data Interventions in Generative Models
by: Patel, Laksh, et al.
Published: (2025)
by: Patel, Laksh, et al.
Published: (2025)
Wrong Model, Right Uncertainty: Spatial Associations for Discrete Data with Misspecification
by: Burt, David R., et al.
Published: (2025)
by: Burt, David R., et al.
Published: (2025)
Perplexity Cannot Always Tell Right from Wrong
by: Veličković, Petar, et al.
Published: (2026)
by: Veličković, Petar, et al.
Published: (2026)
When World Models Dream Wrong: Physical-Conditioned Adversarial Attacks against World Models
by: Guo, Zhixiang, et al.
Published: (2026)
by: Guo, Zhixiang, et al.
Published: (2026)
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
by: Son, Seongho, et al.
Published: (2024)
by: Son, Seongho, et al.
Published: (2024)
Stable but Wrong: When More Data Degrades Scientific Conclusions
by: Zhang, Zhipeng, et al.
Published: (2026)
by: Zhang, Zhipeng, et al.
Published: (2026)
When to Trust the Cheap Check: Weak and Strong Verification for Reasoning
by: Kiyani, Shayan, et al.
Published: (2026)
by: Kiyani, Shayan, et al.
Published: (2026)
When PINNs Go Wrong: Pseudo-Time Stepping Against Spurious Solutions
by: Wang, Sifan, et al.
Published: (2026)
by: Wang, Sifan, et al.
Published: (2026)
[Experiments & Analysis] Evaluating the Feasibility of Sampling-Based Techniques for Training Multilayer Perceptrons
by: Ebrahimi, Sana, et al.
Published: (2023)
by: Ebrahimi, Sana, et al.
Published: (2023)
When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception
by: Zolfaghari, Vahideh
Published: (2026)
by: Zolfaghari, Vahideh
Published: (2026)
Trustworthy AI: Ensuring Reliability and Accountability from Models to Agents
by: Long, Carol Xuan
Published: (2026)
by: Long, Carol Xuan
Published: (2026)
When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning
by: Singhi, Nishad, et al.
Published: (2025)
by: Singhi, Nishad, et al.
Published: (2025)
Thinking Wrong in Silence: Backdoor Attacks on Continuous Latent Reasoning
by: Parekh, Swapnil
Published: (2026)
by: Parekh, Swapnil
Published: (2026)
Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection
by: Xiaohu, Xie, et al.
Published: (2026)
by: Xiaohu, Xie, et al.
Published: (2026)
When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic
by: Fernández-Hernández, Alberto, et al.
Published: (2026)
by: Fernández-Hernández, Alberto, et al.
Published: (2026)
Acting for the Right Reasons: Creating Reason-Sensitive Artificial Moral Agents
by: Baum, Kevin, et al.
Published: (2024)
by: Baum, Kevin, et al.
Published: (2024)
Right for the Right Reasons: Avoiding Reasoning Shortcuts via Prototypical Neurosymbolic AI
by: Andolfi, Luca, et al.
Published: (2025)
by: Andolfi, Luca, et al.
Published: (2025)
Towards Trustworthy GUI Agents: A Survey
by: Shi, Yucheng, et al.
Published: (2025)
by: Shi, Yucheng, et al.
Published: (2025)
ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery
by: Shrivastava, Ayush, et al.
Published: (2026)
by: Shrivastava, Ayush, et al.
Published: (2026)
Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL
by: Cai, Siyang, et al.
Published: (2026)
by: Cai, Siyang, et al.
Published: (2026)
Decidable By Construction: Design-Time Verification for Trustworthy AI
by: Haynes, Houston
Published: (2026)
by: Haynes, Houston
Published: (2026)
The Right Answer, the Wrong Direction: Why Transformers Fail at Counting and How to Fix It
by: Garcia, Gabriel
Published: (2026)
by: Garcia, Gabriel
Published: (2026)
All AI Models are Wrong, but Some are Optimal
by: Anand, Akhil S, et al.
Published: (2025)
by: Anand, Akhil S, et al.
Published: (2025)
LLMs as Assessors: Right for the Right Reason?
by: Saha, Sourav, et al.
Published: (2026)
by: Saha, Sourav, et al.
Published: (2026)
VerificAgent: Domain-Specific Memory Verification for Scalable Oversight of Aligned Computer-Use Agents
by: Nguyen, Thong Q., et al.
Published: (2025)
by: Nguyen, Thong Q., et al.
Published: (2025)
What is Wrong with Perplexity for Long-context Language Modeling?
by: Fang, Lizhe, et al.
Published: (2024)
by: Fang, Lizhe, et al.
Published: (2024)
Beyond Benchmarks: Dynamic, Automatic And Systematic Red-Teaming Agents For Trustworthy Medical Language Models
by: Pan, Jiazhen, et al.
Published: (2025)
by: Pan, Jiazhen, et al.
Published: (2025)
Trustworthy Prediction with Gaussian Process Knowledge Scores
by: Butler, Kurt, et al.
Published: (2025)
by: Butler, Kurt, et al.
Published: (2025)
Verification and Validation for Trustworthy Scientific Machine Learning
by: Jakeman, John D., et al.
Published: (2025)
by: Jakeman, John D., et al.
Published: (2025)
Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision
by: Ning, Kanghui, et al.
Published: (2025)
by: Ning, Kanghui, et al.
Published: (2025)
Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA
by: Martinez, John Ray B.
Published: (2026)
by: Martinez, John Ray B.
Published: (2026)
An Accurate and Interpretable Framework for Trustworthy Process Monitoring
by: Wang, Hao, et al.
Published: (2023)
by: Wang, Hao, et al.
Published: (2023)
ManifoldMind: Dynamic Hyperbolic Reasoning for Trustworthy Recommendations
by: Harit, Anoushka, et al.
Published: (2025)
by: Harit, Anoushka, et al.
Published: (2025)
The Geometry of Self-Verification in a Task-Specific Reasoning Model
by: Lee, Andrew, et al.
Published: (2025)
by: Lee, Andrew, et al.
Published: (2025)
To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning
by: Qin, Tian, et al.
Published: (2025)
by: Qin, Tian, et al.
Published: (2025)
Step-by-Step Diffusion: An Elementary Tutorial
by: Nakkiran, Preetum, et al.
Published: (2024)
by: Nakkiran, Preetum, et al.
Published: (2024)
Out-of-Distribution Detection Methods Answer the Wrong Questions
by: Li, Yucen Lily, et al.
Published: (2025)
by: Li, Yucen Lily, et al.
Published: (2025)
Your Assumed DAG is Wrong and Here's How To Deal With It
by: Padh, Kirtan, et al.
Published: (2025)
by: Padh, Kirtan, et al.
Published: (2025)
Similar Items
-
Trajectory Guard -- A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI
by: Advani, Laksh
Published: (2026) -
Clever Materials: When Models Identify Good Materials for the Wrong Reasons
by: Jablonka, Kevin Maik
Published: (2026) -
Data Cartography for Detecting Memorization Hotspots and Guiding Data Interventions in Generative Models
by: Patel, Laksh, et al.
Published: (2025) -
Wrong Model, Right Uncertainty: Spatial Associations for Discrete Data with Misspecification
by: Burt, David R., et al.
Published: (2025) -
Perplexity Cannot Always Tell Right from Wrong
by: Veličković, Petar, et al.
Published: (2026)