:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Advani, Laksh
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2601.00513
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Trajectory Guard -- A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI
by: Advani, Laksh
Published: (2026)

Clever Materials: When Models Identify Good Materials for the Wrong Reasons
by: Jablonka, Kevin Maik
Published: (2026)

Data Cartography for Detecting Memorization Hotspots and Guiding Data Interventions in Generative Models
by: Patel, Laksh, et al.
Published: (2025)

Wrong Model, Right Uncertainty: Spatial Associations for Discrete Data with Misspecification
by: Burt, David R., et al.
Published: (2025)

Perplexity Cannot Always Tell Right from Wrong
by: Veličković, Petar, et al.
Published: (2026)

When World Models Dream Wrong: Physical-Conditioned Adversarial Attacks against World Models
by: Guo, Zhixiang, et al.
Published: (2026)

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
by: Son, Seongho, et al.
Published: (2024)

Stable but Wrong: When More Data Degrades Scientific Conclusions
by: Zhang, Zhipeng, et al.
Published: (2026)

When to Trust the Cheap Check: Weak and Strong Verification for Reasoning
by: Kiyani, Shayan, et al.
Published: (2026)

When PINNs Go Wrong: Pseudo-Time Stepping Against Spurious Solutions
by: Wang, Sifan, et al.
Published: (2026)

[Experiments & Analysis] Evaluating the Feasibility of Sampling-Based Techniques for Training Multilayer Perceptrons
by: Ebrahimi, Sana, et al.
Published: (2023)

When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception
by: Zolfaghari, Vahideh
Published: (2026)

Trustworthy AI: Ensuring Reliability and Accountability from Models to Agents
by: Long, Carol Xuan
Published: (2026)

When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning
by: Singhi, Nishad, et al.
Published: (2025)

Thinking Wrong in Silence: Backdoor Attacks on Continuous Latent Reasoning
by: Parekh, Swapnil
Published: (2026)

Know When You're Wrong: Aligning Confidence with Correctness for LLM Error Detection
by: Xiaohu, Xie, et al.
Published: (2026)

When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic
by: Fernández-Hernández, Alberto, et al.
Published: (2026)

Acting for the Right Reasons: Creating Reason-Sensitive Artificial Moral Agents
by: Baum, Kevin, et al.
Published: (2024)

Right for the Right Reasons: Avoiding Reasoning Shortcuts via Prototypical Neurosymbolic AI
by: Andolfi, Luca, et al.
Published: (2025)

Towards Trustworthy GUI Agents: A Survey
by: Shi, Yucheng, et al.
Published: (2025)

ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery
by: Shrivastava, Ayush, et al.
Published: (2026)

Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL
by: Cai, Siyang, et al.
Published: (2026)

Decidable By Construction: Design-Time Verification for Trustworthy AI
by: Haynes, Houston
Published: (2026)

The Right Answer, the Wrong Direction: Why Transformers Fail at Counting and How to Fix It
by: Garcia, Gabriel
Published: (2026)

All AI Models are Wrong, but Some are Optimal
by: Anand, Akhil S, et al.
Published: (2025)

LLMs as Assessors: Right for the Right Reason?
by: Saha, Sourav, et al.
Published: (2026)

VerificAgent: Domain-Specific Memory Verification for Scalable Oversight of Aligned Computer-Use Agents
by: Nguyen, Thong Q., et al.
Published: (2025)

What is Wrong with Perplexity for Long-context Language Modeling?
by: Fang, Lizhe, et al.
Published: (2024)

Beyond Benchmarks: Dynamic, Automatic And Systematic Red-Teaming Agents For Trustworthy Medical Language Models
by: Pan, Jiazhen, et al.
Published: (2025)

Trustworthy Prediction with Gaussian Process Knowledge Scores
by: Butler, Kurt, et al.
Published: (2025)

Verification and Validation for Trustworthy Scientific Machine Learning
by: Jakeman, John D., et al.
Published: (2025)

Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision
by: Ning, Kanghui, et al.
Published: (2025)

Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA
by: Martinez, John Ray B.
Published: (2026)

An Accurate and Interpretable Framework for Trustworthy Process Monitoring
by: Wang, Hao, et al.
Published: (2023)

ManifoldMind: Dynamic Hyperbolic Reasoning for Trustworthy Recommendations
by: Harit, Anoushka, et al.
Published: (2025)

The Geometry of Self-Verification in a Task-Specific Reasoning Model
by: Lee, Andrew, et al.
Published: (2025)

To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning
by: Qin, Tian, et al.
Published: (2025)

Step-by-Step Diffusion: An Elementary Tutorial
by: Nakkiran, Preetum, et al.
Published: (2024)

Out-of-Distribution Detection Methods Answer the Wrong Questions
by: Li, Yucen Lily, et al.
Published: (2025)

Your Assumed DAG is Wrong and Here's How To Deal With It
by: Padh, Kirtan, et al.
Published: (2025)