:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zollicoffer, Geigh, Chopra, Tanush, Yan, Mingkuan, Ma, Xiaoxu, Eaton, Kenneth, Riedl, Mark
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.01119
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Novelty Detection in Reinforcement Learning with World Models
by: Zollicoffer, Geigh, et al.
Published: (2023)

Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
by: Balloch, Jonathan C., et al.
Published: (2024)

The Interpretability of Codebooks in Model-Based Reinforcement Learning is Limited
by: Eaton, Kenneth, et al.
Published: (2024)

HalluField: Detecting LLM Hallucinations via Field-Theoretic Modeling
by: Vu, Minh, et al.
Published: (2025)

View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
by: Chopra, Tanush, et al.
Published: (2024)

Topological Signatures of Adversaries in Multimodal Alignments
by: Vu, Minh, et al.
Published: (2025)

LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
by: Zollicoffer, Geigh, et al.
Published: (2024)

LaFA: Latent Feature Attacks on Non-negative Matrix Factorization
by: Vu, Minh, et al.
Published: (2024)

MTRE: Multi-Token Reliability Estimation for Hallucination Detection in VLMs
by: Zollicoffer, Geigh, et al.
Published: (2025)

Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization
by: Bhatta, Kshitij, et al.
Published: (2024)

Towards Faster Matrix Diagonalization with Graph Isomorphism Networks and the AlphaZero Framework
by: Zollicoffer, Geigh, et al.
Published: (2024)

Hybrid Neural World Models
by: Lakshmanan, Pranav, et al.
Published: (2026)

Sanity Checks for Long-Form Hallucination Detection
by: Zollicoffer, Geigh, et al.
Published: (2026)

Surprisal Driven $k$-NN for Robust and Interpretable Nonparametric Learning
by: Banerjee, Amartya, et al.
Published: (2023)

External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling
by: Bhagat, Rishav, et al.
Published: (2024)

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling
by: Ma, Jiajun, et al.
Published: (2024)

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages
by: Sharma, Aman, et al.
Published: (2026)

Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models
by: Hu, Wentao, et al.
Published: (2025)

Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMs
by: Ma, Xin, et al.
Published: (2024)

History Rhymes: Macro-Contextual Retrieval for Robust Financial Forecasting
by: Khanna, Sarthak, et al.
Published: (2025)

On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models
by: Farhat, Sean, et al.
Published: (2024)

Verification of the Implicit World Model in a Generative Model via Adversarial Sequences
by: Balogh, András, et al.
Published: (2026)

Influence functions and regularity tangents for efficient active learning
by: Eaton, Frederik
Published: (2024)

Model Agreement via Anchoring
by: Eaton, Eric, et al.
Published: (2026)

The Sequential Edge: Inverse-Entropy Voting Beats Parallel Self-Consistency at Matched Compute
by: Sharma, Aman, et al.
Published: (2025)

Think Just Enough: Sequence-Level Entropy as a Confidence Signal for LLM Reasoning
by: Sharma, Aman, et al.
Published: (2025)

Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts
by: Trehan, Dhruv, et al.
Published: (2026)

Discovering Reinforcement Learning Interfaces with Large Language Models
by: Jaswal, Akshat Singh, et al.
Published: (2026)

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
by: Joo, Taejong, et al.
Published: (2026)

Response Wide Shut? Surprising Observations in Basic Vision Language Model Capabilities
by: Chandhok, Shivam, et al.
Published: (2025)

Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
by: Hugessen, Adriana, et al.
Published: (2024)

Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
by: GX-Chen, Anthony, et al.
Published: (2024)

AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise
by: Agarwal, Dhruv, et al.
Published: (2025)

Building Interpretable Models for Moral Decision-Making
by: Goel, Mayank, et al.
Published: (2026)

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning
by: Liu, Huihan, et al.
Published: (2026)

Learning Latent Dynamic Robust Representations for World Models
by: Sun, Ruixiang, et al.
Published: (2024)

On the Surprising Effectiveness of Large Learning Rates under Standard Width Scaling
by: Haas, Moritz, et al.
Published: (2025)

Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection
by: Özer, Kadir-Kaan, et al.
Published: (2026)

Using Analytics on Student Created Data to Content Validate Pedagogical Tools
by: Kos, John, et al.
Published: (2023)

A Temporally Augmented Graph Attention Network for Affordance Classification
by: Chopra, Ami, et al.
Published: (2026)