Saved in:
| Main Authors: | Zollicoffer, Geigh, Chopra, Tanush, Yan, Mingkuan, Ma, Xiaoxu, Eaton, Kenneth, Riedl, Mark |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.01119 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Novelty Detection in Reinforcement Learning with World Models
by: Zollicoffer, Geigh, et al.
Published: (2023)
by: Zollicoffer, Geigh, et al.
Published: (2023)
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
by: Balloch, Jonathan C., et al.
Published: (2024)
by: Balloch, Jonathan C., et al.
Published: (2024)
The Interpretability of Codebooks in Model-Based Reinforcement Learning is Limited
by: Eaton, Kenneth, et al.
Published: (2024)
by: Eaton, Kenneth, et al.
Published: (2024)
HalluField: Detecting LLM Hallucinations via Field-Theoretic Modeling
by: Vu, Minh, et al.
Published: (2025)
by: Vu, Minh, et al.
Published: (2025)
View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
by: Chopra, Tanush, et al.
Published: (2024)
by: Chopra, Tanush, et al.
Published: (2024)
Topological Signatures of Adversaries in Multimodal Alignments
by: Vu, Minh, et al.
Published: (2025)
by: Vu, Minh, et al.
Published: (2025)
LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
by: Zollicoffer, Geigh, et al.
Published: (2024)
by: Zollicoffer, Geigh, et al.
Published: (2024)
LaFA: Latent Feature Attacks on Non-negative Matrix Factorization
by: Vu, Minh, et al.
Published: (2024)
by: Vu, Minh, et al.
Published: (2024)
MTRE: Multi-Token Reliability Estimation for Hallucination Detection in VLMs
by: Zollicoffer, Geigh, et al.
Published: (2025)
by: Zollicoffer, Geigh, et al.
Published: (2025)
Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization
by: Bhatta, Kshitij, et al.
Published: (2024)
by: Bhatta, Kshitij, et al.
Published: (2024)
Towards Faster Matrix Diagonalization with Graph Isomorphism Networks and the AlphaZero Framework
by: Zollicoffer, Geigh, et al.
Published: (2024)
by: Zollicoffer, Geigh, et al.
Published: (2024)
Hybrid Neural World Models
by: Lakshmanan, Pranav, et al.
Published: (2026)
by: Lakshmanan, Pranav, et al.
Published: (2026)
Sanity Checks for Long-Form Hallucination Detection
by: Zollicoffer, Geigh, et al.
Published: (2026)
by: Zollicoffer, Geigh, et al.
Published: (2026)
Surprisal Driven $k$-NN for Robust and Interpretable Nonparametric Learning
by: Banerjee, Amartya, et al.
Published: (2023)
by: Banerjee, Amartya, et al.
Published: (2023)
External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling
by: Bhagat, Rishav, et al.
Published: (2024)
by: Bhagat, Rishav, et al.
Published: (2024)
The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling
by: Ma, Jiajun, et al.
Published: (2024)
by: Ma, Jiajun, et al.
Published: (2024)
EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages
by: Sharma, Aman, et al.
Published: (2026)
by: Sharma, Aman, et al.
Published: (2026)
Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models
by: Hu, Wentao, et al.
Published: (2025)
by: Hu, Wentao, et al.
Published: (2025)
Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMs
by: Ma, Xin, et al.
Published: (2024)
by: Ma, Xin, et al.
Published: (2024)
History Rhymes: Macro-Contextual Retrieval for Robust Financial Forecasting
by: Khanna, Sarthak, et al.
Published: (2025)
by: Khanna, Sarthak, et al.
Published: (2025)
On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models
by: Farhat, Sean, et al.
Published: (2024)
by: Farhat, Sean, et al.
Published: (2024)
Verification of the Implicit World Model in a Generative Model via Adversarial Sequences
by: Balogh, András, et al.
Published: (2026)
by: Balogh, András, et al.
Published: (2026)
Influence functions and regularity tangents for efficient active learning
by: Eaton, Frederik
Published: (2024)
by: Eaton, Frederik
Published: (2024)
Model Agreement via Anchoring
by: Eaton, Eric, et al.
Published: (2026)
by: Eaton, Eric, et al.
Published: (2026)
The Sequential Edge: Inverse-Entropy Voting Beats Parallel Self-Consistency at Matched Compute
by: Sharma, Aman, et al.
Published: (2025)
by: Sharma, Aman, et al.
Published: (2025)
Think Just Enough: Sequence-Level Entropy as a Confidence Signal for LLM Reasoning
by: Sharma, Aman, et al.
Published: (2025)
by: Sharma, Aman, et al.
Published: (2025)
Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts
by: Trehan, Dhruv, et al.
Published: (2026)
by: Trehan, Dhruv, et al.
Published: (2026)
Discovering Reinforcement Learning Interfaces with Large Language Models
by: Jaswal, Akshat Singh, et al.
Published: (2026)
by: Jaswal, Akshat Singh, et al.
Published: (2026)
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
by: Joo, Taejong, et al.
Published: (2026)
by: Joo, Taejong, et al.
Published: (2026)
Response Wide Shut? Surprising Observations in Basic Vision Language Model Capabilities
by: Chandhok, Shivam, et al.
Published: (2025)
by: Chandhok, Shivam, et al.
Published: (2025)
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
by: Hugessen, Adriana, et al.
Published: (2024)
by: Hugessen, Adriana, et al.
Published: (2024)
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
by: GX-Chen, Anthony, et al.
Published: (2024)
by: GX-Chen, Anthony, et al.
Published: (2024)
AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise
by: Agarwal, Dhruv, et al.
Published: (2025)
by: Agarwal, Dhruv, et al.
Published: (2025)
Building Interpretable Models for Moral Decision-Making
by: Goel, Mayank, et al.
Published: (2026)
by: Goel, Mayank, et al.
Published: (2026)
Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning
by: Liu, Huihan, et al.
Published: (2026)
by: Liu, Huihan, et al.
Published: (2026)
Learning Latent Dynamic Robust Representations for World Models
by: Sun, Ruixiang, et al.
Published: (2024)
by: Sun, Ruixiang, et al.
Published: (2024)
On the Surprising Effectiveness of Large Learning Rates under Standard Width Scaling
by: Haas, Moritz, et al.
Published: (2025)
by: Haas, Moritz, et al.
Published: (2025)
Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection
by: Özer, Kadir-Kaan, et al.
Published: (2026)
by: Özer, Kadir-Kaan, et al.
Published: (2026)
Using Analytics on Student Created Data to Content Validate Pedagogical Tools
by: Kos, John, et al.
Published: (2023)
by: Kos, John, et al.
Published: (2023)
A Temporally Augmented Graph Attention Network for Affordance Classification
by: Chopra, Ami, et al.
Published: (2026)
by: Chopra, Ami, et al.
Published: (2026)
Similar Items
-
Novelty Detection in Reinforcement Learning with World Models
by: Zollicoffer, Geigh, et al.
Published: (2023) -
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
by: Balloch, Jonathan C., et al.
Published: (2024) -
The Interpretability of Codebooks in Model-Based Reinforcement Learning is Limited
by: Eaton, Kenneth, et al.
Published: (2024) -
HalluField: Detecting LLM Hallucinations via Field-Theoretic Modeling
by: Vu, Minh, et al.
Published: (2025) -
View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
by: Chopra, Tanush, et al.
Published: (2024)