Saved in:
| Main Author: | Clark, Michael J. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.07473 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Training LLMs for Honesty via Confessions
by: Joglekar, Manas, et al.
Published: (2025)
by: Joglekar, Manas, et al.
Published: (2025)
QiMeng-MuPa: Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
by: Ke, Changxin, et al.
Published: (2025)
by: Ke, Changxin, et al.
Published: (2025)
AdaPaD: Adaptive Parallel Deflation for PEFT with Self-Correcting Rank Discovery
by: Su, Barbara, et al.
Published: (2026)
by: Su, Barbara, et al.
Published: (2026)
Interpretable Self-Supervised Learning via Representer Landmarks and Nyström Approximation
by: Zarvandi, Maedeh, et al.
Published: (2025)
by: Zarvandi, Maedeh, et al.
Published: (2025)
E3STO: Orbital Inspired SE(3)-Equivariant Molecular Representation for Electron Density Prediction
by: Mitnikov, Ilan, et al.
Published: (2024)
by: Mitnikov, Ilan, et al.
Published: (2024)
Honesty in Causal Forests: When It Helps and When It Hurts
by: Hou, Yanfang, et al.
Published: (2025)
by: Hou, Yanfang, et al.
Published: (2025)
PaSE: Parallelization Strategies for Efficient DNN Training
by: Elango, Venmugil
Published: (2024)
by: Elango, Venmugil
Published: (2024)
Measure-Theoretic Anti-Causal Representation Learning
by: Behnam, Arman, et al.
Published: (2025)
by: Behnam, Arman, et al.
Published: (2025)
PaPaformer: Language Model from Pre-trained Parallel Paths
by: Tapaninaho, Joonas, et al.
Published: (2025)
by: Tapaninaho, Joonas, et al.
Published: (2025)
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
by: Hu, Jingcheng, et al.
Published: (2026)
by: Hu, Jingcheng, et al.
Published: (2026)
TARDIS: Mitigating Temporal Misalignment via Representation Steering
by: Shin, Changho, et al.
Published: (2025)
by: Shin, Changho, et al.
Published: (2025)
Self-Supervised Graph Representation Learning via Global Context Prediction
by: Peng, Zhen, et al.
Published: (2020)
by: Peng, Zhen, et al.
Published: (2020)
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
by: Shen, Guobin, et al.
Published: (2026)
by: Shen, Guobin, et al.
Published: (2026)
Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation
by: Mohamadi, Mohamad Amin, et al.
Published: (2025)
by: Mohamadi, Mohamad Amin, et al.
Published: (2025)
STO-RL: Offline RL under Sparse Rewards via LLM-Guided Subgoal Temporal Order
by: Gu, Chengyang, et al.
Published: (2026)
by: Gu, Chengyang, et al.
Published: (2026)
Unlearners Can Lie: Evaluating and Improving Honesty in LLM Unlearning
by: Gu, Renjie, et al.
Published: (2026)
by: Gu, Renjie, et al.
Published: (2026)
Preference Learning with Lie Detectors can Induce Honesty or Evasion
by: Cundy, Chris, et al.
Published: (2025)
by: Cundy, Chris, et al.
Published: (2025)
Self-Supervised Dynamical System Representations for Physiological Time-Series
by: Chen, Yenho, et al.
Published: (2025)
by: Chen, Yenho, et al.
Published: (2025)
Incentivizing Honesty among Competitors in Collaborative Learning and Optimization
by: Dorner, Florian E., et al.
Published: (2023)
by: Dorner, Florian E., et al.
Published: (2023)
Gaussian Joint Embeddings For Self-Supervised Representation Learning
by: Huang, Yongchao
Published: (2026)
by: Huang, Yongchao
Published: (2026)
Gaussian Joint Embeddings For Self-Supervised Representation Learning
by: Huang, Yongchao
Published: (2026)
by: Huang, Yongchao
Published: (2026)
Self-Supervised Representation Learning as Mutual Information Maximization
by: Sabby, Akhlaqur Rahman, et al.
Published: (2025)
by: Sabby, Akhlaqur Rahman, et al.
Published: (2025)
On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning
by: Wang, Bokun, et al.
Published: (2024)
by: Wang, Bokun, et al.
Published: (2024)
The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
by: Taufeeque, Mohammad, et al.
Published: (2026)
by: Taufeeque, Mohammad, et al.
Published: (2026)
Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression
by: Zhai, Runtian, et al.
Published: (2023)
by: Zhai, Runtian, et al.
Published: (2023)
PaECTER: Patent-level Representation Learning using Citation-informed Transformers
by: Ghosh, Mainak, et al.
Published: (2024)
by: Ghosh, Mainak, et al.
Published: (2024)
Think Before You Lie: How Reasoning Leads to Honesty
by: Yuan, Ann, et al.
Published: (2026)
by: Yuan, Ann, et al.
Published: (2026)
Universal Spectral Tokenization via Self-Supervised Panchromatic Representation Learning
by: Shen, Jeff, et al.
Published: (2025)
by: Shen, Jeff, et al.
Published: (2025)
HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs
by: Kim, Sunwoo, et al.
Published: (2024)
by: Kim, Sunwoo, et al.
Published: (2024)
Anti Mode-Collapse in Mean-Field Transformer via Auxiliary Variables
by: Imaizumi, Masaaki, et al.
Published: (2026)
by: Imaizumi, Masaaki, et al.
Published: (2026)
PaAno: Patch-Based Representation Learning for Time-Series Anomaly Detection
by: Park, Jinju, et al.
Published: (2026)
by: Park, Jinju, et al.
Published: (2026)
Concept Heterogeneity-aware Representation Steering
by: Abdullaev, Laziz U., et al.
Published: (2026)
by: Abdullaev, Laziz U., et al.
Published: (2026)
Understanding Representation Learnability of Nonlinear Self-Supervised Learning
by: Yang, Ruofeng, et al.
Published: (2024)
by: Yang, Ruofeng, et al.
Published: (2024)
The Impact of Semantic Pairs on Self-Supervised Representation Learning
by: Alkhalefi, Mohammad, et al.
Published: (2025)
by: Alkhalefi, Mohammad, et al.
Published: (2025)
Quantifying Representation Reliability in Self-Supervised Learning Models
by: Park, Young-Jin, et al.
Published: (2023)
by: Park, Young-Jin, et al.
Published: (2023)
Data-Driven Self-Supervised Graph Representation Learning
by: Samy, Ahmed E., et al.
Published: (2024)
by: Samy, Ahmed E., et al.
Published: (2024)
On Linear Separation Capacity of Self-Supervised Representation Learning
by: Wang, Shulei
Published: (2023)
by: Wang, Shulei
Published: (2023)
Brep2Shape: Boundary and Shape Representation Alignment via Self-Supervised Transformers
by: Sun, Yuanxu, et al.
Published: (2026)
by: Sun, Yuanxu, et al.
Published: (2026)
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
by: Ren, Richard, et al.
Published: (2025)
by: Ren, Richard, et al.
Published: (2025)
PaPaGei: Open Foundation Models for Optical Physiological Signals
by: Pillai, Arvind, et al.
Published: (2024)
by: Pillai, Arvind, et al.
Published: (2024)
Similar Items
-
Training LLMs for Honesty via Confessions
by: Joglekar, Manas, et al.
Published: (2025) -
QiMeng-MuPa: Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
by: Ke, Changxin, et al.
Published: (2025) -
AdaPaD: Adaptive Parallel Deflation for PEFT with Self-Correcting Rank Discovery
by: Su, Barbara, et al.
Published: (2026) -
Interpretable Self-Supervised Learning via Representer Landmarks and Nyström Approximation
by: Zarvandi, Maedeh, et al.
Published: (2025) -
E3STO: Orbital Inspired SE(3)-Equivariant Molecular Representation for Electron Density Prediction
by: Mitnikov, Ilan, et al.
Published: (2024)