:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Clark, Michael J.
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2601.07473
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Training LLMs for Honesty via Confessions
by: Joglekar, Manas, et al.
Published: (2025)

QiMeng-MuPa: Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
by: Ke, Changxin, et al.
Published: (2025)

AdaPaD: Adaptive Parallel Deflation for PEFT with Self-Correcting Rank Discovery
by: Su, Barbara, et al.
Published: (2026)

Interpretable Self-Supervised Learning via Representer Landmarks and Nyström Approximation
by: Zarvandi, Maedeh, et al.
Published: (2025)

E3STO: Orbital Inspired SE(3)-Equivariant Molecular Representation for Electron Density Prediction
by: Mitnikov, Ilan, et al.
Published: (2024)

Honesty in Causal Forests: When It Helps and When It Hurts
by: Hou, Yanfang, et al.
Published: (2025)

PaSE: Parallelization Strategies for Efficient DNN Training
by: Elango, Venmugil
Published: (2024)

Measure-Theoretic Anti-Causal Representation Learning
by: Behnam, Arman, et al.
Published: (2025)

PaPaformer: Language Model from Pre-trained Parallel Paths
by: Tapaninaho, Joonas, et al.
Published: (2025)

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
by: Hu, Jingcheng, et al.
Published: (2026)

TARDIS: Mitigating Temporal Misalignment via Representation Steering
by: Shin, Changho, et al.
Published: (2025)

Self-Supervised Graph Representation Learning via Global Context Prediction
by: Peng, Zhen, et al.
Published: (2020)

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
by: Shen, Guobin, et al.
Published: (2026)

Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation
by: Mohamadi, Mohamad Amin, et al.
Published: (2025)

STO-RL: Offline RL under Sparse Rewards via LLM-Guided Subgoal Temporal Order
by: Gu, Chengyang, et al.
Published: (2026)

Unlearners Can Lie: Evaluating and Improving Honesty in LLM Unlearning
by: Gu, Renjie, et al.
Published: (2026)

Preference Learning with Lie Detectors can Induce Honesty or Evasion
by: Cundy, Chris, et al.
Published: (2025)

Self-Supervised Dynamical System Representations for Physiological Time-Series
by: Chen, Yenho, et al.
Published: (2025)

Incentivizing Honesty among Competitors in Collaborative Learning and Optimization
by: Dorner, Florian E., et al.
Published: (2023)

Gaussian Joint Embeddings For Self-Supervised Representation Learning
by: Huang, Yongchao
Published: (2026)

Gaussian Joint Embeddings For Self-Supervised Representation Learning
by: Huang, Yongchao
Published: (2026)

Self-Supervised Representation Learning as Mutual Information Maximization
by: Sabby, Akhlaqur Rahman, et al.
Published: (2025)

On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning
by: Wang, Bokun, et al.
Published: (2024)

The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
by: Taufeeque, Mohammad, et al.
Published: (2026)

Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression
by: Zhai, Runtian, et al.
Published: (2023)

PaECTER: Patent-level Representation Learning using Citation-informed Transformers
by: Ghosh, Mainak, et al.
Published: (2024)

Think Before You Lie: How Reasoning Leads to Honesty
by: Yuan, Ann, et al.
Published: (2026)

Universal Spectral Tokenization via Self-Supervised Panchromatic Representation Learning
by: Shen, Jeff, et al.
Published: (2025)

HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs
by: Kim, Sunwoo, et al.
Published: (2024)

Anti Mode-Collapse in Mean-Field Transformer via Auxiliary Variables
by: Imaizumi, Masaaki, et al.
Published: (2026)

PaAno: Patch-Based Representation Learning for Time-Series Anomaly Detection
by: Park, Jinju, et al.
Published: (2026)

Concept Heterogeneity-aware Representation Steering
by: Abdullaev, Laziz U., et al.
Published: (2026)

Understanding Representation Learnability of Nonlinear Self-Supervised Learning
by: Yang, Ruofeng, et al.
Published: (2024)

The Impact of Semantic Pairs on Self-Supervised Representation Learning
by: Alkhalefi, Mohammad, et al.
Published: (2025)

Quantifying Representation Reliability in Self-Supervised Learning Models
by: Park, Young-Jin, et al.
Published: (2023)

Data-Driven Self-Supervised Graph Representation Learning
by: Samy, Ahmed E., et al.
Published: (2024)

On Linear Separation Capacity of Self-Supervised Representation Learning
by: Wang, Shulei
Published: (2023)

Brep2Shape: Boundary and Shape Representation Alignment via Self-Supervised Transformers
by: Sun, Yuanxu, et al.
Published: (2026)

The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
by: Ren, Richard, et al.
Published: (2025)

PaPaGei: Open Foundation Models for Optical Physiological Signals
by: Pillai, Arvind, et al.
Published: (2024)