Saved in:
| Main Authors: | Canavan, Callum, Shrivastava, Aditya, Qi, Allison, Michala, Jonathan, Roger, Fabien |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.20400 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Stabilising Explainability Fragility in Cybersecurity AI: The Impact and Mitigation of Multicollinearity in Public Benchmark Datasets
by: Vourganas, Ioannis J., et al.
Published: (2026)
by: Vourganas, Ioannis J., et al.
Published: (2026)
The Elicitation Game: Evaluating Capability Elicitation Techniques
by: Hofstätter, Felix, et al.
Published: (2025)
by: Hofstätter, Felix, et al.
Published: (2025)
Reasoning Elicitation in Language Models via Counterfactual Feedback
by: Hüyük, Alihan, et al.
Published: (2024)
by: Hüyük, Alihan, et al.
Published: (2024)
Generative Kaleidoscopic Networks
by: Shrivastava, Harsh
Published: (2024)
by: Shrivastava, Harsh
Published: (2024)
A Concrete Roadmap towards Safety Cases based on Chain-of-Thought Monitoring
by: Schulz, Julian
Published: (2025)
by: Schulz, Julian
Published: (2025)
Excess Description Length of Learning Generalizable Predictors
by: Donoway, Elizabeth, et al.
Published: (2026)
by: Donoway, Elizabeth, et al.
Published: (2026)
Self-Attribution Bias: When AI Monitors Go Easy on Themselves
by: Khullar, Dipika, et al.
Published: (2026)
by: Khullar, Dipika, et al.
Published: (2026)
Knowledge Propagation over Conditional Independence Graphs
by: Chajewska, Urszula, et al.
Published: (2023)
by: Chajewska, Urszula, et al.
Published: (2023)
Federated Learning with Neural Graphical Models
by: Chajewska, Urszula, et al.
Published: (2023)
by: Chajewska, Urszula, et al.
Published: (2023)
All Code, No Thought: Current Language Models Struggle to Reason in Ciphered Language
by: Guo, Shiyuan, et al.
Published: (2025)
by: Guo, Shiyuan, et al.
Published: (2025)
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction
by: Shrivastava, Gaurav, et al.
Published: (2024)
by: Shrivastava, Gaurav, et al.
Published: (2024)
Causal Preference Elicitation
by: Bonilla, Edwin V., et al.
Published: (2026)
by: Bonilla, Edwin V., et al.
Published: (2026)
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
by: Hugessen, Adriana, et al.
Published: (2024)
by: Hugessen, Adriana, et al.
Published: (2024)
Preference Elicitation for Offline Reinforcement Learning
by: Pace, Alizée, et al.
Published: (2024)
by: Pace, Alizée, et al.
Published: (2024)
Personalized Algorithmic Recourse with Preference Elicitation
by: De Toni, Giovanni, et al.
Published: (2022)
by: De Toni, Giovanni, et al.
Published: (2022)
Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval
by: Yang, Zeyu, et al.
Published: (2026)
by: Yang, Zeyu, et al.
Published: (2026)
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation
by: Formanek, Claude, et al.
Published: (2024)
by: Formanek, Claude, et al.
Published: (2024)
Uncertainty in Action: Confidence Elicitation in Embodied Agents
by: Yu, Tianjiao, et al.
Published: (2025)
by: Yu, Tianjiao, et al.
Published: (2025)
Leveraging Machine Learning for Early Autism Detection via INDT-ASD Indian Database
by: Shrivastava, Trapti, et al.
Published: (2024)
by: Shrivastava, Trapti, et al.
Published: (2024)
Alignment and Safety in Large Language Models: Safety Mechanisms, Training Paradigms, and Emerging Challenges
by: Lu, Haoran, et al.
Published: (2025)
by: Lu, Haoran, et al.
Published: (2025)
Inference Time Context Sparsity: Illusion or Opportunity?
by: Joshi, Sahil, et al.
Published: (2026)
by: Joshi, Sahil, et al.
Published: (2026)
Eliciting Numerical Predictive Distributions of LLMs Without Autoregression
by: Piskorz, Julianna, et al.
Published: (2026)
by: Piskorz, Julianna, et al.
Published: (2026)
Equitable Evaluation via Elicitation
by: Du, Elbert, et al.
Published: (2026)
by: Du, Elbert, et al.
Published: (2026)
Language models are better than humans at next-token prediction
by: Shlegeris, Buck, et al.
Published: (2022)
by: Shlegeris, Buck, et al.
Published: (2022)
Embedding by Elicitation: Dynamic Representations for Bayesian Optimization of System Prompts
by: Lin, Zhiyuan Jerry, et al.
Published: (2026)
by: Lin, Zhiyuan Jerry, et al.
Published: (2026)
Lookahead Unmasking Elicits Accurate Decoding in Diffusion Language Models
by: Lee, Sanghyun, et al.
Published: (2025)
by: Lee, Sanghyun, et al.
Published: (2025)
Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling?
by: Shaposhnyk, Olha, et al.
Published: (2025)
by: Shaposhnyk, Olha, et al.
Published: (2025)
ElicitationGPT: Text Elicitation Mechanisms via Language Models
by: Wu, Yifan, et al.
Published: (2024)
by: Wu, Yifan, et al.
Published: (2024)
Beyond Johnson-Lindenstrauss: Uniform Bounds for Sketched Bilinear Forms
by: Deb, Rohan, et al.
Published: (2025)
by: Deb, Rohan, et al.
Published: (2025)
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
by: Zhang, Shenao, et al.
Published: (2024)
by: Zhang, Shenao, et al.
Published: (2024)
Synthetic Error Injection Fails to Elicit Self-Correction In Language Models
by: Wu, David X., et al.
Published: (2025)
by: Wu, David X., et al.
Published: (2025)
Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques
by: Sharma, Asankhaya
Published: (2025)
by: Sharma, Asankhaya
Published: (2025)
Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation Models
by: Wen, Yeming, et al.
Published: (2024)
by: Wen, Yeming, et al.
Published: (2024)
GLADMamba: Unsupervised Graph-Level Anomaly Detection Powered by Selective State Space Model
by: Fu, Yali, et al.
Published: (2025)
by: Fu, Yali, et al.
Published: (2025)
CVTGAD: Simplified Transformer with Cross-View Attention for Unsupervised Graph-level Anomaly Detection
by: Li, Jindong, et al.
Published: (2024)
by: Li, Jindong, et al.
Published: (2024)
Distillation of Large Language Models via Concrete Score Matching
by: Kim, Yeongmin, et al.
Published: (2025)
by: Kim, Yeongmin, et al.
Published: (2025)
TPTT: Transforming Pretrained Transformers into Titans
by: Furfaro, Fabien
Published: (2025)
by: Furfaro, Fabien
Published: (2025)
IncidentNet: Traffic Incident Detection, Localization and Severity Estimation with Sparse Sensing
by: Peddiraju, Sai Shashank, et al.
Published: (2024)
by: Peddiraju, Sai Shashank, et al.
Published: (2024)
A Simple "Try Again" Can Elicit Multi-Turn LLM Reasoning
by: Liu, Licheng, et al.
Published: (2025)
by: Liu, Licheng, et al.
Published: (2025)
Unsupervised Cognition
by: Ibias, Alfredo, et al.
Published: (2024)
by: Ibias, Alfredo, et al.
Published: (2024)
Similar Items
-
Stabilising Explainability Fragility in Cybersecurity AI: The Impact and Mitigation of Multicollinearity in Public Benchmark Datasets
by: Vourganas, Ioannis J., et al.
Published: (2026) -
The Elicitation Game: Evaluating Capability Elicitation Techniques
by: Hofstätter, Felix, et al.
Published: (2025) -
Reasoning Elicitation in Language Models via Counterfactual Feedback
by: Hüyük, Alihan, et al.
Published: (2024) -
Generative Kaleidoscopic Networks
by: Shrivastava, Harsh
Published: (2024) -
A Concrete Roadmap towards Safety Cases based on Chain-of-Thought Monitoring
by: Schulz, Julian
Published: (2025)