:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Rauba, Paulius, Wei, Qiyao, van der Schaar, Mihaela
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.07947
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)

Quantifying perturbation impacts for large language models
by: Rauba, Paulius, et al.
Published: (2024)

Redefining Digital Health Interfaces with Large Language Models
by: Imrie, Fergus, et al.
Published: (2023)

Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models
by: Rauba, Paulius, et al.
Published: (2025)

Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models
by: Rauba, Paulius, et al.
Published: (2024)

Tiny Autoregressive Recursive Models
by: Rauba, Paulius, et al.
Published: (2026)

Multi-Agent Systems Should be Treated as Principal-Agent Problems
by: Rauba, Paulius, et al.
Published: (2026)

No More, No Less: Least-Privilege Language Models
by: Rauba, Paulius, et al.
Published: (2026)

Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
by: Rauba, Paulius, et al.
Published: (2024)

Language Bottleneck Models for Qualitative Knowledge State Modeling
by: Berthon, Antonin, et al.
Published: (2025)

Cascaded Language Models for Cost-effective Human-AI Decision-Making
by: Fanconi, Claudio, et al.
Published: (2025)

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
by: Sun, Hao, et al.
Published: (2025)

Continuously Updating Digital Twins using Large Language Models
by: Amad, Harry, et al.
Published: (2025)

Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
by: Sun, Hao, et al.
Published: (2025)

The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-fidelity Data
by: Pouplin, Thomas, et al.
Published: (2024)

Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023)

Active Task Disambiguation with LLMs
by: Kobalczyk, Katarzyna, et al.
Published: (2025)

GameTalk: Training LLMs for Strategic Conversation
by: Vendrell, Victor Conchello, et al.
Published: (2026)

The AI Imperative: Scaling High-Quality Peer Review in Machine Learning
by: Wei, Qiyao, et al.
Published: (2025)

On Error Propagation of Diffusion Models
by: Li, Yangming, et al.
Published: (2023)

Semantic-KG: Using Knowledge Graphs to Construct Benchmarks for Measuring Semantic Similarity
by: Wei, Qiyao, et al.
Published: (2025)

Defining Expertise: Applications to Treatment Effect Estimation
by: Hüyük, Alihan, et al.
Published: (2024)

OpenReview Should be Protected and Leveraged as a Community Asset for Research in the Era of Large Language Models
by: Sun, Hao, et al.
Published: (2025)

L2MAC: Large Language Model Automatic Computer for Extensive Code Generation
by: Holt, Samuel, et al.
Published: (2023)

Delving into Multilingual Ethical Bias: The MSQAD with Statistical Hypothesis Tests for Large Language Models
by: Yu, Seunguk, et al.
Published: (2025)

Retrieval Augmented Thought Process for Private Data Handling in Healthcare
by: Pouplin, Thomas, et al.
Published: (2024)

Matchmaker: Self-Improving Large Language Model Programs for Schema Matching
by: Seedat, Nabeel, et al.
Published: (2024)

Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
by: Sun, Hao, et al.
Published: (2024)

Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models
by: Li, Yangming, et al.
Published: (2023)

Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search
by: Holt, Samuel, et al.
Published: (2025)

Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification
by: Zhang, Yichi, et al.
Published: (2026)

Distributionally Robust Reinforcement Learning with Human Feedback
by: Mandal, Debmalya, et al.
Published: (2025)

A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models
by: Cai, Yinpeng, et al.
Published: (2025)

DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
by: Seedat, Nabeel, et al.
Published: (2022)

Why Tabular Foundation Models Should Be a Research Priority
by: van Breugel, Boris, et al.
Published: (2024)

Hypothesis Testing Prompting Improves Deductive Reasoning in Large Language Models
by: Li, Yitian, et al.
Published: (2024)

Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models
by: Chiu, Christopher, et al.
Published: (2025)

Risk-Sensitive Diffusion: Robustly Optimizing Diffusion Models with Noisy Samples
by: Li, Yangming, et al.
Published: (2024)

The Cylindrical Representation Hypothesis for Language Model Steering
by: Gao, Lang, et al.
Published: (2026)

XRec: Large Language Models for Explainable Recommendation
by: Ma, Qiyao, et al.
Published: (2024)