Saved in:
| Main Authors: | Rauba, Paulius, Wei, Qiyao, van der Schaar, Mihaela |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.07947 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)
by: Rauba, Paulius, et al.
Published: (2025)
Quantifying perturbation impacts for large language models
by: Rauba, Paulius, et al.
Published: (2024)
by: Rauba, Paulius, et al.
Published: (2024)
Redefining Digital Health Interfaces with Large Language Models
by: Imrie, Fergus, et al.
Published: (2023)
by: Imrie, Fergus, et al.
Published: (2023)
Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models
by: Rauba, Paulius, et al.
Published: (2025)
by: Rauba, Paulius, et al.
Published: (2025)
Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models
by: Rauba, Paulius, et al.
Published: (2024)
by: Rauba, Paulius, et al.
Published: (2024)
Tiny Autoregressive Recursive Models
by: Rauba, Paulius, et al.
Published: (2026)
by: Rauba, Paulius, et al.
Published: (2026)
Multi-Agent Systems Should be Treated as Principal-Agent Problems
by: Rauba, Paulius, et al.
Published: (2026)
by: Rauba, Paulius, et al.
Published: (2026)
No More, No Less: Least-Privilege Language Models
by: Rauba, Paulius, et al.
Published: (2026)
by: Rauba, Paulius, et al.
Published: (2026)
Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
by: Rauba, Paulius, et al.
Published: (2024)
by: Rauba, Paulius, et al.
Published: (2024)
Language Bottleneck Models for Qualitative Knowledge State Modeling
by: Berthon, Antonin, et al.
Published: (2025)
by: Berthon, Antonin, et al.
Published: (2025)
Cascaded Language Models for Cost-effective Human-AI Decision-Making
by: Fanconi, Claudio, et al.
Published: (2025)
by: Fanconi, Claudio, et al.
Published: (2025)
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
by: Sun, Hao, et al.
Published: (2025)
by: Sun, Hao, et al.
Published: (2025)
Continuously Updating Digital Twins using Large Language Models
by: Amad, Harry, et al.
Published: (2025)
by: Amad, Harry, et al.
Published: (2025)
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
by: Sun, Hao, et al.
Published: (2025)
by: Sun, Hao, et al.
Published: (2025)
The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-fidelity Data
by: Pouplin, Thomas, et al.
Published: (2024)
by: Pouplin, Thomas, et al.
Published: (2024)
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
Active Task Disambiguation with LLMs
by: Kobalczyk, Katarzyna, et al.
Published: (2025)
by: Kobalczyk, Katarzyna, et al.
Published: (2025)
GameTalk: Training LLMs for Strategic Conversation
by: Vendrell, Victor Conchello, et al.
Published: (2026)
by: Vendrell, Victor Conchello, et al.
Published: (2026)
The AI Imperative: Scaling High-Quality Peer Review in Machine Learning
by: Wei, Qiyao, et al.
Published: (2025)
by: Wei, Qiyao, et al.
Published: (2025)
On Error Propagation of Diffusion Models
by: Li, Yangming, et al.
Published: (2023)
by: Li, Yangming, et al.
Published: (2023)
Semantic-KG: Using Knowledge Graphs to Construct Benchmarks for Measuring Semantic Similarity
by: Wei, Qiyao, et al.
Published: (2025)
by: Wei, Qiyao, et al.
Published: (2025)
Defining Expertise: Applications to Treatment Effect Estimation
by: Hüyük, Alihan, et al.
Published: (2024)
by: Hüyük, Alihan, et al.
Published: (2024)
OpenReview Should be Protected and Leveraged as a Community Asset for Research in the Era of Large Language Models
by: Sun, Hao, et al.
Published: (2025)
by: Sun, Hao, et al.
Published: (2025)
L2MAC: Large Language Model Automatic Computer for Extensive Code Generation
by: Holt, Samuel, et al.
Published: (2023)
by: Holt, Samuel, et al.
Published: (2023)
Delving into Multilingual Ethical Bias: The MSQAD with Statistical Hypothesis Tests for Large Language Models
by: Yu, Seunguk, et al.
Published: (2025)
by: Yu, Seunguk, et al.
Published: (2025)
Retrieval Augmented Thought Process for Private Data Handling in Healthcare
by: Pouplin, Thomas, et al.
Published: (2024)
by: Pouplin, Thomas, et al.
Published: (2024)
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching
by: Seedat, Nabeel, et al.
Published: (2024)
by: Seedat, Nabeel, et al.
Published: (2024)
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
by: Sun, Hao, et al.
Published: (2024)
by: Sun, Hao, et al.
Published: (2024)
Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models
by: Li, Yangming, et al.
Published: (2023)
by: Li, Yangming, et al.
Published: (2023)
Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search
by: Holt, Samuel, et al.
Published: (2025)
by: Holt, Samuel, et al.
Published: (2025)
Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification
by: Zhang, Yichi, et al.
Published: (2026)
by: Zhang, Yichi, et al.
Published: (2026)
Distributionally Robust Reinforcement Learning with Human Feedback
by: Mandal, Debmalya, et al.
Published: (2025)
by: Mandal, Debmalya, et al.
Published: (2025)
A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models
by: Cai, Yinpeng, et al.
Published: (2025)
by: Cai, Yinpeng, et al.
Published: (2025)
DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
by: Seedat, Nabeel, et al.
Published: (2022)
by: Seedat, Nabeel, et al.
Published: (2022)
Why Tabular Foundation Models Should Be a Research Priority
by: van Breugel, Boris, et al.
Published: (2024)
by: van Breugel, Boris, et al.
Published: (2024)
Hypothesis Testing Prompting Improves Deductive Reasoning in Large Language Models
by: Li, Yitian, et al.
Published: (2024)
by: Li, Yitian, et al.
Published: (2024)
Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models
by: Chiu, Christopher, et al.
Published: (2025)
by: Chiu, Christopher, et al.
Published: (2025)
Risk-Sensitive Diffusion: Robustly Optimizing Diffusion Models with Noisy Samples
by: Li, Yangming, et al.
Published: (2024)
by: Li, Yangming, et al.
Published: (2024)
The Cylindrical Representation Hypothesis for Language Model Steering
by: Gao, Lang, et al.
Published: (2026)
by: Gao, Lang, et al.
Published: (2026)
XRec: Large Language Models for Explainable Recommendation
by: Ma, Qiyao, et al.
Published: (2024)
by: Ma, Qiyao, et al.
Published: (2024)
Similar Items
-
Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025) -
Quantifying perturbation impacts for large language models
by: Rauba, Paulius, et al.
Published: (2024) -
Redefining Digital Health Interfaces with Large Language Models
by: Imrie, Fergus, et al.
Published: (2023) -
Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models
by: Rauba, Paulius, et al.
Published: (2025) -
Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models
by: Rauba, Paulius, et al.
Published: (2024)