Saved in:
| Main Authors: | Ezzerg, Abdelhamid, Bogunovic, Ilija, Knoblauch, Jeremias |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.15315 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sample-efficient Bayesian Optimisation Using Known Invariances
by: Brown, Theodore, et al.
Published: (2024)
by: Brown, Theodore, et al.
Published: (2024)
Mean-Field Bayesian Optimisation
by: Steinberg, Petar, et al.
Published: (2025)
by: Steinberg, Petar, et al.
Published: (2025)
REDUCR: Robust Data Downsampling Using Class Priority Reweighting
by: Bankes, William, et al.
Published: (2023)
by: Bankes, William, et al.
Published: (2023)
Adversarially Robust Decision Transformer
by: Tang, Xiaohang, et al.
Published: (2024)
by: Tang, Xiaohang, et al.
Published: (2024)
Robust and Conjugate Gaussian Process Regression
by: Altamirano, Matias, et al.
Published: (2023)
by: Altamirano, Matias, et al.
Published: (2023)
Near-Optimal Approximations for Bayesian Inference in Function Space
by: Wild, Veit, et al.
Published: (2025)
by: Wild, Veit, et al.
Published: (2025)
wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2025)
by: Tang, Xiaohang, et al.
Published: (2025)
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
by: Ramesh, Shyam Sundhar, et al.
Published: (2023)
by: Ramesh, Shyam Sundhar, et al.
Published: (2023)
Multi-Output Robust and Conjugate Gaussian Processes
by: Rooijakkers, Joshua, et al.
Published: (2025)
by: Rooijakkers, Joshua, et al.
Published: (2025)
Robust Multi-Objective Controlled Decoding of Large Language Models
by: Son, Seongho, et al.
Published: (2025)
by: Son, Seongho, et al.
Published: (2025)
Robust and Conjugate Spatio-Temporal Gaussian Processes
by: Laplante, William, et al.
Published: (2025)
by: Laplante, William, et al.
Published: (2025)
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
by: Güzel, Ahmet H., et al.
Published: (2025)
by: Güzel, Ahmet H., et al.
Published: (2025)
No-Regret Linear Bandits under Gap-Adjusted Misspecification
by: Liu, Chong, et al.
Published: (2025)
by: Liu, Chong, et al.
Published: (2025)
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
by: Son, Seongho, et al.
Published: (2024)
by: Son, Seongho, et al.
Published: (2024)
Group Robust Preference Optimization in Reward-free RLHF
by: Ramesh, Shyam Sundhar, et al.
Published: (2024)
by: Ramesh, Shyam Sundhar, et al.
Published: (2024)
RSPO: Regularized Self-Play Alignment of Large Language Models
by: Tang, Xiaohang, et al.
Published: (2025)
by: Tang, Xiaohang, et al.
Published: (2025)
Disentangling Safe and Unsafe Corruptions via Anisotropy and Locality
by: Muthukumar, Ramchandran, et al.
Published: (2025)
by: Muthukumar, Ramchandran, et al.
Published: (2025)
PROWL: Prioritized Regret-Driven Optimization for World Model Learning
by: Güzel, Ahmet H., et al.
Published: (2026)
by: Güzel, Ahmet H., et al.
Published: (2026)
LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs?
by: Ziomek, Juliusz, et al.
Published: (2026)
by: Ziomek, Juliusz, et al.
Published: (2026)
PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks
by: Feng, Chen, et al.
Published: (2024)
by: Feng, Chen, et al.
Published: (2024)
Distributionally Robust Optimisation with Bayesian Ambiguity Sets
by: Dellaporta, Charita, et al.
Published: (2024)
by: Dellaporta, Charita, et al.
Published: (2024)
Ensemble Distributionally Robust Bayesian Optimisation
by: Ramazyan, Tigran, et al.
Published: (2026)
by: Ramazyan, Tigran, et al.
Published: (2026)
Predictively Oriented Posteriors
by: McLatchie, Yann, et al.
Published: (2025)
by: McLatchie, Yann, et al.
Published: (2025)
On Almost Surely Safe Alignment of Large Language Models at Inference-Time
by: Ji, Xiaotong, et al.
Published: (2025)
by: Ji, Xiaotong, et al.
Published: (2025)
GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2026)
by: Tang, Xiaohang, et al.
Published: (2026)
TabMGP: Martingale Posterior with TabPFN
by: Ng, Kenyon, et al.
Published: (2025)
by: Ng, Kenyon, et al.
Published: (2025)
Contextual Causal Bayesian Optimisation
by: Arsenyan, Vahan, et al.
Published: (2023)
by: Arsenyan, Vahan, et al.
Published: (2023)
Cascading Bandits Robust to Adversarial Corruptions
by: Xie, Jize, et al.
Published: (2025)
by: Xie, Jize, et al.
Published: (2025)
On Corruption-Robustness in Performative Reinforcement Learning
by: Pollatos, Vasilis, et al.
Published: (2025)
by: Pollatos, Vasilis, et al.
Published: (2025)
Corruption-Robust Lipschitz Contextual Search
by: Zuo, Shiliang
Published: (2023)
by: Zuo, Shiliang
Published: (2023)
Graph Agnostic Causal Bayesian Optimisation
by: Mukherjee, Sumantrak, et al.
Published: (2024)
by: Mukherjee, Sumantrak, et al.
Published: (2024)
Imagined Autocurricula
by: Güzel, Ahmet H., et al.
Published: (2025)
by: Güzel, Ahmet H., et al.
Published: (2025)
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
by: Yang, Rui, et al.
Published: (2024)
by: Yang, Rui, et al.
Published: (2024)
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs
by: Wolf, Lorenz, et al.
Published: (2025)
by: Wolf, Lorenz, et al.
Published: (2025)
Sparse Offline Reinforcement Learning with Corruption Robustness
by: Tran, Nam Phuong, et al.
Published: (2025)
by: Tran, Nam Phuong, et al.
Published: (2025)
Certified Robustness against Sparse Adversarial Perturbations via Data Localization
by: Pal, Ambar, et al.
Published: (2024)
by: Pal, Ambar, et al.
Published: (2024)
Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness
by: Pal, Ambar, et al.
Published: (2023)
by: Pal, Ambar, et al.
Published: (2023)
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
by: Ramesh, Shyam Sundhar, et al.
Published: (2026)
by: Ramesh, Shyam Sundhar, et al.
Published: (2026)
Sample Efficient Preference Alignment in LLMs via Active Exploration
by: Mehta, Viraj, et al.
Published: (2023)
by: Mehta, Viraj, et al.
Published: (2023)
Bayesian Optimistic Optimisation with Exponentially Decaying Regret
by: Tran-The, Hung, et al.
Published: (2021)
by: Tran-The, Hung, et al.
Published: (2021)
Similar Items
-
Sample-efficient Bayesian Optimisation Using Known Invariances
by: Brown, Theodore, et al.
Published: (2024) -
Mean-Field Bayesian Optimisation
by: Steinberg, Petar, et al.
Published: (2025) -
REDUCR: Robust Data Downsampling Using Class Priority Reweighting
by: Bankes, William, et al.
Published: (2023) -
Adversarially Robust Decision Transformer
by: Tang, Xiaohang, et al.
Published: (2024) -
Robust and Conjugate Gaussian Process Regression
by: Altamirano, Matias, et al.
Published: (2023)