:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ezzerg, Abdelhamid, Bogunovic, Ilija, Knoblauch, Jeremias
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2511.15315
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Sample-efficient Bayesian Optimisation Using Known Invariances
by: Brown, Theodore, et al.
Published: (2024)

Mean-Field Bayesian Optimisation
by: Steinberg, Petar, et al.
Published: (2025)

REDUCR: Robust Data Downsampling Using Class Priority Reweighting
by: Bankes, William, et al.
Published: (2023)

Adversarially Robust Decision Transformer
by: Tang, Xiaohang, et al.
Published: (2024)

Robust and Conjugate Gaussian Process Regression
by: Altamirano, Matias, et al.
Published: (2023)

Near-Optimal Approximations for Bayesian Inference in Function Space
by: Wild, Veit, et al.
Published: (2025)

wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2025)

Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
by: Ramesh, Shyam Sundhar, et al.
Published: (2023)

Multi-Output Robust and Conjugate Gaussian Processes
by: Rooijakkers, Joshua, et al.
Published: (2025)

Robust Multi-Objective Controlled Decoding of Large Language Models
by: Son, Seongho, et al.
Published: (2025)

Robust and Conjugate Spatio-Temporal Gaussian Processes
by: Laplante, William, et al.
Published: (2025)

Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
by: Güzel, Ahmet H., et al.
Published: (2025)

No-Regret Linear Bandits under Gap-Adjusted Misspecification
by: Liu, Chong, et al.
Published: (2025)

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
by: Son, Seongho, et al.
Published: (2024)

Group Robust Preference Optimization in Reward-free RLHF
by: Ramesh, Shyam Sundhar, et al.
Published: (2024)

RSPO: Regularized Self-Play Alignment of Large Language Models
by: Tang, Xiaohang, et al.
Published: (2025)

Disentangling Safe and Unsafe Corruptions via Anisotropy and Locality
by: Muthukumar, Ramchandran, et al.
Published: (2025)

PROWL: Prioritized Regret-Driven Optimization for World Model Learning
by: Güzel, Ahmet H., et al.
Published: (2026)

LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs?
by: Ziomek, Juliusz, et al.
Published: (2026)

PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks
by: Feng, Chen, et al.
Published: (2024)

Distributionally Robust Optimisation with Bayesian Ambiguity Sets
by: Dellaporta, Charita, et al.
Published: (2024)

Ensemble Distributionally Robust Bayesian Optimisation
by: Ramazyan, Tigran, et al.
Published: (2026)

Predictively Oriented Posteriors
by: McLatchie, Yann, et al.
Published: (2025)

On Almost Surely Safe Alignment of Large Language Models at Inference-Time
by: Ji, Xiaotong, et al.
Published: (2025)

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models
by: Tang, Xiaohang, et al.
Published: (2026)

TabMGP: Martingale Posterior with TabPFN
by: Ng, Kenyon, et al.
Published: (2025)

Contextual Causal Bayesian Optimisation
by: Arsenyan, Vahan, et al.
Published: (2023)

Cascading Bandits Robust to Adversarial Corruptions
by: Xie, Jize, et al.
Published: (2025)

On Corruption-Robustness in Performative Reinforcement Learning
by: Pollatos, Vasilis, et al.
Published: (2025)

Corruption-Robust Lipschitz Contextual Search
by: Zuo, Shiliang
Published: (2023)

Graph Agnostic Causal Bayesian Optimisation
by: Mukherjee, Sumantrak, et al.
Published: (2024)

Imagined Autocurricula
by: Güzel, Ahmet H., et al.
Published: (2025)

Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
by: Yang, Rui, et al.
Published: (2024)

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs
by: Wolf, Lorenz, et al.
Published: (2025)

Sparse Offline Reinforcement Learning with Corruption Robustness
by: Tran, Nam Phuong, et al.
Published: (2025)

Certified Robustness against Sparse Adversarial Perturbations via Data Localization
by: Pal, Ambar, et al.
Published: (2024)

Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness
by: Pal, Ambar, et al.
Published: (2023)

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
by: Ramesh, Shyam Sundhar, et al.
Published: (2026)

Sample Efficient Preference Alignment in LLMs via Active Exploration
by: Mehta, Viraj, et al.
Published: (2023)

Bayesian Optimistic Optimisation with Exponentially Decaying Regret
by: Tran-The, Hung, et al.
Published: (2021)