:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Nkhumise, Reabetswe M., Basu, Debabrota, Prescott, Tony J., Gilra, Aditya
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.09113
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Issues with Measuring Task Complexity via Random Policies in Robotic Tasks
by: Nkhumise, Reabetswe M., et al.
Published: (2026)

Lagrangian-based Equilibrium Propagation: generalisation to arbitrary boundary conditions & equivalence with Hamiltonian Echo Learning
by: Pourcel, Guillaume, et al.
Published: (2025)

Dynamical-VAE-based Hindsight to Learn the Causal Dynamics of Factored-POMDPs
by: Han, Chao, et al.
Published: (2024)

Isoperimetry is All We Need: Langevin Posterior Sampling for RL with Sublinear Regret
by: Jorge, Emilio, et al.
Published: (2024)

Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints
by: Das, Udvas, et al.
Published: (2024)

Preference-based Pure Exploration
by: Shukla, Apurv, et al.
Published: (2024)

Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback
by: Della Vecchia, Riccardo, et al.
Published: (2023)

Some Targets Are Harder to Identify than Others: Quantifying the Target-dependent Membership Leakage
by: Azize, Achraf, et al.
Published: (2024)

Auditing Fairness under Model Updates: Fundamental Complexity and Property-Preserving Updates
by: Ajarra, Ayoub, et al.
Published: (2026)

Performative Policy Gradient: Optimality in Performative Reinforcement Learning
by: Basu, Debabrota, et al.
Published: (2025)

Sublinear Algorithms for Wasserstein and Total Variation Distances: Applications to Fairness and Privacy Auditing
by: Basu, Debabrota, et al.
Published: (2025)

Concentrated Differential Privacy for Bandits
by: Azize, Achraf, et al.
Published: (2023)

FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits
by: Chakraborty, Sunrit, et al.
Published: (2024)

Active Fourier Auditor for Estimating Distributional Properties of ML Models
by: Ajarra, Ayoub, et al.
Published: (2024)

Sequential Membership Inference Attacks
by: Michel, Thomas, et al.
Published: (2026)

DP-SPRT: Differentially Private Sequential Probability Ratio Tests
by: Michel, Thomas, et al.
Published: (2025)

Asymptotically Optimal Sequential Testing with Markovian Data
by: Sethi, Alhad, et al.
Published: (2026)

FraPPE: Fast and Efficient Preference-based Pure Exploration
by: Das, Udvas, et al.
Published: (2025)

When Witnesses Defend: A Witness Graph Topological Layer for Adversarial Graph Learning
by: Arafat, Naheed Anjum, et al.
Published: (2024)

Augmented Bayesian Policy Search
by: Kallel, Mahdi, et al.
Published: (2024)

Pure Exploration in Bandits with Linear Constraints
by: Carlsson, Emil, et al.
Published: (2023)

Optimal Regret of Bernoulli Bandits under Global Differential Privacy
by: Azize, Achraf, et al.
Published: (2025)

Differentially Private Best-Arm Identification
by: Azize, Achraf, et al.
Published: (2024)

Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning
by: Han, Chao, et al.
Published: (2026)

DOT: Dynamic Knob Selection and Online Sampling for Automated Database Tuning
by: Wang, Yifan, et al.
Published: (2026)

Offline RL via Feature-Occupancy Gradient Ascent
by: Neu, Gergely, et al.
Published: (2024)

Dimension Agnostic Testing of Survey Data Credibility through the Lens of Regression
by: Basu, Debabrota, et al.
Published: (2025)

Synthesis and Analysis of Data as Probability Measures with Entropy-Regularized Optimal Transport
by: Mallery, Brendan, et al.
Published: (2025)

Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL
by: Zhan, Guojian, et al.
Published: (2025)

How does Inverse RL Scale to Large State Spaces? A Provably Efficient Approach
by: Lazzati, Filippo, et al.
Published: (2024)

Testing Credibility of Public and Private Surveys through the Lens of Regression
by: Basu, Debabrota, et al.
Published: (2024)

Selective Rollout: Mid-Trajectory Termination for Multi-Sample Agent RL
by: Zhai, Zhiyuan, et al.
Published: (2026)

Commute Your Domains: Trajectory Optimality Criterion for Multi-Domain Learning
by: Rukhovich, Alexey, et al.
Published: (2025)

Hyperparameter Trajectory Inference with Conditional Lagrangian Optimal Transport
by: Amad, Harry, et al.
Published: (2026)

Consistent Optimal Transport with Empirical Conditional Measures
by: Manupriya, Piyushi, et al.
Published: (2023)

Optimal Transport for Measures with Noisy Tree Metric
by: Le, Tam, et al.
Published: (2023)

When Are RL Hyperparameters Benign? A Study in Offline Goal-Conditioned RL
by: Töpperwien, Jan Malte, et al.
Published: (2026)

Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS
by: Jin, Can, et al.
Published: (2025)

Optimal Transport with Tempered Exponential Measures
by: Amid, Ehsan, et al.
Published: (2023)

Workspace Optimization: How to Train Your Agent
by: Sarafian, Elad, et al.
Published: (2026)