Saved in:
| Main Authors: | Nkhumise, Reabetswe M., Basu, Debabrota, Prescott, Tony J., Gilra, Aditya |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.09113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Issues with Measuring Task Complexity via Random Policies in Robotic Tasks
by: Nkhumise, Reabetswe M., et al.
Published: (2026)
by: Nkhumise, Reabetswe M., et al.
Published: (2026)
Lagrangian-based Equilibrium Propagation: generalisation to arbitrary boundary conditions & equivalence with Hamiltonian Echo Learning
by: Pourcel, Guillaume, et al.
Published: (2025)
by: Pourcel, Guillaume, et al.
Published: (2025)
Dynamical-VAE-based Hindsight to Learn the Causal Dynamics of Factored-POMDPs
by: Han, Chao, et al.
Published: (2024)
by: Han, Chao, et al.
Published: (2024)
Isoperimetry is All We Need: Langevin Posterior Sampling for RL with Sublinear Regret
by: Jorge, Emilio, et al.
Published: (2024)
by: Jorge, Emilio, et al.
Published: (2024)
Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints
by: Das, Udvas, et al.
Published: (2024)
by: Das, Udvas, et al.
Published: (2024)
Preference-based Pure Exploration
by: Shukla, Apurv, et al.
Published: (2024)
by: Shukla, Apurv, et al.
Published: (2024)
Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback
by: Della Vecchia, Riccardo, et al.
Published: (2023)
by: Della Vecchia, Riccardo, et al.
Published: (2023)
Some Targets Are Harder to Identify than Others: Quantifying the Target-dependent Membership Leakage
by: Azize, Achraf, et al.
Published: (2024)
by: Azize, Achraf, et al.
Published: (2024)
Auditing Fairness under Model Updates: Fundamental Complexity and Property-Preserving Updates
by: Ajarra, Ayoub, et al.
Published: (2026)
by: Ajarra, Ayoub, et al.
Published: (2026)
Performative Policy Gradient: Optimality in Performative Reinforcement Learning
by: Basu, Debabrota, et al.
Published: (2025)
by: Basu, Debabrota, et al.
Published: (2025)
Sublinear Algorithms for Wasserstein and Total Variation Distances: Applications to Fairness and Privacy Auditing
by: Basu, Debabrota, et al.
Published: (2025)
by: Basu, Debabrota, et al.
Published: (2025)
Concentrated Differential Privacy for Bandits
by: Azize, Achraf, et al.
Published: (2023)
by: Azize, Achraf, et al.
Published: (2023)
FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits
by: Chakraborty, Sunrit, et al.
Published: (2024)
by: Chakraborty, Sunrit, et al.
Published: (2024)
Active Fourier Auditor for Estimating Distributional Properties of ML Models
by: Ajarra, Ayoub, et al.
Published: (2024)
by: Ajarra, Ayoub, et al.
Published: (2024)
Sequential Membership Inference Attacks
by: Michel, Thomas, et al.
Published: (2026)
by: Michel, Thomas, et al.
Published: (2026)
DP-SPRT: Differentially Private Sequential Probability Ratio Tests
by: Michel, Thomas, et al.
Published: (2025)
by: Michel, Thomas, et al.
Published: (2025)
Asymptotically Optimal Sequential Testing with Markovian Data
by: Sethi, Alhad, et al.
Published: (2026)
by: Sethi, Alhad, et al.
Published: (2026)
FraPPE: Fast and Efficient Preference-based Pure Exploration
by: Das, Udvas, et al.
Published: (2025)
by: Das, Udvas, et al.
Published: (2025)
When Witnesses Defend: A Witness Graph Topological Layer for Adversarial Graph Learning
by: Arafat, Naheed Anjum, et al.
Published: (2024)
by: Arafat, Naheed Anjum, et al.
Published: (2024)
Augmented Bayesian Policy Search
by: Kallel, Mahdi, et al.
Published: (2024)
by: Kallel, Mahdi, et al.
Published: (2024)
Pure Exploration in Bandits with Linear Constraints
by: Carlsson, Emil, et al.
Published: (2023)
by: Carlsson, Emil, et al.
Published: (2023)
Optimal Regret of Bernoulli Bandits under Global Differential Privacy
by: Azize, Achraf, et al.
Published: (2025)
by: Azize, Achraf, et al.
Published: (2025)
Differentially Private Best-Arm Identification
by: Azize, Achraf, et al.
Published: (2024)
by: Azize, Achraf, et al.
Published: (2024)
Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning
by: Han, Chao, et al.
Published: (2026)
by: Han, Chao, et al.
Published: (2026)
DOT: Dynamic Knob Selection and Online Sampling for Automated Database Tuning
by: Wang, Yifan, et al.
Published: (2026)
by: Wang, Yifan, et al.
Published: (2026)
Offline RL via Feature-Occupancy Gradient Ascent
by: Neu, Gergely, et al.
Published: (2024)
by: Neu, Gergely, et al.
Published: (2024)
Dimension Agnostic Testing of Survey Data Credibility through the Lens of Regression
by: Basu, Debabrota, et al.
Published: (2025)
by: Basu, Debabrota, et al.
Published: (2025)
Synthesis and Analysis of Data as Probability Measures with Entropy-Regularized Optimal Transport
by: Mallery, Brendan, et al.
Published: (2025)
by: Mallery, Brendan, et al.
Published: (2025)
Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL
by: Zhan, Guojian, et al.
Published: (2025)
by: Zhan, Guojian, et al.
Published: (2025)
How does Inverse RL Scale to Large State Spaces? A Provably Efficient Approach
by: Lazzati, Filippo, et al.
Published: (2024)
by: Lazzati, Filippo, et al.
Published: (2024)
Testing Credibility of Public and Private Surveys through the Lens of Regression
by: Basu, Debabrota, et al.
Published: (2024)
by: Basu, Debabrota, et al.
Published: (2024)
Selective Rollout: Mid-Trajectory Termination for Multi-Sample Agent RL
by: Zhai, Zhiyuan, et al.
Published: (2026)
by: Zhai, Zhiyuan, et al.
Published: (2026)
Commute Your Domains: Trajectory Optimality Criterion for Multi-Domain Learning
by: Rukhovich, Alexey, et al.
Published: (2025)
by: Rukhovich, Alexey, et al.
Published: (2025)
Hyperparameter Trajectory Inference with Conditional Lagrangian Optimal Transport
by: Amad, Harry, et al.
Published: (2026)
by: Amad, Harry, et al.
Published: (2026)
Consistent Optimal Transport with Empirical Conditional Measures
by: Manupriya, Piyushi, et al.
Published: (2023)
by: Manupriya, Piyushi, et al.
Published: (2023)
Optimal Transport for Measures with Noisy Tree Metric
by: Le, Tam, et al.
Published: (2023)
by: Le, Tam, et al.
Published: (2023)
When Are RL Hyperparameters Benign? A Study in Offline Goal-Conditioned RL
by: Töpperwien, Jan Malte, et al.
Published: (2026)
by: Töpperwien, Jan Malte, et al.
Published: (2026)
Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS
by: Jin, Can, et al.
Published: (2025)
by: Jin, Can, et al.
Published: (2025)
Optimal Transport with Tempered Exponential Measures
by: Amid, Ehsan, et al.
Published: (2023)
by: Amid, Ehsan, et al.
Published: (2023)
Workspace Optimization: How to Train Your Agent
by: Sarafian, Elad, et al.
Published: (2026)
by: Sarafian, Elad, et al.
Published: (2026)
Similar Items
-
Issues with Measuring Task Complexity via Random Policies in Robotic Tasks
by: Nkhumise, Reabetswe M., et al.
Published: (2026) -
Lagrangian-based Equilibrium Propagation: generalisation to arbitrary boundary conditions & equivalence with Hamiltonian Echo Learning
by: Pourcel, Guillaume, et al.
Published: (2025) -
Dynamical-VAE-based Hindsight to Learn the Causal Dynamics of Factored-POMDPs
by: Han, Chao, et al.
Published: (2024) -
Isoperimetry is All We Need: Langevin Posterior Sampling for RL with Sublinear Regret
by: Jorge, Emilio, et al.
Published: (2024) -
Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints
by: Das, Udvas, et al.
Published: (2024)