Saved in:
| Main Authors: | Boock, Magnus Victor, Akgül, Abdullah, Çelikok, Mustafa Mert, Kandemir, Melih |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.06470 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Distributional Active Inference
by: Akgül, Abdullah, et al.
Published: (2026)
by: Akgül, Abdullah, et al.
Published: (2026)
A Measure-Theoretic Finite-Sample Theory for Adaptive-Data Fitted Q-Iteration
by: Haussmann, Manuel, et al.
Published: (2026)
by: Haussmann, Manuel, et al.
Published: (2026)
Continual Learning of Multi-modal Dynamics with External Memory
by: Akgül, Abdullah, et al.
Published: (2022)
by: Akgül, Abdullah, et al.
Published: (2022)
Overcoming Non-stationary Dynamics with Evidential Proximal Policy Optimization
by: Akgül, Abdullah, et al.
Published: (2025)
by: Akgül, Abdullah, et al.
Published: (2025)
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
by: Akgül, Abdullah, et al.
Published: (2024)
by: Akgül, Abdullah, et al.
Published: (2024)
Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits
by: Werge, Nicklas, et al.
Published: (2023)
by: Werge, Nicklas, et al.
Published: (2023)
PAC-Bayesian Soft Actor-Critic Learning
by: Tasdighi, Bahareh, et al.
Published: (2023)
by: Tasdighi, Bahareh, et al.
Published: (2023)
Calibrating Bayesian UNet++ for Sub-Seasonal Forecasting
by: Asan, Busra, et al.
Published: (2024)
by: Asan, Busra, et al.
Published: (2024)
ObjectRL: An Object-Oriented Reinforcement Learning Codebase
by: Baykal, Gulcin, et al.
Published: (2025)
by: Baykal, Gulcin, et al.
Published: (2025)
Social Cooperation in Conversational AI Agents
by: Çelikok, Mustafa Mert, et al.
Published: (2025)
by: Çelikok, Mustafa Mert, et al.
Published: (2025)
On the Complexity of Learning to Cooperate with Populations of Socially Rational Agents
by: Loftin, Robert, et al.
Published: (2024)
by: Loftin, Robert, et al.
Published: (2024)
Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning
by: Vincent, Théo, et al.
Published: (2025)
by: Vincent, Théo, et al.
Published: (2025)
Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards
by: Baran, Orhun Buğra, et al.
Published: (2026)
by: Baran, Orhun Buğra, et al.
Published: (2026)
Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
by: Suau, Miguel, et al.
Published: (2022)
by: Suau, Miguel, et al.
Published: (2022)
Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
by: Çelikok, Mustafa Mert, et al.
Published: (2024)
by: Çelikok, Mustafa Mert, et al.
Published: (2024)
Disentanglement with Factor Quantized Variational Autoencoders
by: Baykal, Gulcin, et al.
Published: (2024)
by: Baykal, Gulcin, et al.
Published: (2024)
EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders
by: Baykal, Gulcin, et al.
Published: (2023)
by: Baykal, Gulcin, et al.
Published: (2023)
Uncoupled Learning of Differential Stackelberg Equilibria with Commitments
by: Loftin, Robert, et al.
Published: (2023)
by: Loftin, Robert, et al.
Published: (2023)
Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures
by: Flynn, Hamish, et al.
Published: (2023)
by: Flynn, Hamish, et al.
Published: (2023)
Improving Actor-Critic Training with Steerable Action-Value Approximation Errors
by: Tasdighi, Bahareh, et al.
Published: (2024)
by: Tasdighi, Bahareh, et al.
Published: (2024)
Adaptive Ensemble Aggregation for Actor-Critics
by: Werge, Nicklas, et al.
Published: (2025)
by: Werge, Nicklas, et al.
Published: (2025)
Deep Exploration with PAC-Bayes
by: Tasdighi, Bahareh, et al.
Published: (2024)
by: Tasdighi, Bahareh, et al.
Published: (2024)
Value Improved Actor Critic Algorithms
by: Oren, Yaniv, et al.
Published: (2024)
by: Oren, Yaniv, et al.
Published: (2024)
Deep Actor-Critics with Tight Risk Certificates
by: Tasdighi, Bahareh, et al.
Published: (2025)
by: Tasdighi, Bahareh, et al.
Published: (2025)
GRU-D Characterizes Age-Specific Temporal Missingness in MIMIC-IV
by: Giesa, Niklas, et al.
Published: (2024)
by: Giesa, Niklas, et al.
Published: (2024)
DIGing--SGLD: Decentralized and Scalable Langevin Sampling over Time--Varying Networks
by: Bajwa, Waheed U., et al.
Published: (2025)
by: Bajwa, Waheed U., et al.
Published: (2025)
Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
Exogenous Isomorphism for Counterfactual Identifiability
by: Chen, Yikang, et al.
Published: (2025)
by: Chen, Yikang, et al.
Published: (2025)
Causal Data Augmentation for Robust Fine-Tuning of Tabular Foundation Models
by: Bühler, Magnus, et al.
Published: (2026)
by: Bühler, Magnus, et al.
Published: (2026)
CANet: ChronoAdaptive Network for Enhanced Long-Term Time Series Forecasting under Non-Stationarity
by: Sonmezer, Mert, et al.
Published: (2025)
by: Sonmezer, Mert, et al.
Published: (2025)
Foundation model for mass spectrometry proteomics
by: Sanders, Justin, et al.
Published: (2025)
by: Sanders, Justin, et al.
Published: (2025)
Algorithmic Stability of Stochastic Gradient Descent with Momentum under Heavy-Tailed Noise
by: Dang, Thanh, et al.
Published: (2025)
by: Dang, Thanh, et al.
Published: (2025)
Robust Defense Against Extreme Grid Events Using Dual-Policy Reinforcement Learning Agents
by: Peter, Benjamin M., et al.
Published: (2024)
by: Peter, Benjamin M., et al.
Published: (2024)
Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection
by: Gao, Daiqi, et al.
Published: (2023)
by: Gao, Daiqi, et al.
Published: (2023)
ULTRA-MC: A Unified Approach to Learning Mixtures of Markov Chains via Hitting Times
by: Spaeh, Fabian, et al.
Published: (2024)
by: Spaeh, Fabian, et al.
Published: (2024)
Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information
by: Akgül, Ömer Faruk, et al.
Published: (2024)
by: Akgül, Ömer Faruk, et al.
Published: (2024)
DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns
by: Phan, Trung V., et al.
Published: (2026)
by: Phan, Trung V., et al.
Published: (2026)
Multi-Stage Prototype Learning for Interpretable Time Series Classification
by: Kalisetti, Bhavesh, et al.
Published: (2021)
by: Kalisetti, Bhavesh, et al.
Published: (2021)
Towards Subgraph Isomorphism Counting with Graph Kernels
by: Liu, Xin, et al.
Published: (2024)
by: Liu, Xin, et al.
Published: (2024)
Learning to Count Isomorphisms with Graph Neural Networks
by: Yu, Xingtong, et al.
Published: (2023)
by: Yu, Xingtong, et al.
Published: (2023)
Similar Items
-
Distributional Active Inference
by: Akgül, Abdullah, et al.
Published: (2026) -
A Measure-Theoretic Finite-Sample Theory for Adaptive-Data Fitted Q-Iteration
by: Haussmann, Manuel, et al.
Published: (2026) -
Continual Learning of Multi-modal Dynamics with External Memory
by: Akgül, Abdullah, et al.
Published: (2022) -
Overcoming Non-stationary Dynamics with Evidential Proximal Policy Optimization
by: Akgül, Abdullah, et al.
Published: (2025) -
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
by: Akgül, Abdullah, et al.
Published: (2024)