:: Library Catalog

Bibliographic Details
Main Authors:	Boock, Magnus Victor, Akgül, Abdullah, Çelikok, Mustafa Mert, Kandemir, Melih
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2605.06470

Similar Items

Distributional Active Inference
by: Akgül, Abdullah, et al.
Published: (2026)

A Measure-Theoretic Finite-Sample Theory for Adaptive-Data Fitted Q-Iteration
by: Haussmann, Manuel, et al.
Published: (2026)

Continual Learning of Multi-modal Dynamics with External Memory
by: Akgül, Abdullah, et al.
Published: (2022)

Overcoming Non-stationary Dynamics with Evidential Proximal Policy Optimization
by: Akgül, Abdullah, et al.
Published: (2025)

Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
by: Akgül, Abdullah, et al.
Published: (2024)

Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits
by: Werge, Nicklas, et al.
Published: (2023)

PAC-Bayesian Soft Actor-Critic Learning
by: Tasdighi, Bahareh, et al.
Published: (2023)

Calibrating Bayesian UNet++ for Sub-Seasonal Forecasting
by: Asan, Busra, et al.
Published: (2024)

ObjectRL: An Object-Oriented Reinforcement Learning Codebase
by: Baykal, Gulcin, et al.
Published: (2025)

Social Cooperation in Conversational AI Agents
by: Çelikok, Mustafa Mert, et al.
Published: (2025)

On the Complexity of Learning to Cooperate with Populations of Socially Rational Agents
by: Loftin, Robert, et al.
Published: (2024)

Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning
by: Vincent, Théo, et al.
Published: (2025)

Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards
by: Baran, Orhun Buğra, et al.
Published: (2026)

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
by: Suau, Miguel, et al.
Published: (2022)

Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
by: Çelikok, Mustafa Mert, et al.
Published: (2024)

Disentanglement with Factor Quantized Variational Autoencoders
by: Baykal, Gulcin, et al.
Published: (2024)

EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders
by: Baykal, Gulcin, et al.
Published: (2023)

Uncoupled Learning of Differential Stackelberg Equilibria with Commitments
by: Loftin, Robert, et al.
Published: (2023)

Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures
by: Flynn, Hamish, et al.
Published: (2023)

Improving Actor-Critic Training with Steerable Action-Value Approximation Errors
by: Tasdighi, Bahareh, et al.
Published: (2024)

Adaptive Ensemble Aggregation for Actor-Critics
by: Werge, Nicklas, et al.
Published: (2025)

Deep Exploration with PAC-Bayes
by: Tasdighi, Bahareh, et al.
Published: (2024)

Value Improved Actor Critic Algorithms
by: Oren, Yaniv, et al.
Published: (2024)

Deep Actor-Critics with Tight Risk Certificates
by: Tasdighi, Bahareh, et al.
Published: (2025)

GRU-D Characterizes Age-Specific Temporal Missingness in MIMIC-IV
by: Giesa, Niklas, et al.
Published: (2024)

DIGing-SGLD: Decentralized and Scalable Langevin Sampling over Time-Varying Networks
by: Bajwa, Waheed U., et al.
Published: (2025)

Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
by: Zhang, Fangzhao, et al.
Published: (2024)

Exogenous Isomorphism for Counterfactual Identifiability
by: Chen, Yikang, et al.
Published: (2025)

Causal Data Augmentation for Robust Fine-Tuning of Tabular Foundation Models
by: Bühler, Magnus, et al.
Published: (2026)

CANet: ChronoAdaptive Network for Enhanced Long-Term Time Series Forecasting under Non-Stationarity
by: Sonmezer, Mert, et al.
Published: (2025)

Foundation model for mass spectrometry proteomics
by: Sanders, Justin, et al.
Published: (2025)

Algorithmic Stability of Stochastic Gradient Descent with Momentum under Heavy-Tailed Noise
by: Dang, Thanh, et al.
Published: (2025)

Robust Defense Against Extreme Grid Events Using Dual-Policy Reinforcement Learning Agents
by: Peter, Benjamin M., et al.
Published: (2024)

Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection
by: Gao, Daiqi, et al.
Published: (2023)

ULTRA-MC: A Unified Approach to Learning Mixtures of Markov Chains via Hitting Times
by: Spaeh, Fabian, et al.
Published: (2024)

Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information
by: Akgül, Ömer Faruk, et al.
Published: (2024)

DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns
by: Phan, Trung V., et al.
Published: (2026)

Multi-Stage Prototype Learning for Interpretable Time Series Classification
by: Kalisetti, Bhavesh, et al.
Published: (2021)

Towards Subgraph Isomorphism Counting with Graph Kernels
by: Liu, Xin, et al.
Published: (2024)

Learning to Count Isomorphisms with Graph Neural Networks
by: Yu, Xingtong, et al.
Published: (2023)