:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Milosevic, Nikola, Franz, Leonard, Haeufle, Daniel, Martius, Georg, Scherf, Nico, Kolev, Pavel
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.04599
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Geometry of Nonlinear Reinforcement Learning
by: Milosevic, Nikola, et al.
Published: (2025)

Central Path Proximal Policy Optimization
by: Milosevic, Nikola, et al.
Published: (2025)

Embedding Safety into RL: A New Take on Trust Region Methods
by: Milosevic, Nikola, et al.
Published: (2024)

Open Problem: Active Representation Learning
by: Milosevic, Nikola, et al.
Published: (2024)

Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
by: Kolev, Pavel, et al.
Published: (2025)

Offline Diversity Maximization Under Imitation Constraints
by: Vlastelica, Marin, et al.
Published: (2023)

GASP: Guided Asymmetric Self-Play For Coding LLMs
by: Jana, Swadesh, et al.
Published: (2026)

Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics
by: Beylier, Charlotte, et al.
Published: (2024)

Directed Exploration in Reinforcement Learning from Linear Temporal Logic
by: Bagatella, Marco, et al.
Published: (2024)

Drifting Fields are not Conservative
by: Franz, Leonard T., et al.
Published: (2026)

Equity forecast: Predicting long term stock price movement using machine learning
by: Milosevic, Nikola
Published: (2016)

Zero-Shot Offline Imitation Learning via Optimal Transport
by: Rupf, Thomas, et al.
Published: (2024)

Soft Forward-Backward Representations for Zero-shot Reinforcement Learning with General Utilities
by: Bagatella, Marco, et al.
Published: (2026)

Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning
by: Beylier, Charlotte, et al.
Published: (2025)

Test-time Offline Reinforcement Learning on Goal-related Experience
by: Bagatella, Marco, et al.
Published: (2025)

Forecasting in Offline Reinforcement Learning for Non-stationary Environments
by: Ada, Suzan Ece, et al.
Published: (2025)

Physical Embodiment Enables Information Processing Beyond Explicit Sensing in Active Matter
by: Paul, Diptabrata, et al.
Published: (2025)

Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities
by: Zadaianchuk, Andrii, et al.
Published: (2023)

Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
by: Hollenstein, Jakob, et al.
Published: (2023)

LPGD: A General Framework for Backpropagation through Embedded Optimization Layers
by: Paulus, Anselm, et al.
Published: (2024)

Grid-World Representations in Transformers Reflect Predictive Geometry
by: Brenner, Sasha, et al.
Published: (2026)

Fault Detection in Solar Thermal Systems using Probabilistic Reconstructions
by: Ebmeier, Florian, et al.
Published: (2025)

Comparison of biomedical relationship extraction methods and models for knowledge graph creation
by: Milosevic, Nikola, et al.
Published: (2022)

Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time
by: Mazumdar, Abhijit, et al.
Published: (2024)

The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks
by: Spieler, Aaron, et al.
Published: (2023)

Fair Distributed Machine Learning with Imbalanced Data as a Stackelberg Evolutionary Game
by: Niehaus, Sebastian, et al.
Published: (2024)

Multimodal Recurrent Ensembles for Predicting Brain Responses to Naturalistic Movies (Algonauts 2025)
by: Eren, Semih, et al.
Published: (2025)

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
by: Sancaktar, Cansu, et al.
Published: (2025)

CombOptNet: Fit the Right NP-Hard Problem by Learning Integer Programming Constraints
by: Paulus, Anselm, et al.
Published: (2021)

Geometry matters: insights from Ollivier Ricci Curvature and Ricci Flow into representational alignment through Ollivier-Ricci Curvature and Ricci Flow
by: Torbati, Nahid, et al.
Published: (2025)

Learning 3D-Gaussian Simulators from RGB Videos
by: Zhobro, Mikel, et al.
Published: (2025)

Episodic-Semantic Memory Architecture for Long-Horizon Scientific Agents
by: Milosevic, Nikola
Published: (2026)

Active Fine-Tuning of Multi-Task Policies
by: Bagatella, Marco, et al.
Published: (2024)

Predicting Microbial Interactions Using Graph Neural Networks
by: Gholamzadeh, Elham, et al.
Published: (2025)

Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies
by: Chen, Jiaqi, et al.
Published: (2025)

Scaling Laws and Tradeoffs in Recurrent Networks of Expressive Neurons
by: Spieler, Aaron, et al.
Published: (2026)

Learning to Control Emulated Muscles in Real Robots: Towards Exploiting Bio-Inspired Actuator Morphology
by: Schumacher, Pierre, et al.
Published: (2024)

Differentiation of Blackbox Combinatorial Solvers
by: Vlastelica, Marin, et al.
Published: (2019)

Epistemically-guided forward-backward exploration
by: Urpí, Núria Armengol, et al.
Published: (2025)

A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
by: Guin, Soumyajit, et al.
Published: (2022)