Saved in:
| Main Authors: | Milosevic, Nikola, Franz, Leonard, Haeufle, Daniel, Martius, Georg, Scherf, Nico, Kolev, Pavel |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04599 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Geometry of Nonlinear Reinforcement Learning
by: Milosevic, Nikola, et al.
Published: (2025)
by: Milosevic, Nikola, et al.
Published: (2025)
Central Path Proximal Policy Optimization
by: Milosevic, Nikola, et al.
Published: (2025)
by: Milosevic, Nikola, et al.
Published: (2025)
Embedding Safety into RL: A New Take on Trust Region Methods
by: Milosevic, Nikola, et al.
Published: (2024)
by: Milosevic, Nikola, et al.
Published: (2024)
Open Problem: Active Representation Learning
by: Milosevic, Nikola, et al.
Published: (2024)
by: Milosevic, Nikola, et al.
Published: (2024)
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
by: Kolev, Pavel, et al.
Published: (2025)
by: Kolev, Pavel, et al.
Published: (2025)
Offline Diversity Maximization Under Imitation Constraints
by: Vlastelica, Marin, et al.
Published: (2023)
by: Vlastelica, Marin, et al.
Published: (2023)
GASP: Guided Asymmetric Self-Play For Coding LLMs
by: Jana, Swadesh, et al.
Published: (2026)
by: Jana, Swadesh, et al.
Published: (2026)
Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics
by: Beylier, Charlotte, et al.
Published: (2024)
by: Beylier, Charlotte, et al.
Published: (2024)
Directed Exploration in Reinforcement Learning from Linear Temporal Logic
by: Bagatella, Marco, et al.
Published: (2024)
by: Bagatella, Marco, et al.
Published: (2024)
Drifting Fields are not Conservative
by: Franz, Leonard T., et al.
Published: (2026)
by: Franz, Leonard T., et al.
Published: (2026)
Equity forecast: Predicting long term stock price movement using machine learning
by: Milosevic, Nikola
Published: (2016)
by: Milosevic, Nikola
Published: (2016)
Zero-Shot Offline Imitation Learning via Optimal Transport
by: Rupf, Thomas, et al.
Published: (2024)
by: Rupf, Thomas, et al.
Published: (2024)
Soft Forward-Backward Representations for Zero-shot Reinforcement Learning with General Utilities
by: Bagatella, Marco, et al.
Published: (2026)
by: Bagatella, Marco, et al.
Published: (2026)
Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning
by: Beylier, Charlotte, et al.
Published: (2025)
by: Beylier, Charlotte, et al.
Published: (2025)
Test-time Offline Reinforcement Learning on Goal-related Experience
by: Bagatella, Marco, et al.
Published: (2025)
by: Bagatella, Marco, et al.
Published: (2025)
Forecasting in Offline Reinforcement Learning for Non-stationary Environments
by: Ada, Suzan Ece, et al.
Published: (2025)
by: Ada, Suzan Ece, et al.
Published: (2025)
Physical Embodiment Enables Information Processing Beyond Explicit Sensing in Active Matter
by: Paul, Diptabrata, et al.
Published: (2025)
by: Paul, Diptabrata, et al.
Published: (2025)
Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities
by: Zadaianchuk, Andrii, et al.
Published: (2023)
by: Zadaianchuk, Andrii, et al.
Published: (2023)
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
by: Hollenstein, Jakob, et al.
Published: (2023)
by: Hollenstein, Jakob, et al.
Published: (2023)
LPGD: A General Framework for Backpropagation through Embedded Optimization Layers
by: Paulus, Anselm, et al.
Published: (2024)
by: Paulus, Anselm, et al.
Published: (2024)
Grid-World Representations in Transformers Reflect Predictive Geometry
by: Brenner, Sasha, et al.
Published: (2026)
by: Brenner, Sasha, et al.
Published: (2026)
Fault Detection in Solar Thermal Systems using Probabilistic Reconstructions
by: Ebmeier, Florian, et al.
Published: (2025)
by: Ebmeier, Florian, et al.
Published: (2025)
Comparison of biomedical relationship extraction methods and models for knowledge graph creation
by: Milosevic, Nikola, et al.
Published: (2022)
by: Milosevic, Nikola, et al.
Published: (2022)
Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time
by: Mazumdar, Abhijit, et al.
Published: (2024)
by: Mazumdar, Abhijit, et al.
Published: (2024)
The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks
by: Spieler, Aaron, et al.
Published: (2023)
by: Spieler, Aaron, et al.
Published: (2023)
Fair Distributed Machine Learning with Imbalanced Data as a Stackelberg Evolutionary Game
by: Niehaus, Sebastian, et al.
Published: (2024)
by: Niehaus, Sebastian, et al.
Published: (2024)
Multimodal Recurrent Ensembles for Predicting Brain Responses to Naturalistic Movies (Algonauts 2025)
by: Eren, Semih, et al.
Published: (2025)
by: Eren, Semih, et al.
Published: (2025)
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
by: Sancaktar, Cansu, et al.
Published: (2025)
by: Sancaktar, Cansu, et al.
Published: (2025)
CombOptNet: Fit the Right NP-Hard Problem by Learning Integer Programming Constraints
by: Paulus, Anselm, et al.
Published: (2021)
by: Paulus, Anselm, et al.
Published: (2021)
Geometry matters: insights from Ollivier Ricci Curvature and Ricci Flow into representational alignment through Ollivier-Ricci Curvature and Ricci Flow
by: Torbati, Nahid, et al.
Published: (2025)
by: Torbati, Nahid, et al.
Published: (2025)
Learning 3D-Gaussian Simulators from RGB Videos
by: Zhobro, Mikel, et al.
Published: (2025)
by: Zhobro, Mikel, et al.
Published: (2025)
Episodic-Semantic Memory Architecture for Long-Horizon Scientific Agents
by: Milosevic, Nikola
Published: (2026)
by: Milosevic, Nikola
Published: (2026)
Active Fine-Tuning of Multi-Task Policies
by: Bagatella, Marco, et al.
Published: (2024)
by: Bagatella, Marco, et al.
Published: (2024)
Predicting Microbial Interactions Using Graph Neural Networks
by: Gholamzadeh, Elham, et al.
Published: (2025)
by: Gholamzadeh, Elham, et al.
Published: (2025)
Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies
by: Chen, Jiaqi, et al.
Published: (2025)
by: Chen, Jiaqi, et al.
Published: (2025)
Scaling Laws and Tradeoffs in Recurrent Networks of Expressive Neurons
by: Spieler, Aaron, et al.
Published: (2026)
by: Spieler, Aaron, et al.
Published: (2026)
Learning to Control Emulated Muscles in Real Robots: Towards Exploiting Bio-Inspired Actuator Morphology
by: Schumacher, Pierre, et al.
Published: (2024)
by: Schumacher, Pierre, et al.
Published: (2024)
Differentiation of Blackbox Combinatorial Solvers
by: Vlastelica, Marin, et al.
Published: (2019)
by: Vlastelica, Marin, et al.
Published: (2019)
Epistemically-guided forward-backward exploration
by: Urpí, Núria Armengol, et al.
Published: (2025)
by: Urpí, Núria Armengol, et al.
Published: (2025)
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
by: Guin, Soumyajit, et al.
Published: (2022)
by: Guin, Soumyajit, et al.
Published: (2022)
Similar Items
-
The Geometry of Nonlinear Reinforcement Learning
by: Milosevic, Nikola, et al.
Published: (2025) -
Central Path Proximal Policy Optimization
by: Milosevic, Nikola, et al.
Published: (2025) -
Embedding Safety into RL: A New Take on Trust Region Methods
by: Milosevic, Nikola, et al.
Published: (2024) -
Open Problem: Active Representation Learning
by: Milosevic, Nikola, et al.
Published: (2024) -
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
by: Kolev, Pavel, et al.
Published: (2025)