| Main Authors: | Aydın, Hüseyin, Godin-Dubois, Kevin, Braz, Libio Goncalvez, Hengst, Floris den, Baraka, Kim, Çelikok, Mustafa Mert, Sauter, Andreas, Wang, Shihan, Oliehoek, Frans A. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2501.19245 |
Similar Items
Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
by: Çelikok, Mustafa Mert, et al.
Published: (2024)
Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
by: Suau, Miguel, et al.
Published: (2022)
Uncoupled Learning of Differential Stackelberg Equilibria with Commitments
by: Loftin, Robert, et al.
Published: (2023)
Social Cooperation in Conversational AI Agents
by: Çelikok, Mustafa Mert, et al.
Published: (2025)
A Measure-Theoretic Finite-Sample Theory for Adaptive-Data Fitted Q-Iteration
by: Haussmann, Manuel, et al.
Published: (2026)
On the Complexity of Learning to Cooperate with Populations of Socially Rational Agents
by: Loftin, Robert, et al.
Published: (2024)
Conditional Policy Generator for Dynamic Constraint Satisfaction and Optimization
by: Lee, Wook, et al.
Published: (2025)
CoMI-IRL: Contrastive Multi-Intention Inverse Reinforcement Learning
by: Mone, Antonio, et al.
Published: (2026)
Hitting Time Isomorphism for Multi-Stage Planning with Foundation Policies
by: Boock, Magnus Victor, et al.
Published: (2026)
Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning
by: Castellini, Jacopo, et al.
Published: (2019)
Interactive embodied evolution for socially adept Artificial General Creatures
by: Godin-Dubois, Kevin, et al.
Published: (2024)
Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning
by: Allegue, Daniel De Dios, et al.
Published: (2025)
Symbolic Quantile Regression for the Interpretable Prediction of Conditional Quantiles
by: Hoekstra, Cas Oude, et al.
Published: (2025)
Distributional Active Inference
by: Akgül, Abdullah, et al.
Published: (2026)
Reinforcement Learning for Personalized Dialogue Management
by: Hengst, Floris den, et al.
Published: (2019)
RESILIENCIA FAMILIAR NO CONTEXTO DO TRANSTORNO DO Espectro AUTISTA: A PERSPECTIVA DOS IRMÃOS
by: Larissa Líbio
Published: (2023)
Las pautas de crianza en la ciudad de Mérida y su relación con la educación inicial. Registro etnográfico-exploratorio
by: Deisy Goncalvez
Published: (2011)
Explaining Learned Reward Functions with Counterfactual Trajectories
by: Wehner, Jan, et al.
Published: (2024)
Communicating with Speakers and Listeners of Different Pragmatic Levels
by: Naszadi, Kata, et al.
Published: (2024)
Physics-Informed Reinforcement Learning for Large-Scale EV Smart Charging Considering Distribution Network Voltage Constraints
by: Orfanoudakis, Stavros, et al.
Published: (2025)
APOGeT: Automated Phylogeny over Geological Time-scales
by: Godin-Dubois, Kevin, et al.
Published: (2024)
AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents
by: Godin-Dubois, Kevin, et al.
Published: (2024)
Benefits of Low-Cost Bio-Inspiration in the Age of Overparametrization
by: Godin-Dubois, Kevin, et al.
Published: (2026)
Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning
by: Osika, Zuzanna, et al.
Published: (2024)
Multi-Objective Reinforcement Learning for Water Management
by: Osika, Zuzanna, et al.
Published: (2025)
Timing the Match: A Deep Reinforcement Learning Approach for Ride-Hailing and Ride-Pooling Services
by: Bao, Yiman, et al.
Published: (2025)
Sample-Efficient Policy Space Response Oracles with Joint Experience Best Response
by: Bighashdel, Ariyan, et al.
Published: (2026)
SimuDICE: Offline Policy Optimization Through World Model Updates and DICE Estimation
by: Brita, Catalin E., et al.
Published: (2024)
Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
by: Suau, Miguel, et al.
Published: (2023)
Validación de la Escala de Empatía Percibida de los Padres (EEPP) en adultos jóvenes argentinos: propuesta de una versión breve
by: Agustín Benítez Goncalvez
Published: (2024)
Caracterización mineralógica de formaciones de hierro bandeadas en Vipongos, provincia Namibe, Angola
by: Antonio Olimpio-Gonçalvez
Published: (2021)
Detecting Linguistic Bias in Government Documents Using Large language Models
by: de Swart, Milena, et al.
Published: (2025)
Difference Rewards Policy Gradients
by: Castellini, Jacopo, et al.
Published: (2020)
Log Parsing Evaluation in the Era of Modern Software Systems
by: Petrescu, Stefan, et al.
Published: (2023)
Conformal Intent Classification and Clarification for Fast and Accurate Intent Recognition
by: Hengst, Floris den, et al.
Published: (2024)
EL SUJETO NEURONAL: APORTACIONES PARA UNA PEDAGOGÍA DE LA POSIBILIDAD
by: Teresa N. R. Gonçalvez
Published: (2012)
Las principales olas o ciclos de fusiones y adquisiciones desde finales del siglo XIX hasta la actualidad
by: Víctor Libio de los Ríos
Published: (2011)
A Study of Commonsense Reasoning over Visual Object Properties
by: Kolari, Abhishek, et al.
Published: (2025)
Human Immune Response to Influenza Neuraminidase After Vaccination: A Systematic Review
by: Vardhini Ganesh, et al.
Published: (2025)
Value Improved Actor Critic Algorithms
by: Oren, Yaniv, et al.
Published: (2024)