| Main Authors: | Diederen, Tomek, Zamboni, Nicola |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.10232 |
Similar Items
Emergent Compositional Communication for Latent World Properties
by: Kaszyński, Tomek
Published: (2026)
Convergence of gradient flow for learning convolutional neural networks
by: Diederen, Jona-Maria, et al.
Published: (2026)
Scalable Multi-Agent Offline Reinforcement Learning and the Role of Information
by: Zamboni, Riccardo, et al.
Published: (2025)
Training Agents to Self-Report Misbehavior
by: Lee, Bruce W., et al.
Published: (2026)
Towards Principled Unsupervised Multi-Agent Reinforcement Learning
by: Zamboni, Riccardo, et al.
Published: (2025)
The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
by: Zamboni, Riccardo, et al.
Published: (2024)
Few measurement shots challenge generalization in learning to classify entanglement
by: Banchi, Leonardo, et al.
Published: (2024)
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
by: De Paola, Vincenzo, et al.
Published: (2025)
K-Myriad: Jump-starting reinforcement learning with unsupervised parallel agents
by: De Paola, Vincenzo, et al.
Published: (2026)
From Parameters to Behaviors: Unsupervised Compression of the Policy Space
by: Tenedini, Davide, et al.
Published: (2025)
How to Explore with Belief: State Entropy Maximization in POMDPs
by: Zamboni, Riccardo, et al.
Published: (2024)
Evaluating Angle and Amplitude Encoding Strategies for Variational Quantum Machine Learning: their impact on model's accuracy
by: Tudisco, Antonio, et al.
Published: (2025)
ParallelFlow: Parallelizing Linear Transformers via Flow Discretization
by: Cirone, Nicola Muca, et al.
Published: (2025)
Unsupervised Behavioral Compression: Learning Low-Dimensional Policy Manifolds through State-Occupancy Matching
by: Fraschini, Andrea, et al.
Published: (2026)
Probing Dec-POMDP Reasoning in Cooperative MARL
by: Tessera, Kale-ab, et al.
Published: (2026)
Remembering the Markov Property in Cooperative MARL
by: Tessera, Kale-ab Abebe, et al.
Published: (2025)
Fundamental Limitations in Pointwise Defences of LLM Finetuning APIs
by: Davies, Xander, et al.
Published: (2025)
Async Control: Stress-testing Asynchronous Control Measures for LLM Agents
by: Stickland, Asa Cooper, et al.
Published: (2025)
Run-Time Monitoring of ERTMS/ETCS Control Flow by Process Mining
by: Vitale, Francesco, et al.
Published: (2025)
MicroFlow: An Efficient Rust-Based Inference Engine for TinyML
by: Carnelos, Matteo, et al.
Published: (2024)
Multi-Marginal Flow Matching with Adversarially Learnt Interpolants
by: Kviman, Oskar, et al.
Published: (2025)
FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates
by: Pia, Nicola, et al.
Published: (2024)
Online Non-convex Optimization with Long-term Non-convex Constraints
by: Pan, Shijie, et al.
Published: (2023)
The Nyström method for convex loss functions
by: Della Vecchia, Andrea, et al.
Published: (2020)
Enforcing convex constraints in Graph Neural Networks
by: Rashwan, Ahmed, et al.
Published: (2025)
Denoising data using convex relaxations
by: Fefferman, Charles, et al.
Published: (2026)
Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion
by: Cutkosky, Ashok, et al.
Published: (2023)
Tightening convex relaxations of trained neural networks: a unified approach for convex and S-shaped activations
by: Carrasco, Pablo, et al.
Published: (2024)
CODA: Coordination via On-Policy Diffusion for Multi-Agent Offline Reinforcement Learning
by: Hedman, Marcel, et al.
Published: (2026)
Near-optimal delta-convex estimation of Lipschitz functions
by: Balázs, Gábor
Published: (2025)
A lift for input-convex neural network training
by: Siahkoohi, Ali, et al.
Published: (2026)
Differentially Private Non-convex Distributionally Robust Optimization
by: Xu, Difei, et al.
Published: (2026)
Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
by: O'Brien, Kyle, et al.
Published: (2025)
On amortizing convex conjugates for optimal transport
by: Amos, Brandon
Published: (2022)
Strong convexity-guided hyper-parameter optimization for flatter losses
by: Yedida, Rahul, et al.
Published: (2024)
Distributional simplicity bias and effective convexity in Energy Based Models
by: Decelle, Aurélien, et al.
Published: (2026)
Robust stabilization of polytopic systems via fast and reliable neural network-based approximations
by: Fabiani, Filippo, et al.
Published: (2022)
On convex decision regions in deep network representations
by: Tětková, Lenka, et al.
Published: (2023)
Graph Matching via convex relaxation to the simplex
by: Valdivia, Ernesto Araya, et al.
Published: (2023)
Nesterov acceleration in benignly non-convex landscapes
by: Gupta, Kanan, et al.
Published: (2024)