:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Diederen, Tomek, Zamboni, Nicola
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2503.10232
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Emergent Compositional Communication for Latent World Properties
by: Kaszyński, Tomek
Published: (2026)

Convergence of gradient flow for learning convolutional neural networks
by: Diederen, Jona-Maria, et al.
Published: (2026)

Scalable Multi-Agent Offline Reinforcement Learning and the Role of Information
by: Zamboni, Riccardo, et al.
Published: (2025)

Training Agents to Self-Report Misbehavior
by: Lee, Bruce W., et al.
Published: (2026)

Towards Principled Unsupervised Multi-Agent Reinforcement Learning
by: Zamboni, Riccardo, et al.
Published: (2025)

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
by: Zamboni, Riccardo, et al.
Published: (2024)

Few measurement shots challenge generalization in learning to classify entanglement
by: Banchi, Leonardo, et al.
Published: (2024)

Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
by: De Paola, Vincenzo, et al.
Published: (2025)

K-Myriad: Jump-starting reinforcement learning with unsupervised parallel agents
by: De Paola, Vincenzo, et al.
Published: (2026)

From Parameters to Behaviors: Unsupervised Compression of the Policy Space
by: Tenedini, Davide, et al.
Published: (2025)

How to Explore with Belief: State Entropy Maximization in POMDPs
by: Zamboni, Riccardo, et al.
Published: (2024)

Evaluating Angle and Amplitude Encoding Strategies for Variational Quantum Machine Learning: their impact on model's accuracy
by: Tudisco, Antonio, et al.
Published: (2025)

ParallelFlow: Parallelizing Linear Transformers via Flow Discretization
by: Cirone, Nicola Muca, et al.
Published: (2025)

Unsupervised Behavioral Compression: Learning Low-Dimensional Policy Manifolds through State-Occupancy Matching
by: Fraschini, Andrea, et al.
Published: (2026)

Probing Dec-POMDP Reasoning in Cooperative MARL
by: Tessera, Kale-ab, et al.
Published: (2026)

Remembering the Markov Property in Cooperative MARL
by: Tessera, Kale-ab Abebe, et al.
Published: (2025)

Fundamental Limitations in Pointwise Defences of LLM Finetuning APIs
by: Davies, Xander, et al.
Published: (2025)

Async Control: Stress-testing Asynchronous Control Measures for LLM Agents
by: Stickland, Asa Cooper, et al.
Published: (2025)

Run-Time Monitoring of ERTMS/ETCS Control Flow by Process Mining
by: Vitale, Francesco, et al.
Published: (2025)

MicroFlow: An Efficient Rust-Based Inference Engine for TinyML
by: Carnelos, Matteo, et al.
Published: (2024)

Multi-Marginal Flow Matching with Adversarially Learnt Interpolants
by: Kviman, Oskar, et al.
Published: (2025)

FlowMAC: Conditional Flow Matching for Audio Coding at Low Bit Rates
by: Pia, Nicola, et al.
Published: (2024)

Online Non-convex Optimization with Long-term Non-convex Constraints
by: Pan, Shijie, et al.
Published: (2023)

The Nyström method for convex loss functions
by: Della Vecchia, Andrea, et al.
Published: (2020)

Enforcing convex constraints in Graph Neural Networks
by: Rashwan, Ahmed, et al.
Published: (2025)

Denoising data using convex relaxations
by: Fefferman, Charles, et al.
Published: (2026)

Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion
by: Cutkosky, Ashok, et al.
Published: (2023)

Tightening convex relaxations of trained neural networks: a unified approach for convex and S-shaped activations
by: Carrasco, Pablo, et al.
Published: (2024)

CODA: Coordination via On-Policy Diffusion for Multi-Agent Offline Reinforcement Learning
by: Hedman, Marcel, et al.
Published: (2026)

Near-optimal delta-convex estimation of Lipschitz functions
by: Balázs, Gábor
Published: (2025)

A lift for input-convex neural network training
by: Siahkoohi, Ali, et al.
Published: (2026)

Differentially Private Non-convex Distributionally Robust Optimization
by: Xu, Difei, et al.
Published: (2026)

Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
by: O'Brien, Kyle, et al.
Published: (2025)

On amortizing convex conjugates for optimal transport
by: Amos, Brandon
Published: (2022)

Strong convexity-guided hyper-parameter optimization for flatter losses
by: Yedida, Rahul, et al.
Published: (2024)

Distributional simplicity bias and effective convexity in Energy Based Models
by: Decelle, Aurélien, et al.
Published: (2026)

Robust stabilization of polytopic systems via fast and reliable neural network-based approximations
by: Fabiani, Filippo, et al.
Published: (2022)

On convex decision regions in deep network representations
by: Tětková, Lenka, et al.
Published: (2023)

Graph Matching via convex relaxation to the simplex
by: Valdivia, Ernesto Araya, et al.
Published: (2023)

Nesterov acceleration in benignly non-convex landscapes
by: Gupta, Kanan, et al.
Published: (2024)