:: Library Catalog

Image de couverture de livre

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Olausson, Theo X., Monteiro, João, Klein, Michal, Cuturi, Marco
Format:	Preprint
Publié:	2026
Sujets:	Machine Learning
Accès en ligne:	https://arxiv.org/abs/2603.08001
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Documents similaires

Learning Unmasking Policies for Diffusion Language Models
par: Jazbec, Metod, et autres
Publié: (2025)

Nectar: Neural Estimation of Cached-Token Attention via Regression
par: Monteiro, João, et autres
Publié: (2026)

GENOT: Entropic (Gromov) Wasserstein Flow Matching with Applications to Single-Cell Genomics
par: Klein, Dominik, et autres
Publié: (2023)

Multivariate Conformal Prediction using Optimal Transport
par: Klein, Michal, et autres
Publié: (2025)

Contrasting Multiple Representations with the Multi-Marginal Matching Gap
par: Piran, Zoe, et autres
Publié: (2024)

Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing
par: Filippova, Anastasiia, et autres
Publié: (2026)

On Fitting Flow Models with Large Sinkhorn Couplings
par: Zhang, Stephen, et autres
Publié: (2025)

Flow Matching with Semidiscrete Couplings
par: Mousavi-Hosseini, Alireza, et autres
Publié: (2025)

HyperTransport: Amortized Conditioning of T2I Generative Models
par: Maiorca, Valentino, et autres
Publié: (2026)

Learning Elastic Costs to Shape Monge Displacements
par: Klein, Michal, et autres
Publié: (2023)

DynaMiCS: Fine-tuning LLMs with Performance Constraints using Dynamic Mixtures
par: Gualdoni, Eleonora, et autres
Publié: (2026)

Careful with that Scalpel: Improving Gradient Surgery with an EMA
par: Hsieh, Yu-Guan, et autres
Publié: (2024)

Progressive Entropic Optimal Transport Solvers
par: Kassraie, Parnian, et autres
Publié: (2024)

The Data-Quality Illusion: Rethinking Classifier-Based Quality Filtering for LLM Pretraining
par: Saada, Thiziri Nait, et autres
Publié: (2025)

On a Neural Implementation of Brenier's Polar Factorization
par: Vesseron, Nina, et autres
Publié: (2024)

Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search
par: Bruch, Sebastian, et autres
Publié: (2024)

Disentangled Representation Learning with the Gromov-Monge Gap
par: Uscidda, Théo, et autres
Publié: (2024)

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration
par: Mlodozeniec, Bruno, et autres
Publié: (2025)

The Geometries of Truth Are Orthogonal Across Tasks
par: Azizian, Waiss, et autres
Publié: (2025)

Amortized Active Learning for Nonparametric Functions
par: Li, Cen-You, et autres
Publié: (2024)

Amortized Variational Inference for Partial-Label Learning: A Probabilistic Approach to Label Disambiguation
par: Fuchs, Tobias, et autres
Publié: (2025)

LILO: Learning Interpretable Libraries by Compressing and Documenting Code
par: Grand, Gabriel, et autres
Publié: (2023)

Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures
par: Vesseron, Nina, et autres
Publié: (2025)

A Tale of Two Temperatures: Simple, Efficient, and Diverse Sampling from Diffusion Language Models
par: Olausson, Theo X., et autres
Publié: (2026)

Learning Decision Trees as Amortized Structure Inference
par: Mahfoud, Mohammed, et autres
Publié: (2025)

Simple ReFlow: Improved Techniques for Fast Flow Models
par: Kim, Beomsu, et autres
Publié: (2024)

The Design Space of Tri-Modal Masked Diffusion Models
par: Bethune, Louis, et autres
Publié: (2026)

Locking Pretrained Weights via Deep Low-Rank Residual Distillation
par: Sakamoto, Keitaro, et autres
Publié: (2026)

The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?
par: Gu, Alex, et autres
Publié: (2024)

A Specialized Semismooth Newton Method for Kernel-Based Optimal Transport
par: Lin, Tianyi, et autres
Publié: (2023)

The Coupling Within: Flow Matching via Distilled Normalizing Flows
par: Berthelot, David, et autres
Publié: (2026)

Amortized Sampling with Transferable Normalizing Flows
par: Tan, Charlie B., et autres
Publié: (2025)

A Galois theorem for machine learning: Functions on symmetric matrices and point clouds via lightweight invariant features
par: Blum-Smith, Ben, et autres
Publié: (2024)

k-Maximum Inner Product Attention for Graph Transformers and the Expressive Power of GraphGPS
par: De Schouwer, Jonas, et autres
Publié: (2026)

Latent Geometry Beyond Search: Amortizing Planning in World Models
par: Nguyen, Hoang, et autres
Publié: (2026)

Self-Refining Training for Amortized Density Functional Theory
par: Hassan, Majdi, et autres
Publié: (2025)

Controlling Language and Diffusion Models by Transporting Activations
par: Rodriguez, Pau, et autres
Publié: (2024)

Unsupervised Continual Learning for Amortized Bayesian Inference
par: Mishra, Aayush, et autres
Publié: (2026)

Memory-Amortized Inference: A Topological Unification of Search, Closure, and Structure
par: Li, Xin
Publié: (2025)

Amortized Bayesian Workflow
par: Li, Chengkun, et autres
Publié: (2024)