Saved in:
| Main Authors: | Bradley, Arwen, Nakkiran, Preetum, Berthelot, David, Thornton, James, Susskind, Joshua M. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.04549 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo
by: Thornton, James, et al.
Published: (2025)
by: Thornton, James, et al.
Published: (2025)
Classifier-Free Guidance is a Predictor-Corrector
by: Bradley, Arwen, et al.
Published: (2024)
by: Bradley, Arwen, et al.
Published: (2024)
Step-by-Step Diffusion: An Elementary Tutorial
by: Nakkiran, Preetum, et al.
Published: (2024)
by: Nakkiran, Preetum, et al.
Published: (2024)
Vanishing Gradients in Reinforcement Finetuning of Language Models
by: Razin, Noam, et al.
Published: (2023)
by: Razin, Noam, et al.
Published: (2023)
Trained on Tokens, Calibrated on Concepts: The Emergence of Semantic Calibration in LLMs
by: Nakkiran, Preetum, et al.
Published: (2025)
by: Nakkiran, Preetum, et al.
Published: (2025)
Local Mechanisms of Compositional Generalization in Conditional Diffusion
by: Bradley, Arwen
Published: (2025)
by: Bradley, Arwen
Published: (2025)
How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks
by: Littwin, Etai, et al.
Published: (2024)
by: Littwin, Etai, et al.
Published: (2024)
Normalizing Flows are Capable Generative Models
by: Zhai, Shuangfei, et al.
Published: (2024)
by: Zhai, Shuangfei, et al.
Published: (2024)
When is Multicalibration Post-Processing Necessary?
by: Hansen, Dutch, et al.
Published: (2024)
by: Hansen, Dutch, et al.
Published: (2024)
Trace Length is a Simple Uncertainty Signal in Reasoning Models
by: Devic, Siddartha, et al.
Published: (2025)
by: Devic, Siddartha, et al.
Published: (2025)
TADA: Improved Diffusion Sampling with Training-free Augmented Dynamics
by: Chen, Tianrong, et al.
Published: (2025)
by: Chen, Tianrong, et al.
Published: (2025)
Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion
by: Zhang, Ruixiang, et al.
Published: (2025)
by: Zhang, Ruixiang, et al.
Published: (2025)
Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting
by: Mallinar, Neil, et al.
Published: (2022)
by: Mallinar, Neil, et al.
Published: (2022)
To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models
by: Malach, Eran, et al.
Published: (2025)
by: Malach, Eran, et al.
Published: (2025)
Normalizing Trajectory Models
by: Gu, Jiatao, et al.
Published: (2026)
by: Gu, Jiatao, et al.
Published: (2026)
A Formal Framework for Understanding Length Generalization in Transformers
by: Huang, Xinting, et al.
Published: (2024)
by: Huang, Xinting, et al.
Published: (2024)
Manifold Diffusion Fields
by: Elhag, Ahmed A., et al.
Published: (2023)
by: Elhag, Ahmed A., et al.
Published: (2023)
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
by: Gu, Jiatao, et al.
Published: (2025)
by: Gu, Jiatao, et al.
Published: (2025)
The Coupling Within: Flow Matching via Distilled Normalizing Flows
by: Berthelot, David, et al.
Published: (2026)
by: Berthelot, David, et al.
Published: (2026)
Annotations Mitigate Post-Training Mode Collapse
by: Springer, Jacob Mitchell, et al.
Published: (2026)
by: Springer, Jacob Mitchell, et al.
Published: (2026)
Matryoshka Diffusion Models
by: Gu, Jiatao, et al.
Published: (2023)
by: Gu, Jiatao, et al.
Published: (2023)
Simulating Diffusion Bridges with Score Matching
by: Heng, Jeremy, et al.
Published: (2021)
by: Heng, Jeremy, et al.
Published: (2021)
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
by: Gu, Jiatao, et al.
Published: (2025)
by: Gu, Jiatao, et al.
Published: (2025)
Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation
by: Segal, Bradley, et al.
Published: (2025)
by: Segal, Bradley, et al.
Published: (2025)
Generative Modeling with Phase Stochastic Bridges
by: Chen, Tianrong, et al.
Published: (2023)
by: Chen, Tianrong, et al.
Published: (2023)
Compositional Image Decomposition with Diffusion Models
by: Su, Jocelin, et al.
Published: (2024)
by: Su, Jocelin, et al.
Published: (2024)
Normalizing Flows with Iterative Denoising
by: Chen, Tianrong, et al.
Published: (2026)
by: Chen, Tianrong, et al.
Published: (2026)
Swallowing the Bitter Pill: Simplified Scalable Conformer Generation
by: Wang, Yuyang, et al.
Published: (2023)
by: Wang, Yuyang, et al.
Published: (2023)
TRYLOCK: Defense-in-Depth Against LLM Jailbreaks via Layered Preference and Representation Engineering
by: Thornton, Scott
Published: (2026)
by: Thornton, Scott
Published: (2026)
SecureCode: A Production-Grade Multi-Turn Dataset for Training Security-Aware Code Generation Models
by: Thornton, Scott
Published: (2025)
by: Thornton, Scott
Published: (2025)
Differentiable Cost-Parameterized Monge Map Estimators
by: Howard, Samuel, et al.
Published: (2024)
by: Howard, Samuel, et al.
Published: (2024)
Contrasting Multiple Representations with the Multi-Marginal Matching Gap
by: Piran, Zoe, et al.
Published: (2024)
by: Piran, Zoe, et al.
Published: (2024)
Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency
by: Kirchhof, Michael, et al.
Published: (2024)
by: Kirchhof, Michael, et al.
Published: (2024)
Can Adversarial Code Comments Fool AI Security Reviewers -- Large-Scale Empirical Study of Comment-Based Attacks and Defenses Against LLM Code Analysis
by: Thornton, Scott
Published: (2026)
by: Thornton, Scott
Published: (2026)
Retrieval Pivot Attacks in Hybrid RAG: Measuring and Mitigating Amplified Leakage from Vector Seeds to Graph Expansion
by: Thornton, Scott
Published: (2026)
by: Thornton, Scott
Published: (2026)
Semantic Chameleon: Corpus-Dependent Poisoning Attacks and Defenses in RAG Systems
by: Thornton, Scott
Published: (2026)
by: Thornton, Scott
Published: (2026)
When can transformers reason with abstract symbols?
by: Boix-Adsera, Enric, et al.
Published: (2023)
by: Boix-Adsera, Enric, et al.
Published: (2023)
Constrained Synthesis with Projected Diffusion Models
by: Christopher, Jacob K, et al.
Published: (2024)
by: Christopher, Jacob K, et al.
Published: (2024)
Algebraic Diversity: Group-Theoretic Spectral Estimation from Single Observations
by: Thornton, Mitchell A.
Published: (2026)
by: Thornton, Mitchell A.
Published: (2026)
Polynomial-Time Optimal Group Selection via the Double-Commutator Eigenvalue Problem
by: Thornton, Mitchell A.
Published: (2026)
by: Thornton, Mitchell A.
Published: (2026)
Similar Items
-
Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo
by: Thornton, James, et al.
Published: (2025) -
Classifier-Free Guidance is a Predictor-Corrector
by: Bradley, Arwen, et al.
Published: (2024) -
Step-by-Step Diffusion: An Elementary Tutorial
by: Nakkiran, Preetum, et al.
Published: (2024) -
Vanishing Gradients in Reinforcement Finetuning of Language Models
by: Razin, Noam, et al.
Published: (2023) -
Trained on Tokens, Calibrated on Concepts: The Emergence of Semantic Calibration in LLMs
by: Nakkiran, Preetum, et al.
Published: (2025)