Saved in:
| Main Authors: | Cundy, Chris, Ermon, Stefano |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2306.05426 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF
by: Belakaria, Syrine, et al.
Published: (2025)
by: Belakaria, Syrine, et al.
Published: (2025)
Preference Learning with Lie Detectors can Induce Honesty or Evasion
by: Cundy, Chris, et al.
Published: (2025)
by: Cundy, Chris, et al.
Published: (2025)
Generative Modeling with Flux Matching
by: Pao-Huang, Peter, et al.
Published: (2026)
by: Pao-Huang, Peter, et al.
Published: (2026)
Inductive Moment Matching
by: Zhou, Linqi, et al.
Published: (2025)
by: Zhou, Linqi, et al.
Published: (2025)
Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
by: Cundy, Chris, et al.
Published: (2020)
by: Cundy, Chris, et al.
Published: (2020)
Autoregressive Action Sequence Learning for Robotic Manipulation
by: Zhang, Xinyu, et al.
Published: (2024)
by: Zhang, Xinyu, et al.
Published: (2024)
Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective
by: Ou, Jingyang, et al.
Published: (2025)
by: Ou, Jingyang, et al.
Published: (2025)
ABC: Any-Subset Autoregression via Non-Markovian Diffusion Bridges in Continuous Time and Space
by: Guo, Gabe, et al.
Published: (2026)
by: Guo, Gabe, et al.
Published: (2026)
The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
by: Taufeeque, Mohammad, et al.
Published: (2026)
by: Taufeeque, Mohammad, et al.
Published: (2026)
FlatASCEND: Autoregressive Clinical Sequence Generation with Continuous Time Prediction and Association-Based Pharmacological Testing
by: Sainsbury, Chris, et al.
Published: (2026)
by: Sainsbury, Chris, et al.
Published: (2026)
Handling and Interpreting Missing Modalities in Patient Clinical Trajectories via Autoregressive Sequence Modeling
by: Wang, Andrew, et al.
Published: (2026)
by: Wang, Andrew, et al.
Published: (2026)
Imitation Learning as Return Distribution Matching
by: Lazzati, Filippo, et al.
Published: (2025)
by: Lazzati, Filippo, et al.
Published: (2025)
Reinforcement Learning with Backtracking Feedback
by: Sel, Bilgehan, et al.
Published: (2026)
by: Sel, Bilgehan, et al.
Published: (2026)
Recasting Continual Learning as Sequence Modeling
by: Lee, Soochan, et al.
Published: (2023)
by: Lee, Soochan, et al.
Published: (2023)
Behavioral Sequence Modeling with Ensemble Learning
by: Kawawa-Beaudan, Maxime, et al.
Published: (2024)
by: Kawawa-Beaudan, Maxime, et al.
Published: (2024)
AI Companies Should Report Pre- and Post-Mitigation Safety Evaluations
by: Bowen, Dillon, et al.
Published: (2025)
by: Bowen, Dillon, et al.
Published: (2025)
Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations
by: Selvam, Nikil Roashan, et al.
Published: (2024)
by: Selvam, Nikil Roashan, et al.
Published: (2024)
Detecting the Future: All-at-Once Event Sequence Forecasting with Horizon Matching
by: Karpukhin, Ivan, et al.
Published: (2024)
by: Karpukhin, Ivan, et al.
Published: (2024)
FastDiSS: Few-step Match Many-step Diffusion Language Model on Sequence-to-Sequence Generation--Full Version
by: Nguyen-Cong, Dat, et al.
Published: (2026)
by: Nguyen-Cong, Dat, et al.
Published: (2026)
Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation
by: Manvi, Rohin, et al.
Published: (2024)
by: Manvi, Rohin, et al.
Published: (2024)
Sequence-to-Sequence Spanish Pre-trained Language Models
by: Araujo, Vladimir, et al.
Published: (2023)
by: Araujo, Vladimir, et al.
Published: (2023)
MADiff: Offline Multi-agent Learning with Diffusion Models
by: Zhu, Zhengbang, et al.
Published: (2023)
by: Zhu, Zhengbang, et al.
Published: (2023)
Deep Backtracking Counterfactuals for Causally Compliant Explanations
by: Kladny, Klaus-Rudolf, et al.
Published: (2023)
by: Kladny, Klaus-Rudolf, et al.
Published: (2023)
CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models
by: Hu, Zheyuan, et al.
Published: (2025)
by: Hu, Zheyuan, et al.
Published: (2025)
Learning Submodular Sequencing from Samples
by: Yuan, Jing, et al.
Published: (2024)
by: Yuan, Jing, et al.
Published: (2024)
Backtracking Improves Generation Safety
by: Zhang, Yiming, et al.
Published: (2024)
by: Zhang, Yiming, et al.
Published: (2024)
The Principles of Diffusion Models
by: Lai, Chieh-Hsin, et al.
Published: (2025)
by: Lai, Chieh-Hsin, et al.
Published: (2025)
Anomalous State Sequence Modeling to Enhance Safety in Reinforcement Learning
by: Kweider, Leen, et al.
Published: (2024)
by: Kweider, Leen, et al.
Published: (2024)
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
by: He, Shenghong
Published: (2025)
by: He, Shenghong
Published: (2025)
One-Shot Imitation Learning with Invariance Matching for Robotic Manipulation
by: Zhang, Xinyu, et al.
Published: (2024)
by: Zhang, Xinyu, et al.
Published: (2024)
Calibrated Probabilistic Forecasts for Arbitrary Sequences
by: Marx, Charles, et al.
Published: (2024)
by: Marx, Charles, et al.
Published: (2024)
GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters
by: Zhao, Wanjia, et al.
Published: (2025)
by: Zhao, Wanjia, et al.
Published: (2025)
Active Learning for Derivative-Based Global Sensitivity Analysis with Gaussian Processes
by: Belakaria, Syrine, et al.
Published: (2024)
by: Belakaria, Syrine, et al.
Published: (2024)
Sequencing to Mitigate Catastrophic Forgetting in Continual Learning
by: Moussa, Hesham G., et al.
Published: (2025)
by: Moussa, Hesham G., et al.
Published: (2025)
Symbolic Autoencoding for Self-Supervised Sequence Learning
by: Amani, Mohammad Hossein, et al.
Published: (2024)
by: Amani, Mohammad Hossein, et al.
Published: (2024)
Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing
by: Qiu, Zijie, et al.
Published: (2025)
by: Qiu, Zijie, et al.
Published: (2025)
Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling
by: Xu, Jiawei, et al.
Published: (2024)
by: Xu, Jiawei, et al.
Published: (2024)
Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution
by: Wu, Tailin, et al.
Published: (2024)
by: Wu, Tailin, et al.
Published: (2024)
Can Transformers Learn to Verify During Backtracking Search?
by: Phua, Yin Jun, et al.
Published: (2026)
by: Phua, Yin Jun, et al.
Published: (2026)
Reinforcement Learning for Sequence Design Leveraging Protein Language Models
by: Subramanian, Jithendaraa, et al.
Published: (2024)
by: Subramanian, Jithendaraa, et al.
Published: (2024)
Similar Items
-
Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF
by: Belakaria, Syrine, et al.
Published: (2025) -
Preference Learning with Lie Detectors can Induce Honesty or Evasion
by: Cundy, Chris, et al.
Published: (2025) -
Generative Modeling with Flux Matching
by: Pao-Huang, Peter, et al.
Published: (2026) -
Inductive Moment Matching
by: Zhou, Linqi, et al.
Published: (2025) -
Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
by: Cundy, Chris, et al.
Published: (2020)