Saved in:
| Main Authors: | Li, Lu, Zhang, Tianyu, Bu, Zhiqi, Wang, Suyuchen, He, Huan, Fu, Jie, Wu, Yonghui, Bian, Jiang, Chen, Yong, Bengio, Yoshua |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.07529 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text
by: Zhang, Tianyu, et al.
Published: (2024)
by: Zhang, Tianyu, et al.
Published: (2024)
Scope: Selective Cross-modal Orchestration of Visual Perception Experts
by: Zhang, Tianyu, et al.
Published: (2025)
by: Zhang, Tianyu, et al.
Published: (2025)
Learning Decision Trees as Amortized Structure Inference
by: Mahfoud, Mohammed, et al.
Published: (2025)
by: Mahfoud, Mohammed, et al.
Published: (2025)
Amortizing intractable inference in large language models
by: Hu, Edward J., et al.
Published: (2023)
by: Hu, Edward J., et al.
Published: (2023)
Machine learning and information theory concepts towards an AI Mathematician
by: Bengio, Yoshua, et al.
Published: (2024)
by: Bengio, Yoshua, et al.
Published: (2024)
Large Language Models as Amortized Pareto-Front Generators for Constrained Bi-Objective Convex Optimization
by: Xu, Peipei, et al.
Published: (2026)
by: Xu, Peipei, et al.
Published: (2026)
Baking Symmetry into GFlowNets
by: Ma, George, et al.
Published: (2024)
by: Ma, George, et al.
Published: (2024)
Amortizing intractable inference in diffusion models for vision, language, and control
by: Venkatraman, Siddarth, et al.
Published: (2024)
by: Venkatraman, Siddarth, et al.
Published: (2024)
Amortized Active Generation of Pareto Sets
by: Steinberg, Daniel M., et al.
Published: (2025)
by: Steinberg, Daniel M., et al.
Published: (2025)
Noticing the Watcher: LLM Agents Can Infer CoT Monitoring from Blocking Feedback
by: Jiralerspong, Thomas, et al.
Published: (2026)
by: Jiralerspong, Thomas, et al.
Published: (2026)
Scaling depth capacity via zero/one-layer model expansion
by: Bu, Zhiqi
Published: (2025)
by: Bu, Zhiqi
Published: (2025)
Fast Monte Carlo Tree Diffusion: 100x Speedup via Parallel Sparse Planning
by: Yoon, Jaesik, et al.
Published: (2025)
by: Yoon, Jaesik, et al.
Published: (2025)
When 2D Tasks Meet 1D Serialization: On Serialization Friction in Structured Tasks
by: Lo, Chung-Hsiang, et al.
Published: (2026)
by: Lo, Chung-Hsiang, et al.
Published: (2026)
Learning What Matters: Steering Diffusion via Spectrally Anisotropic Forward Noise
by: Scimeca, Luca, et al.
Published: (2025)
by: Scimeca, Luca, et al.
Published: (2025)
On Generalization for Generative Flow Networks
by: Krichel, Anas, et al.
Published: (2024)
by: Krichel, Anas, et al.
Published: (2024)
Interventional Causal Representation Learning
by: Ahuja, Kartik, et al.
Published: (2022)
by: Ahuja, Kartik, et al.
Published: (2022)
A Complexity-Based Theory of Compositionality
by: Elmoznino, Eric, et al.
Published: (2024)
by: Elmoznino, Eric, et al.
Published: (2024)
Visual symbolic mechanisms: Emergent symbol processing in vision language models
by: Assouel, Rim, et al.
Published: (2025)
by: Assouel, Rim, et al.
Published: (2025)
Relative Trajectory Balance is equivalent to Trust-PCL
by: Deleu, Tristan, et al.
Published: (2025)
by: Deleu, Tristan, et al.
Published: (2025)
In-Context Parametric Inference: Point or Distribution Estimators?
by: Mittal, Sarthak, et al.
Published: (2025)
by: Mittal, Sarthak, et al.
Published: (2025)
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
by: Liu, Ruohong, et al.
Published: (2024)
by: Liu, Ruohong, et al.
Published: (2024)
Optimal Linear MAP Decoding of Convolutional Codes
by: Li, Yonghui, et al.
Published: (2025)
by: Li, Yonghui, et al.
Published: (2025)
GFlowNet Foundations
by: Bengio, Yoshua, et al.
Published: (2021)
by: Bengio, Yoshua, et al.
Published: (2021)
Expected flow networks in stochastic environments and two-player zero-sum games
by: Jiralerspong, Marco, et al.
Published: (2023)
by: Jiralerspong, Marco, et al.
Published: (2023)
Memory Efficient Neural Processes via Constant Memory Attention Block
by: Feng, Leo, et al.
Published: (2023)
by: Feng, Leo, et al.
Published: (2023)
Adaptive parameter-efficient fine-tuning via Hessian-informed subset selection
by: Xu, Shiyun, et al.
Published: (2025)
by: Xu, Shiyun, et al.
Published: (2025)
Pareto Front Approximation for Multi-Objective Session-Based Recommender Systems
by: Wilm, Timo, et al.
Published: (2024)
by: Wilm, Timo, et al.
Published: (2024)
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
by: Liu, Zhen, et al.
Published: (2024)
by: Liu, Zhen, et al.
Published: (2024)
Can Safety Fine-Tuning Be More Principled? Lessons Learned from Cybersecurity
by: Williams-King, David, et al.
Published: (2025)
by: Williams-King, David, et al.
Published: (2025)
RL, but don't do anything I wouldn't do
by: Cohen, Michael K., et al.
Published: (2024)
by: Cohen, Michael K., et al.
Published: (2024)
Local Search GFlowNets
by: Kim, Minsu, et al.
Published: (2023)
by: Kim, Minsu, et al.
Published: (2023)
Active Attacks: Red-teaming LLMs via Adaptive Environments
by: Yun, Taeyoung, et al.
Published: (2025)
by: Yun, Taeyoung, et al.
Published: (2025)
Cascaded Transformer for Robust and Scalable SLA Decomposition via Amortized Optimization
by: Hsu, Cyril Shih-Huan
Published: (2026)
by: Hsu, Cyril Shih-Huan
Published: (2026)
Monte Carlo Tree Diffusion for System 2 Planning
by: Yoon, Jaesik, et al.
Published: (2025)
by: Yoon, Jaesik, et al.
Published: (2025)
Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control
by: Jiralerspong, Thomas, et al.
Published: (2025)
by: Jiralerspong, Thomas, et al.
Published: (2025)
Discrete Probabilistic Inference as Control in Multi-path Environments
by: Deleu, Tristan, et al.
Published: (2024)
by: Deleu, Tristan, et al.
Published: (2024)
In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior
by: Berkes, Anaïs, et al.
Published: (2026)
by: Berkes, Anaïs, et al.
Published: (2026)
Rejecting Hallucinated State Targets during Planning
by: Zhao, Mingde, et al.
Published: (2024)
by: Zhao, Mingde, et al.
Published: (2024)
Efficient Causal Graph Discovery Using Large Language Models
by: Jiralerspong, Thomas, et al.
Published: (2024)
by: Jiralerspong, Thomas, et al.
Published: (2024)
AP-BMM: Approximating Capability-Cost Pareto Sets of LLMs via Asynchronous Prior-Guided Bayesian Model Merging
by: Chen, Kesheng, et al.
Published: (2025)
by: Chen, Kesheng, et al.
Published: (2025)
Similar Items
-
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text
by: Zhang, Tianyu, et al.
Published: (2024) -
Scope: Selective Cross-modal Orchestration of Visual Perception Experts
by: Zhang, Tianyu, et al.
Published: (2025) -
Learning Decision Trees as Amortized Structure Inference
by: Mahfoud, Mohammed, et al.
Published: (2025) -
Amortizing intractable inference in large language models
by: Hu, Edward J., et al.
Published: (2023) -
Machine learning and information theory concepts towards an AI Mathematician
by: Bengio, Yoshua, et al.
Published: (2024)