| Main Authors: | Nabati, Ofir, Tennenholtz, Guy, Hsu, ChihWei, Ryu, Moonkyung, Ramachandran, Deepak, Chow, Yinlam, Li, Xiang, Boutilier, Craig |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.10419 |
Similar Items
Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)
by: Ryu, Moonkyung, et al.
Published: (2025)
Embedding-Aligned Language Models
by: Tennenholtz, Guy, et al.
Published: (2024)
Demystifying Embedding Spaces using Large Language Models
by: Tennenholtz, Guy, et al.
Published: (2023)
Descriptive History Representations: Learning Representations by Answering Questions
by: Tennenholtz, Guy, et al.
Published: (2025)
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
by: Liang, Anthony, et al.
Published: (2024)
Diffusion Controller: Framework, Algorithms and Parameterization
by: Yang, Tong, et al.
Published: (2026)
Representation-Driven Reinforcement Learning
by: Nabati, Ofir, et al.
Published: (2023)
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
by: Chow, Yinlam, et al.
Published: (2024)
Spectral Bellman Method: Unifying Representation and Exploration in RL
by: Nabati, Ofir, et al.
Published: (2025)
Asking Clarifying Questions for Preference Elicitation With Large Language Models
by: Montazeralghaem, Ali, et al.
Published: (2025)
Controllable User Simulation
by: Tennenholtz, Guy, et al.
Published: (2026)
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation
by: Xing, Xiaoying, et al.
Published: (2025)
Spectral Souping: A Unified Framework for Online Preference Alignment
by: Chow, Yinlam, et al.
Published: (2026)
ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders
by: Meshi, Ofer, et al.
Published: (2026)
SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training
by: Refael, Yehonathan, et al.
Published: (2025)
Inference of Utilities and Time Preference in Sequential Decision-Making
by: Cao, Haoyang, et al.
Published: (2024)
Intrinsic Sequentiality in P: Causal Limits of Parallel Computation
by: Wei, Jing-Yuan
Published: (2026)
Policy Gradient Algorithms in Average-Reward Multichain MDPs
by: Lee, Jongmin, et al.
Published: (2026)
Bilevel subsidy-enabled mobility hub network design with perturbed utility coalitional choice-based assignment
by: Yang, Hai, et al.
Published: (2025)
Action Recommendations for Sequentially Rational Strategic Agents
by: Sun, Renyan, et al.
Published: (2026)
Exact and Evolutionary Algorithms for Sequential Multi-Objective Transmission Topology Planning
by: Groeneveld, Job, et al.
Published: (2026)
Adaptive Mobile Manipulation for Articulated Objects In the Open World
by: Xiong, Haoyu, et al.
Published: (2024)
QUIVER: Cost-Aware Adaptive Preference Querying in Surrogate-Assisted Evolutionary Multi-Objective Optimization
by: Burnat, Florian A. D.
Published: (2026)
Immersive and Wearable Thermal Rendering for Augmented Reality
by: Watkins, Alexandra, et al.
Published: (2025)
A GPU-Accelerated Distributed Algorithm for Optimal Power Flow in Distribution Systems
by: Ryu, Minseok, et al.
Published: (2025)
Gap-gradient methods for solving generalized mixed integer inverse optimization: an application to political gerrymandering
by: Smith, Ari J., et al.
Published: (2024)
Direction of slip modulates the perception of slip distance and slip speed
by: Khan, Ayesha Tooba, et al.
Published: (2024)
Continuous-time q-learning for mean-field control problems
by: Wei, Xiaoli, et al.
Published: (2023)
Incorporating the ChEES Criterion into Sequential Monte Carlo Samplers
by: Millard, Andrew, et al.
Published: (2025)
Geometry Denoising with Preferred Normal Vectors
by: Weiß, Manuel, et al.
Published: (2025)
Robustness of Incentive Mechanisms Against System Misspecification in Congestion Games
by: Chiu, Chih-Yuan, et al.
Published: (2025)
LSSGen: Leveraging Latent Space Scaling in Flow and Diffusion for Efficient Text to Image Generation
by: Tang, Jyun-Ze, et al.
Published: (2025)
Sequential Selection with Expirations
by: Xu, Yihua, et al.
Published: (2024)
Fully Stochastic Trust-Region Sequential Quadratic Programming for Equality-Constrained Optimization Problems
by: Fang, Yuchen, et al.
Published: (2022)
Unified continuous-time q-learning for mean-field game and mean-field control problems
by: Wei, Xiaoli, et al.
Published: (2024)
Addressing Unboundedness in Quadratically-Constrained Mixed-Integer Problems
by: Zepko, Guy, et al.
Published: (2024)
Evaluating Text-to-Visual Generation with Image-to-Text Generation
by: Lin, Zhiqiu, et al.
Published: (2024)
PREFER: Personalized Review Summarization with Online Preference Learning
by: Roy, Millend, et al.
Published: (2026)
The Oracle Complexity of Simplex-based Matrix Games
by: Kornowski, Guy, et al.
Published: (2024)
Adaptive Optimal Control for Avatar-Guided Motor Rehabilitation in Virtual Reality
by: De Lellis, Francesco, et al.
Published: (2025)