| Main Authors: | Tennenholtz, Guy; Chow, Yinlam; Hsu, Chih-Wei; Jeong, Jihwan; Shani, Lior; Tulepbergenov, Azamat; Ramachandran, Deepak; Mladenov, Martin; Boutilier, Craig |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Online Access: | https://arxiv.org/abs/2310.04475 |
Similar Items
Embedding-Aligned Language Models
by: Tennenholtz, Guy, et al.
Published: (2024)
Descriptive History Representations: Learning Representations by Answering Questions
by: Tennenholtz, Guy, et al.
Published: (2025)
Preference Adaptive and Sequential Text-to-Image Generation
by: Nabati, Ofir, et al.
Published: (2024)
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
by: Liang, Anthony, et al.
Published: (2024)
Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)
by: Ryu, Moonkyung, et al.
Published: (2025)
Controllable User Simulation
by: Tennenholtz, Guy, et al.
Published: (2026)
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
by: Chow, Yinlam, et al.
Published: (2024)
Asking Clarifying Questions for Preference Elicitation With Large Language Models
by: Montazeralghaem, Ali, et al.
Published: (2025)
ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders
by: Meshi, Ofer, et al.
Published: (2026)
Diffusion Controller: Framework, Algorithms and Parameterization
by: Yang, Tong, et al.
Published: (2026)
Spectral Souping: A Unified Framework for Online Preference Alignment
by: Chow, Yinlam, et al.
Published: (2026)
Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
by: Hsu, Chih-Wei, et al.
Published: (2024)
pyRDDLGym: From RDDL to Gym Environments
by: Taitler, Ayal, et al.
Published: (2022)
Heuristics for Partially Observable Stochastic Contingent Planning
by: Shani, Guy
Published: (2024)
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
by: Linial, Ori, et al.
Published: (2024)
Bayesian Regret Minimization in Offline Bandits
by: Petrik, Marek, et al.
Published: (2023)
Representation-Driven Reinforcement Learning
by: Nabati, Ofir, et al.
Published: (2023)
Predicting User Satisfaction in Online Education Platforms: A Large Language Model Based Multi-Modal Review Mining Framework
by: Bekov, Arman, et al.
Published: (2026)
White Hat Search Engine Optimization using Large Language Models
by: Bardas, Niv, et al.
Published: (2025)
Advanced Memristor Modeling
by: Mladenov, Valeri
Published: (2022)
Differential graded categories in holomorphic symplectic geometry
by: Mladenov, Borislav
Published: (2026)
Formality of differential graded algebras and complex Lagrangian submanifolds
by: Mladenov, Borislav
Published: (2020)
Spectral Bellman Method: Unifying Representation and Exploration in RL
by: Nabati, Ofir, et al.
Published: (2025)
O'QUVCHILARINI MA'NAVIY AXLOQIY RUHDA TARBIYALASH
by: Ilhomov, Azamat
Published: (2025)
REKLAMA XIZMATLARIDA MAVJUD MUAMMOLAR VA ULARNING YECHIMLARI
by: Xamrakulov, Azamat
Published: (2025)
THEORETICAL FOUNDATIONS OF THE SEMANTICS OF IDIOMS: COMPREHENSIVE ANALYSIS OF THE SEMANTIC SYSTEM OF PHRASEOLOGICAL UNITS
by: Jurakhonov, Azamat
Published: (2026)
FOREIGN EXPERIENCE IN STAFFING THE ACTIVITIES OF INVESTIGATIVE UNITS
by: Abdreimov, Azamat
Published: (2026)
WHAT DO FLEXIBLE WORKERS WANT FROM MANAGERS?
by: Azamat Sulaymonov
Published: (2025)
O'ZBEKISTON IQTISODIYOTIGA TO'G'RIDAN-TO'G'RI XORIJIY INVESTITSIYALAR OQIMINI OSHIRISH OMILLARI
by: Madaminov, Azamat
Published: (2025)
METHODOLOGICAL BASIS FOR DEVELOPING DEONTOLOGICAL READINESS IN FUTURE SPECIALISTS
by: Kabulov, Azamat
Published: (2026)
Desarrollo fiscal progresivo en el sistema tributario: un análisis comparativo de la UE y los países de Asia Central
by: Azamat Baimagambetov
Published: (2025)
SoK: Demystifying the multiverse of MPC protocols
by: De Viti, Roberta, et al.
Published: (2025)
Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward
by: Wan, Yanming, et al.
Published: (2025)
Demystifying Singular Defects in Large Language Models
by: Wang, Haoqi, et al.
Published: (2025)
Demystifying Verbatim Memorization in Large Language Models
by: Huang, Jing, et al.
Published: (2024)
Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval
by: Wu, Haolun, et al.
Published: (2023)
Privacy Preserving Multi Agent Path Finding
by: Lehman, Rotem Lev, et al.
Published: (2026)
Processes of control and verification of settlements with employees and scholarship recipients in budget organizations
by: Shonazarovich, Kuliboyev Azamat
Published: (2025)
LITSENZIYALASH TIZIMI ISLOHOTIDA KORRUPSIYAGA QARSHI KURASHISH OMILINI KUCHAYTIRISH
by: Bahodirov, Azamat Alimovich
Published: (2025)
Mechanisms of control and audit of settlements with employees and scholarship recipients in budgetary organizations
by: Shonazarovich, Kuliboyev Azamat
Published: (2025)