:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tennenholtz, Guy, Chow, Yinlam, Hsu, Chih-Wei, Jeong, Jihwan, Shani, Lior, Tulepbergenov, Azamat, Ramachandran, Deepak, Mladenov, Martin, Boutilier, Craig
Format:	Preprint
Published:	2023
Subjects:	Computation and Language Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2310.04475
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Embedding-Aligned Language Models
by: Tennenholtz, Guy, et al.
Published: (2024)

Descriptive History Representations: Learning Representations by Answering Questions
by: Tennenholtz, Guy, et al.
Published: (2025)

Preference Adaptive and Sequential Text-to-Image Generation
by: Nabati, Ofir, et al.
Published: (2024)

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
by: Liang, Anthony, et al.
Published: (2024)

Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)
by: Ryu, Moonkyung, et al.
Published: (2025)

Controllable User Simulation
by: Tennenholtz, Guy, et al.
Published: (2026)

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
by: Chow, Yinlam, et al.
Published: (2024)

Asking Clarifying Questions for Preference Elicitation With Large Language Models
by: Montazeralghaem, Ali, et al.
Published: (2025)

ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders
by: Meshi, Ofer, et al.
Published: (2026)

Diffusion Controller: Framework, Algorithms and Parameterization
by: Yang, Tong, et al.
Published: (2026)

Spectral Souping: A Unified Framework for Online Preference Alignment
by: Chow, Yinlam, et al.
Published: (2026)

Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
by: Hsu, Chih-Wei, et al.
Published: (2024)

pyRDDLGym: From RDDL to Gym Environments
by: Taitler, Ayal, et al.
Published: (2022)

Heuristics for Partially Observable Stochastic Contingent Planning
by: Shani, Guy
Published: (2024)

Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
by: Linial, Ori, et al.
Published: (2024)

Bayesian Regret Minimization in Offline Bandits
by: Petrik, Marek, et al.
Published: (2023)

Representation-Driven Reinforcement Learning
by: Nabati, Ofir, et al.
Published: (2023)

Predicting User Satisfaction in Online Education Platforms: A Large Language Model Based Multi-Modal Review Mining Framework
by: Bekov, Arman, et al.
Published: (2026)

White Hat Search Engine Optimization using Large Language Models
by: Bardas, Niv, et al.
Published: (2025)

Advanced Memristor Modeling
by: Mladenov, Valeri
Published: (2022)

Differential graded categories in holomorphic symplectic geometry
by: Mladenov, Borislav
Published: (2026)

Formality of differential graded algebras and complex Lagrangian submanifolds
by: Mladenov, Borislav
Published: (2020)

Spectral Bellman Method: Unifying Representation and Exploration in RL
by: Nabati, Ofir, et al.
Published: (2025)

O'QUVCHILARINI MA'NAVIY AXLOQIY RUHDA TARBIYALASH
by: Ilhomov, Azamat
Published: (2025)

REKLAMA XIZMATLARIDA MAVJUD MUAMMOLAR VA ULARNING YECHIMLARI
by: Xamrakulov, Azamat
Published: (2025)

THEORETICAL FOUNDATIONS OF THE SEMANTICS OF IDIOMS: COMPREHENSIVE ANALYSIS OF THE SEMANTIC SYSTEM OF PHRASEOLOGICAL UNITS
by: Jurakhonov, Azamat
Published: (2026)

FOREIGN EXPERIENCE IN STAFFING THE ACTIVITIES OF INVESTIGATIVE UNITS
by: Abdreimov, Azamat
Published: (2026)

WHAT DO FLEXIBLE WORKERS WANT FROM MANAGERS?
by: Azamat Sulaymonov
Published: (2025)

O'ZBEKISTON IQTISODIYOTIGA TO'G'RIDAN-TO'G'RI XORIJIY INVESTITSIYALAR OQIMINI OSHIRISH OMILLARI
by: Madaminov, Azamat
Published: (2025)

METHODOLOGICAL BASIS FOR DEVELOPING DEONTOLOGICAL READINESS IN FUTURE SPECIALISTS
by: Kabulov, Azamat
Published: (2026)

Desarrollo fiscal progresivo en el sistema tributario: un análisis comparativo de la UE y los países de Asia Central
by: Azamat Baimagambetov
Published: (2025)

SoK: Demystifying the multiverse of MPC protocols
by: De Viti, Roberta, et al.
Published: (2025)

Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward
by: Wan, Yanming, et al.
Published: (2025)

Demystifying Singular Defects in Large Language Models
by: Wang, Haoqi, et al.
Published: (2025)

Demystifying Verbatim Memorization in Large Language Models
by: Huang, Jing, et al.
Published: (2024)

Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval
by: Wu, Haolun, et al.
Published: (2023)

Privacy Preserving Multi Agent Path Finding
by: Lehman, Rotem Lev, et al.
Published: (2026)

Processes of control and verification of settlements with employees and scholarship recipients in budget organizations
by: Shonazarovich, Kuliboyev Azamat
Published: (2025)

LITSENZIYALASH TIZIMI ISLOHOTIDA KORRUPSIYAGA QARSHI KURASHISH OMILINI KUCHAYTIRISH
by: Bahodirov, Azamat Alimovich
Published: (2025)

Mechanisms of control and audit of settlements with employees and scholarship recipients in budgetary organizations
by: Shonazarovich, Kuliboyev Azamat
Published: (2025)