:: Library Catalog

Bibliographic Details
Main Authors:	Jeknić, Isidora, Schlangen, David, Koller, Alexander
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2406.08202

Similar Items

Collaborative Problem-Solving in an Optimization Game
by: Jeknić, Isidora, et al.
Published: (2025)

Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge
by: Madureira, Brielen, et al.
Published: (2022)

Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
by: Sadler, Philipp, et al.
Published: (2024)

Multi-Turn Multi-Agent Dialogue for Collaborative Reconstruction Improves VLM Performance on Spatial Reasoning, But Only Barely
by: Kranti, Chalamalasetti, et al.
Published: (2026)

A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench
by: Schlangen, David, et al.
Published: (2025)

Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
by: Sadler, Philipp, et al.
Published: (2024)

The Image Reconstruction Game: Drawing Common Ground Through Iterative Multimodal Dialogue
by: Hakimov, Sherzod, et al.
Published: (2026)

Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
by: Momentè, Filippo, et al.
Published: (2025)

clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations
by: Kranti, Chalamalasetti, et al.
Published: (2025)

Prior Lessons of Incremental Dialogue and Robot Action Management for the Age of Language Models
by: Kennington, Casey, et al.
Published: (2025)

Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models
by: Hakimov, Sherzod, et al.
Published: (2025)

LLMs as Function Approximators: Terminology, Taxonomy, and Questions for Evaluation
by: Schlangen, David
Published: (2024)

Taking Action Towards Graceful Interaction: The Effects of Performing Actions on Modelling Policies for Instruction Clarification Requests
by: Madureira, Brielen, et al.
Published: (2024)

It Couldn't Help But Overhear: On the Limits of Modelling Meta-Communicative Grounding Acts with Supervised Learning
by: Madureira, Brielen, et al.
Published: (2024)

Could the Road to Grounded, Neuro-symbolic AI be Paved with Words-as-Classifiers?
by: Kennington, Casey, et al.
Published: (2025)

Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
by: Madureira, Brielen, et al.
Published: (2020)

How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics
by: Bhavsar, Nidhir, et al.
Published: (2024)

Characterizing Language Use in a Collaborative Situated Game
by: Tomlin, Nicholas, et al.
Published: (2025)

Retrieval-Augmented Code Generation for Situated Action Generation: A Case Study on Minecraft
by: Kranti, Chalamalasetti, et al.
Published: (2024)

A Survey on Complex Tasks for Goal-Directed Interactive Agents
by: Hartmann, Mareike, et al.
Published: (2024)

When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality
by: Madureira, Brielen, et al.
Published: (2024)

The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization
by: Borec, Luka, et al.
Published: (2024)

Towards Incremental Transformers: An Empirical Analysis of Transformer Models for Incremental NLU
by: Kahardipraja, Patrick, et al.
Published: (2021)

Plant in Cupboard, Orange on Rably, Inat Aphone. Benchmarking Incremental Learning of Situation and Language Model using a Text-Simulated Situated Environment
by: Jordan, Jonathan, et al.
Published: (2025)

From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning
by: Kranti, Chalamalasetti, et al.
Published: (2025)

Simple and effective data augmentation for compositional generalization
by: Yao, Yuekun, et al.
Published: (2024)

Fine-grained Controllable Text Generation through In-context Learning with Feedback
by: Thillainathan, Sarubi, et al.
Published: (2024)

Predicting generalization performance with correctness discriminators
by: Yao, Yuekun, et al.
Published: (2023)

SafeTy Reasoning Elicitation Alignment for Multi-Turn Dialogues
by: Kuo, Martin, et al.
Published: (2025)

Towards No-Code Programming of Cobots: Experiments with Code Synthesis by Large Code Models for Conversational Programming
by: Kranti, Chalamalasetti, et al.
Published: (2024)

Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes
by: Li, Meng, et al.
Published: (2025)

Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models
by: Hakimov, Sherzod, et al.
Published: (2024)

Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)
by: Ryu, Moonkyung, et al.
Published: (2025)

Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs
by: Buakhaw, Pasin, et al.
Published: (2025)

Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations
by: Lindemann, Matthias, et al.
Published: (2024)

SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation
by: Lindemann, Matthias, et al.
Published: (2023)

LLMs syntactically adapt their language use to their conversational partner
by: Kandra, Florian, et al.
Published: (2025)

Direct Neural Machine Translation with Task-level Mixture of Experts models
by: Tourni, Isidora Chara, et al.
Published: (2023)

Towards Negotiative Dialogue for the Talkamatic Dialogue Manager
by: Larsson, Staffan, et al.
Published: (2024)

A Knapsack by Any Other Name: Presentation impacts LLM performance on NP-hard problems
by: Duchnowski, Alex, et al.
Published: (2025)