Saved in:
| Main Authors: | Jeknić, Isidora, Schlangen, David, Koller, Alexander |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.08202 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Collaborative Problem-Solving in an Optimization Game
by: Jeknic, Isidora, et al.
Published: (2025)
by: Jeknic, Isidora, et al.
Published: (2025)
Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge
by: Madureira, Brielen, et al.
Published: (2022)
by: Madureira, Brielen, et al.
Published: (2022)
Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
by: Sadler, Philipp, et al.
Published: (2024)
by: Sadler, Philipp, et al.
Published: (2024)
Multi-Turn Multi-Agent Dialogue for Collaborative Reconstruction Improves VLM Performance on Spatial Reasoning, But Only Barely
by: Kranti, Chalamalasetti, et al.
Published: (2026)
by: Kranti, Chalamalasetti, et al.
Published: (2026)
A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench
by: Schlangen, David, et al.
Published: (2025)
by: Schlangen, David, et al.
Published: (2025)
Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
by: Sadler, Philipp, et al.
Published: (2024)
by: Sadler, Philipp, et al.
Published: (2024)
The Image Reconstruction Game: Drawing Common Ground Through Iterative Multimodal Dialogue
by: Hakimov, Sherzod, et al.
Published: (2026)
by: Hakimov, Sherzod, et al.
Published: (2026)
Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
by: Momentè, Filippo, et al.
Published: (2025)
by: Momentè, Filippo, et al.
Published: (2025)
clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations
by: Kranti, Chalamalasetti, et al.
Published: (2025)
by: Kranti, Chalamalasetti, et al.
Published: (2025)
Prior Lessons of Incremental Dialogue and Robot Action Management for the Age of Language Models
by: Kennington, Casey, et al.
Published: (2025)
by: Kennington, Casey, et al.
Published: (2025)
Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models
by: Hakimov, Sherzod, et al.
Published: (2025)
by: Hakimov, Sherzod, et al.
Published: (2025)
LLMs as Function Approximators: Terminology, Taxonomy, and Questions for Evaluation
by: Schlangen, David
Published: (2024)
by: Schlangen, David
Published: (2024)
Taking Action Towards Graceful Interaction: The Effects of Performing Actions on Modelling Policies for Instruction Clarification Requests
by: Madureira, Brielen, et al.
Published: (2024)
by: Madureira, Brielen, et al.
Published: (2024)
It Couldn't Help But Overhear: On the Limits of Modelling Meta-Communicative Grounding Acts with Supervised Learning
by: Madureira, Brielen, et al.
Published: (2024)
by: Madureira, Brielen, et al.
Published: (2024)
Could the Road to Grounded, Neuro-symbolic AI be Paved with Words-as-Classifiers?
by: Kennington, Casey, et al.
Published: (2025)
by: Kennington, Casey, et al.
Published: (2025)
Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
by: Madureira, Brielen, et al.
Published: (2020)
by: Madureira, Brielen, et al.
Published: (2020)
How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics
by: Bhavsar, Nidhir, et al.
Published: (2024)
by: Bhavsar, Nidhir, et al.
Published: (2024)
Characterizing Language Use in a Collaborative Situated Game
by: Tomlin, Nicholas, et al.
Published: (2025)
by: Tomlin, Nicholas, et al.
Published: (2025)
Retrieval-Augmented Code Generation for Situated Action Generation: A Case Study on Minecraft
by: Kranti, Chalamalasetti, et al.
Published: (2024)
by: Kranti, Chalamalasetti, et al.
Published: (2024)
A Survey on Complex Tasks for Goal-Directed Interactive Agents
by: Hartmann, Mareike, et al.
Published: (2024)
by: Hartmann, Mareike, et al.
Published: (2024)
When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality
by: Madureira, Brielen, et al.
Published: (2024)
by: Madureira, Brielen, et al.
Published: (2024)
The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization
by: Borec, Luka, et al.
Published: (2024)
by: Borec, Luka, et al.
Published: (2024)
Towards Incremental Transformers: An Empirical Analysis of Transformer Models for Incremental NLU
by: Kahardipraja, Patrick, et al.
Published: (2021)
by: Kahardipraja, Patrick, et al.
Published: (2021)
Plant in Cupboard, Orange on Rably, Inat Aphone. Benchmarking Incremental Learning of Situation and Language Model using a Text-Simulated Situated Environment
by: Jordan, Jonathan, et al.
Published: (2025)
by: Jordan, Jonathan, et al.
Published: (2025)
From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning
by: Kranti, Chalamalasetti, et al.
Published: (2025)
by: Kranti, Chalamalasetti, et al.
Published: (2025)
Simple and effective data augmentation for compositional generalization
by: Yao, Yuekun, et al.
Published: (2024)
by: Yao, Yuekun, et al.
Published: (2024)
Fine-grained Controllable Text Generation through In-context Learning with Feedback
by: Thillainathan, Sarubi, et al.
Published: (2024)
by: Thillainathan, Sarubi, et al.
Published: (2024)
Predicting generalization performance with correctness discriminators
by: Yao, Yuekun, et al.
Published: (2023)
by: Yao, Yuekun, et al.
Published: (2023)
SafeTy Reasoning Elicitation Alignment for Multi-Turn Dialogues
by: Kuo, Martin, et al.
Published: (2025)
by: Kuo, Martin, et al.
Published: (2025)
Towards No-Code Programming of Cobots: Experiments with Code Synthesis by Large Code Models for Conversational Programming
by: Kranti, Chalamalasetti, et al.
Published: (2024)
by: Kranti, Chalamalasetti, et al.
Published: (2024)
Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes
by: Li, Meng, et al.
Published: (2025)
by: Li, Meng, et al.
Published: (2025)
Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models
by: Hakimov, Sherzod, et al.
Published: (2024)
by: Hakimov, Sherzod, et al.
Published: (2024)
Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)
by: Ryu, Moonkyung, et al.
Published: (2025)
by: Ryu, Moonkyung, et al.
Published: (2025)
Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs
by: Buakhaw, Pasin, et al.
Published: (2025)
by: Buakhaw, Pasin, et al.
Published: (2025)
Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations
by: Lindemann, Matthias, et al.
Published: (2024)
by: Lindemann, Matthias, et al.
Published: (2024)
SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation
by: Lindemann, Matthias, et al.
Published: (2023)
by: Lindemann, Matthias, et al.
Published: (2023)
LLMs syntactically adapt their language use to their conversational partner
by: Kandra, Florian, et al.
Published: (2025)
by: Kandra, Florian, et al.
Published: (2025)
Direct Neural Machine Translation with Task-level Mixture of Experts models
by: Tourni, Isidora Chara, et al.
Published: (2023)
by: Tourni, Isidora Chara, et al.
Published: (2023)
Towards Negotiative Dialogue for the Talkamatic Dialogue Manager
by: Larsson, Staffan, et al.
Published: (2024)
by: Larsson, Staffan, et al.
Published: (2024)
A Knapsack by Any Other Name: Presentation impacts LLM performance on NP-hard problems
by: Duchnowski, Alex, et al.
Published: (2025)
by: Duchnowski, Alex, et al.
Published: (2025)
Similar Items
-
Collaborative Problem-Solving in an Optimization Game
by: Jeknic, Isidora, et al.
Published: (2025) -
Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge
by: Madureira, Brielen, et al.
Published: (2022) -
Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
by: Sadler, Philipp, et al.
Published: (2024) -
Multi-Turn Multi-Agent Dialogue for Collaborative Reconstruction Improves VLM Performance on Spatial Reasoning, But Only Barely
by: Kranti, Chalamalasetti, et al.
Published: (2026) -
A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench
by: Schlangen, David, et al.
Published: (2025)