:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Suresh, Varsha, Mughal, M. Hamza, Theobalt, Christian, Demberg, Vera
Formato:	Preprint
Publicado:	2025
Materias:	Computation and Language
Acceso en línea:	https://arxiv.org/abs/2510.19350
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Enhancing Spoken Discourse Modeling in Language Models Using Gestural Cues
por: Suresh, Varsha, et al.
Publicado: (2025)

Semantic Motion Anchors: Bridging Motion and Meaning in Co-Speech Gestures
por: Suresh, Varsha, et al.
Publicado: (2026)

MIBURI: Towards Expressive Interactive Gesture Synthesis
por: Mughal, M. Hamza, et al.
Publicado: (2026)

Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis
por: Mughal, M. Hamza, et al.
Publicado: (2024)

Generation-Step-Aware Framework for Cross-Modal Representation and Control in Multilingual Speech-Text Models
por: Nakai, Toshiki, et al.
Publicado: (2026)

MUStReason: A Benchmark for Diagnosing Pragmatic Reasoning in Video-LMs for Multimodal Sarcasm Detection
por: Saha, Anisha, et al.
Publicado: (2025)

Synthetic Data Augmentation for Cross-domain Implicit Discourse Relation Recognition
por: Yung, Frances, et al.
Publicado: (2025)

GestureCoach: Rehearsing for Engaging Talks with LLM-Driven Gesture Recommendations
por: Ram, Ashwin, et al.
Publicado: (2025)

MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization
por: Saha, Anisha, et al.
Publicado: (2026)

System-Mediated Attention Imbalances Make Vision-Language Models Say Yes
por: Chan, Tsan Tsai, et al.
Publicado: (2026)

ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
por: Mughal, Muhammad Hamza, et al.
Publicado: (2024)

On Crowdsourcing Task Design for Discourse Relation Annotation
por: Yung, Frances, et al.
Publicado: (2024)

RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework
por: Wang, Yifan, et al.
Publicado: (2024)

The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting
por: Gundappa, Ashwini, et al.
Publicado: (2024)

ChatGPT vs Human-authored Text: Insights into Controllable Text Summarization and Sentence Style Transfer
por: Liu, Dongqi, et al.
Publicado: (2023)

RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization
por: Liu, Dongqi, et al.
Publicado: (2024)

LLMs syntactically adapt their language use to their conversational partner
por: Kandra, Florian, et al.
Publicado: (2025)

Implicit Discourse Relation Classification For Nigerian Pidgin
por: Saeed, Muhammed, et al.
Publicado: (2024)

Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It?
por: Chingacham, Anupama, et al.
Publicado: (2024)

Temperature-scaling surprisal estimates improve fit to human reading times -- but does it do so for the "right reasons"?
por: Liu, Tong, et al.
Publicado: (2023)

Incorporating Distributions of Discourse Structure for Long Document Abstractive Summarization
por: Liu, Dongqi, et al.
Publicado: (2023)

Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin
por: Lin, Pin-Jie, et al.
Publicado: (2024)

Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English
por: Škrjanec, Iza, et al.
Publicado: (2026)

Prompting Implicit Discourse Relation Annotation
por: Yung, Frances, et al.
Publicado: (2024)

Planning Ahead with RSA: Efficient Signalling in Dynamic Environments by Projecting User Awareness across Future Timesteps
por: Das, Anwesha, et al.
Publicado: (2025)

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
por: Arora, Siddhant, et al.
Publicado: (2025)

Human Label Variation in Implicit Discourse Relation Recognition
por: Yung, Frances, et al.
Publicado: (2026)

Pragmatic Reasoning improves LLM Code Generation
por: Cao, Zhuchen, et al.
Publicado: (2025)

Explanatory Summarization with Discourse-Driven Planning
por: Liu, Dongqi, et al.
Publicado: (2025)

SciNews: From Scholarly Complexities to Public Narratives -- A Dataset for Scientific News Report Generation
por: Liu, Dongqi, et al.
Publicado: (2024)

B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability
por: Wang, Yifan, et al.
Publicado: (2025)

The Spatial Semantics of Iconic Gesture
por: Lücking, Andy, et al.
Publicado: (2024)

An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks
por: Suresh, Varsha, et al.
Publicado: (2024)

Tug-of-war between idioms' figurative and literal interpretations in LLMs
por: Oh, Soyoung, et al.
Publicado: (2025)

Prompt-Guided Turn-Taking Prediction
por: Inoue, Koji, et al.
Publicado: (2025)

Synchronization and Turn-Taking in Full-Duplex Speech Dialogue Models
por: Riera, Pablo, et al.
Publicado: (2026)

"Dyadosyncrasy", Idiosyncrasy and Demographic Factors in Turn-Taking
por: Cavalcanti, Julio Cesar, et al.
Publicado: (2025)

Syn-TurnTurk: A Synthetic Dataset for Turn-Taking Prediction in Turkish Dialogues
por: Bayrak, Ahmet Tuğrul, et al.
Publicado: (2026)

Advancing BDD Software Testing: Dynamic Scenario Re-Usability And Step Auto-Complete For Cucumber Framework
por: Mughal, A. H.
Publicado: (2024)

DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining
por: Rajaa, Shangeth
Publicado: (2026)