| Main authors: | Suresh, Varsha; Mughal, M. Hamza; Theobalt, Christian; Demberg, Vera |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2510.19350 |
Similar items
Enhancing Spoken Discourse Modeling in Language Models Using Gestural Cues
by: Suresh, Varsha, et al.
Published: (2025)
Semantic Motion Anchors: Bridging Motion and Meaning in Co-Speech Gestures
by: Suresh, Varsha, et al.
Published: (2026)
MIBURI: Towards Expressive Interactive Gesture Synthesis
by: Mughal, M. Hamza, et al.
Published: (2026)
Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis
by: Mughal, M. Hamza, et al.
Published: (2024)
Generation-Step-Aware Framework for Cross-Modal Representation and Control in Multilingual Speech-Text Models
by: Nakai, Toshiki, et al.
Published: (2026)
MUStReason: A Benchmark for Diagnosing Pragmatic Reasoning in Video-LMs for Multimodal Sarcasm Detection
by: Saha, Anisha, et al.
Published: (2025)
Synthetic Data Augmentation for Cross-domain Implicit Discourse Relation Recognition
by: Yung, Frances, et al.
Published: (2025)
GestureCoach: Rehearsing for Engaging Talks with LLM-Driven Gesture Recommendations
by: Ram, Ashwin, et al.
Published: (2025)
MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization
by: Saha, Anisha, et al.
Published: (2026)
System-Mediated Attention Imbalances Make Vision-Language Models Say Yes
by: Chan, Tsan Tsai, et al.
Published: (2026)
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
by: Mughal, Muhammad Hamza, et al.
Published: (2024)
On Crowdsourcing Task Design for Discourse Relation Annotation
by: Yung, Frances, et al.
Published: (2024)
RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework
by: Wang, Yifan, et al.
Published: (2024)
The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting
by: Gundappa, Ashwini, et al.
Published: (2024)
ChatGPT vs Human-authored Text: Insights into Controllable Text Summarization and Sentence Style Transfer
by: Liu, Dongqi, et al.
Published: (2023)
RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization
by: Liu, Dongqi, et al.
Published: (2024)
LLMs syntactically adapt their language use to their conversational partner
by: Kandra, Florian, et al.
Published: (2025)
Implicit Discourse Relation Classification For Nigerian Pidgin
by: Saeed, Muhammed, et al.
Published: (2024)
Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It?
by: Chingacham, Anupama, et al.
Published: (2024)
Temperature-scaling surprisal estimates improve fit to human reading times -- but does it do so for the "right reasons"?
by: Liu, Tong, et al.
Published: (2023)
Incorporating Distributions of Discourse Structure for Long Document Abstractive Summarization
by: Liu, Dongqi, et al.
Published: (2023)
Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin
by: Lin, Pin-Jie, et al.
Published: (2024)
Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English
by: Škrjanec, Iza, et al.
Published: (2026)
Prompting Implicit Discourse Relation Annotation
by: Yung, Frances, et al.
Published: (2024)
Planning Ahead with RSA: Efficient Signalling in Dynamic Environments by Projecting User Awareness across Future Timesteps
by: Das, Anwesha, et al.
Published: (2025)
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
by: Arora, Siddhant, et al.
Published: (2025)
Human Label Variation in Implicit Discourse Relation Recognition
by: Yung, Frances, et al.
Published: (2026)
Pragmatic Reasoning improves LLM Code Generation
by: Cao, Zhuchen, et al.
Published: (2025)
Explanatory Summarization with Discourse-Driven Planning
by: Liu, Dongqi, et al.
Published: (2025)
SciNews: From Scholarly Complexities to Public Narratives -- A Dataset for Scientific News Report Generation
by: Liu, Dongqi, et al.
Published: (2024)
B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability
by: Wang, Yifan, et al.
Published: (2025)
The Spatial Semantics of Iconic Gesture
by: Lücking, Andy, et al.
Published: (2024)
An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks
by: Suresh, Varsha, et al.
Published: (2024)
Tug-of-war between idioms' figurative and literal interpretations in LLMs
by: Oh, Soyoung, et al.
Published: (2025)
Prompt-Guided Turn-Taking Prediction
by: Inoue, Koji, et al.
Published: (2025)
Synchronization and Turn-Taking in Full-Duplex Speech Dialogue Models
by: Riera, Pablo, et al.
Published: (2026)
"Dyadosyncrasy", Idiosyncrasy and Demographic Factors in Turn-Taking
by: Cavalcanti, Julio Cesar, et al.
Published: (2025)
Syn-TurnTurk: A Synthetic Dataset for Turn-Taking Prediction in Turkish Dialogues
by: Bayrak, Ahmet Tuğrul, et al.
Published: (2026)
Advancing BDD Software Testing: Dynamic Scenario Re-Usability And Step Auto-Complete For Cucumber Framework
by: Mughal, A. H.
Published: (2024)
DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining
by: Rajaa, Shangeth
Published: (2026)