| Main Authors: | Wang, Peng; Lu, Songshuo; Tang, Yaohua; Yan, Sijie; Xia, Wei; Xiong, Yuanjun |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.19487 |
Similar Items
URPO: A Unified Reward & Policy Optimization Framework for Large Language Models
by: Lu, Songshuo, et al.
Published: (2025)
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text
by: Lu, Songshuo, et al.
Published: (2024)
FLEXI: Benchmarking Full-duplex Human-LLM Speech Interaction
by: Ge, Yuan, et al.
Published: (2025)
Towards a Japanese Full-duplex Spoken Dialogue System
by: Ohashi, Atsumoto, et al.
Published: (2025)
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
by: Lin, Guan-Ting, et al.
Published: (2025)
SALMONN-omni: A Standalone Speech LLM without Codec Injection for Full-duplex Conversation
by: Yu, Wenyi, et al.
Published: (2025)
SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation
by: Yu, Wenyi, et al.
Published: (2024)
A Prompt-driven Task Planning Method for Multi-drones based on Large Language Model
by: Liu, Yaohua
Published: (2024)
Full-text Error Correction for Chinese Speech Recognition with Large Language Model
by: Tang, Zhiyuan, et al.
Published: (2024)
Chain of Correction for Full-text Speech Recognition with Large Language Models
by: Tang, Zhiyuan, et al.
Published: (2025)
DialogueAgents: A Hybrid Agent-Based Speech Synthesis Framework for Multi-Party Dialogue
by: Li, Xiang, et al.
Published: (2025)
Synchronization and Turn-Taking in Full-Duplex Speech Dialogue Models
by: Riera, Pablo, et al.
Published: (2026)
SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models
by: Cai, Zicheng, et al.
Published: (2025)
Chronological Thinking in Full-Duplex Spoken Dialogue Language Models
by: Wu, Donghang, et al.
Published: (2025)
Planning with Diffusion Models for Target-Oriented Dialogue Systems
by: Du, Hanwen, et al.
Published: (2025)
SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development
by: Wang, Minghan, et al.
Published: (2025)
Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models
by: Wang, Xiaolong, et al.
Published: (2024)
Closing the Modality Reasoning Gap for Speech Large Language Models
by: Wang, Chaoren, et al.
Published: (2026)
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems
by: Liao, Borui, et al.
Published: (2025)
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue
by: Wu, Junkai, et al.
Published: (2024)
Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models
by: Wang, Yile, et al.
Published: (2024)
EMRModel: A Large Language Model for Extracting Medical Consultation Dialogues into Structured Medical Records
by: Zhao, Shuguang, et al.
Published: (2025)
SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation
by: Zhao, Kun, et al.
Published: (2024)
SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
by: Lu, Haitian, et al.
Published: (2025)
Large Language Model based Situational Dialogues for Second Language Learning
by: Xu, Shuyao, et al.
Published: (2024)
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
by: Luo, Xiang, et al.
Published: (2024)
Cross-Modal Knowledge Distillation for Speech Large Language Models
by: Wang, Enzhi, et al.
Published: (2025)
Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models
by: She, Shuaijie, et al.
Published: (2023)
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
by: Guo, Zhicheng, et al.
Published: (2024)
Sibyl: Empowering Empathetic Dialogue Generation in Large Language Models via Sensible and Visionary Commonsense Inference
by: Wang, Lanrui, et al.
Published: (2023)
Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models
by: Kao, Chang-Sheng, et al.
Published: (2024)
Aligning Large Language Models with Searcher Preferences
by: Wu, Wei, et al.
Published: (2026)
AnchorMem: Anchored Facts with Associative Contexts for Building Memory in Large Language Models
by: Shen, Zhanyu, et al.
Published: (2026)
An Annotation Scheme and Classifier for Personal Facts in Dialogue
by: Zaitsev, Konstantin
Published: (2026)
DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation
by: Zhao, Kun, et al.
Published: (2025)
SpeechR: A Benchmark for Speech Reasoning in Large Audio-Language Models
by: Yang, Wanqi, et al.
Published: (2025)
Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation
by: Wang, Siyin, et al.
Published: (2024)
Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data
by: Zhao, Shuaijiang, et al.
Published: (2024)
RAD-Bench: Evaluating Large Language Models Capabilities in Retrieval Augmented Dialogues
by: Kuo, Tzu-Lin, et al.
Published: (2024)
Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation
by: Zhang, Bo, et al.
Published: (2025)