Saved in:
| Main Author: | Masoka, Happymore |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.14249 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Shona spaCy: A Morphological Analyzer for an Under-Resourced Bantu Language
by: Masoka, Happymore
Published: (2025)
by: Masoka, Happymore
Published: (2025)
How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages
by: Wu, Siyang, et al.
Published: (2025)
by: Wu, Siyang, et al.
Published: (2025)
Far Out: Evaluating Language Models on Slang in Australian and Indian English
by: Dilsiz, Deniz Kaya, et al.
Published: (2026)
by: Dilsiz, Deniz Kaya, et al.
Published: (2026)
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI
by: Zhang, Jianguo, et al.
Published: (2023)
by: Zhang, Jianguo, et al.
Published: (2023)
Topic-Conversation Relevance (TCR) Dataset and Benchmarks
by: Fan, Yaran, et al.
Published: (2024)
by: Fan, Yaran, et al.
Published: (2024)
Advancing Conversational Diagnostic AI with Multimodal Reasoning
by: Saab, Khaled, et al.
Published: (2025)
by: Saab, Khaled, et al.
Published: (2025)
Can Large Language Models Generate Effective Datasets for Emotion Recognition in Conversations?
by: Kaplan, Burak Can, et al.
Published: (2025)
by: Kaplan, Burak Can, et al.
Published: (2025)
An AI-Based Behavioral Health Safety Filter and Dataset for Identifying Mental Health Crises in Text-Based Conversations
by: Nelson, Benjamin W., et al.
Published: (2025)
by: Nelson, Benjamin W., et al.
Published: (2025)
TruthStance: An Annotated Dataset of Conversations on Truth Social
by: Ameen, Fathima, et al.
Published: (2026)
by: Ameen, Fathima, et al.
Published: (2026)
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
by: Yang, Wanqi, et al.
Published: (2025)
by: Yang, Wanqi, et al.
Published: (2025)
Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset
by: Liu, Rui, et al.
Published: (2024)
by: Liu, Rui, et al.
Published: (2024)
Hybrid-SQuAD: Hybrid Scholarly Question Answering Dataset
by: Taffa, Tilahun Abedissa, et al.
Published: (2024)
by: Taffa, Tilahun Abedissa, et al.
Published: (2024)
"What's Up, Doc?": Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets
by: Paruchuri, Akshay, et al.
Published: (2025)
by: Paruchuri, Akshay, et al.
Published: (2025)
T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
by: Chakraborty, Amartya, et al.
Published: (2025)
by: Chakraborty, Amartya, et al.
Published: (2025)
ShareChat: A Dataset of Chatbot Conversations in the Wild
by: Yan, Yueru, et al.
Published: (2025)
by: Yan, Yueru, et al.
Published: (2025)
When2Speak: A Dataset for Temporal Participation and Turn-Taking in Multi-Party Conversations for Large Language Models
by: Nama, Vihaan, et al.
Published: (2026)
by: Nama, Vihaan, et al.
Published: (2026)
Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison
by: Pandit, Tejul, et al.
Published: (2025)
by: Pandit, Tejul, et al.
Published: (2025)
Allo-AVA: A Large-Scale Multimodal Conversational AI Dataset for Allocentric Avatar Gesture Animation
by: Punjwani, Saif, et al.
Published: (2024)
by: Punjwani, Saif, et al.
Published: (2024)
Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting
by: Rostam, Zhyar Rzgar K, et al.
Published: (2025)
by: Rostam, Zhyar Rzgar K, et al.
Published: (2025)
ForensicsData: A Digital Forensics Dataset for Large Language Models
by: Chakir, Youssef, et al.
Published: (2025)
by: Chakir, Youssef, et al.
Published: (2025)
Commercial Persuasion in AI-Mediated Conversations
by: Salvi, Francesco, et al.
Published: (2026)
by: Salvi, Francesco, et al.
Published: (2026)
DuCCAE: A Hybrid Engine for Immersive Conversation via Collaboration, Augmentation, and Evolution
by: Shen, Xin, et al.
Published: (2026)
by: Shen, Xin, et al.
Published: (2026)
MedAlpaca -- An Open-Source Collection of Medical Conversational AI Models and Training Data
by: Han, Tianyu, et al.
Published: (2023)
by: Han, Tianyu, et al.
Published: (2023)
SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detection
by: Kazemi, Arefeh, et al.
Published: (2025)
by: Kazemi, Arefeh, et al.
Published: (2025)
A Survey on Recent Advances in Conversational Data Generation
by: Soudani, Heydar, et al.
Published: (2024)
by: Soudani, Heydar, et al.
Published: (2024)
Conversational Process Modeling: Can Generative AI Empower Domain Experts in Creating and Redesigning Process Models?
by: Klievtsova, Nataliia, et al.
Published: (2023)
by: Klievtsova, Nataliia, et al.
Published: (2023)
Ace-CEFR -- A Dataset for Automated Evaluation of the Linguistic Difficulty of Conversational Texts for LLM Applications
by: Kogan, David, et al.
Published: (2025)
by: Kogan, David, et al.
Published: (2025)
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
by: Zheng, Lianmin, et al.
Published: (2023)
by: Zheng, Lianmin, et al.
Published: (2023)
Towards Anthropomorphic Conversational AI Part I: A Practical Framework
by: Wei, Fei, et al.
Published: (2025)
by: Wei, Fei, et al.
Published: (2025)
Memoria: A Scalable Agentic Memory Framework for Personalized Conversational AI
by: Sarin, Samarth, et al.
Published: (2025)
by: Sarin, Samarth, et al.
Published: (2025)
Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
by: Miehling, Erik, et al.
Published: (2024)
by: Miehling, Erik, et al.
Published: (2024)
Intellecta Cognitiva: A Comprehensive Dataset for Advancing Academic Knowledge and Machine Reasoning
by: PS, Ajmal, et al.
Published: (2024)
by: PS, Ajmal, et al.
Published: (2024)
Towards Conversational Diagnostic AI
by: Tu, Tao, et al.
Published: (2024)
by: Tu, Tao, et al.
Published: (2024)
Goal Alignment in LLM-Based User Simulators for Conversational AI
by: Mehri, Shuhaib, et al.
Published: (2025)
by: Mehri, Shuhaib, et al.
Published: (2025)
Conversational Tree Search: A New Hybrid Dialog Task
by: Väth, Dirk, et al.
Published: (2023)
by: Väth, Dirk, et al.
Published: (2023)
Semantic XPath: Structured Agentic Memory Access for Conversational AI
by: Liu, Yifan Simon, et al.
Published: (2026)
by: Liu, Yifan Simon, et al.
Published: (2026)
Towards Privacy-aware Mental Health AI Models: Advances, Challenges, and Opportunities
by: Mandal, Aishik, et al.
Published: (2025)
by: Mandal, Aishik, et al.
Published: (2025)
Introducing MeMo: A Multimodal Dataset for Memory Modelling in Multiparty Conversations
by: Tsfasman, Maria, et al.
Published: (2024)
by: Tsfasman, Maria, et al.
Published: (2024)
Simulating User Agents for Embodied Conversational-AI
by: Philipov, Daniel, et al.
Published: (2024)
by: Philipov, Daniel, et al.
Published: (2024)
iTBLS: A Dataset of Interactive Conversations Over Tabular Information
by: Sundar, Anirudh, et al.
Published: (2024)
by: Sundar, Anirudh, et al.
Published: (2024)
Similar Items
-
Shona spaCy: A Morphological Analyzer for an Under-Resourced Bantu Language
by: Masoka, Happymore
Published: (2025) -
How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages
by: Wu, Siyang, et al.
Published: (2025) -
Far Out: Evaluating Language Models on Slang in Australian and Indian English
by: Dilsiz, Deniz Kaya, et al.
Published: (2026) -
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI
by: Zhang, Jianguo, et al.
Published: (2023) -
Topic-Conversation Relevance (TCR) Dataset and Benchmarks
by: Fan, Yaran, et al.
Published: (2024)