:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Masoka, Happymore
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.14249
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Shona spaCy: A Morphological Analyzer for an Under-Resourced Bantu Language
by: Masoka, Happymore
Published: (2025)

How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages
by: Wu, Siyang, et al.
Published: (2025)

Far Out: Evaluating Language Models on Slang in Australian and Indian English
by: Dilsiz, Deniz Kaya, et al.
Published: (2026)

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI
by: Zhang, Jianguo, et al.
Published: (2023)

Topic-Conversation Relevance (TCR) Dataset and Benchmarks
by: Fan, Yaran, et al.
Published: (2024)

Advancing Conversational Diagnostic AI with Multimodal Reasoning
by: Saab, Khaled, et al.
Published: (2025)

Can Large Language Models Generate Effective Datasets for Emotion Recognition in Conversations?
by: Kaplan, Burak Can, et al.
Published: (2025)

An AI-Based Behavioral Health Safety Filter and Dataset for Identifying Mental Health Crises in Text-Based Conversations
by: Nelson, Benjamin W., et al.
Published: (2025)

TruthStance: An Annotated Dataset of Conversations on Truth Social
by: Ameen, Fathima, et al.
Published: (2026)

MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
by: Yang, Wanqi, et al.
Published: (2025)

Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset
by: Liu, Rui, et al.
Published: (2024)

Hybrid-SQuAD: Hybrid Scholarly Question Answering Dataset
by: Taffa, Tilahun Abedissa, et al.
Published: (2024)

"What's Up, Doc?": Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets
by: Paruchuri, Akshay, et al.
Published: (2025)

T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
by: Chakraborty, Amartya, et al.
Published: (2025)

ShareChat: A Dataset of Chatbot Conversations in the Wild
by: Yan, Yueru, et al.
Published: (2025)

When2Speak: A Dataset for Temporal Participation and Turn-Taking in Multi-Party Conversations for Large Language Models
by: Nama, Vihaan, et al.
Published: (2026)

Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison
by: Pandit, Tejul, et al.
Published: (2025)

Allo-AVA: A Large-Scale Multimodal Conversational AI Dataset for Allocentric Avatar Gesture Animation
by: Punjwani, Saif, et al.
Published: (2024)

Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting
by: Rostam, Zhyar Rzgar K, et al.
Published: (2025)

ForensicsData: A Digital Forensics Dataset for Large Language Models
by: Chakir, Youssef, et al.
Published: (2025)

Commercial Persuasion in AI-Mediated Conversations
by: Salvi, Francesco, et al.
Published: (2026)

DuCCAE: A Hybrid Engine for Immersive Conversation via Collaboration, Augmentation, and Evolution
by: Shen, Xin, et al.
Published: (2026)

MedAlpaca -- An Open-Source Collection of Medical Conversational AI Models and Training Data
by: Han, Tianyu, et al.
Published: (2023)

SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detection
by: Kazemi, Arefeh, et al.
Published: (2025)

A Survey on Recent Advances in Conversational Data Generation
by: Soudani, Heydar, et al.
Published: (2024)

Conversational Process Modeling: Can Generative AI Empower Domain Experts in Creating and Redesigning Process Models?
by: Klievtsova, Nataliia, et al.
Published: (2023)

Ace-CEFR -- A Dataset for Automated Evaluation of the Linguistic Difficulty of Conversational Texts for LLM Applications
by: Kogan, David, et al.
Published: (2025)

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
by: Zheng, Lianmin, et al.
Published: (2023)

Towards Anthropomorphic Conversational AI Part I: A Practical Framework
by: Wei, Fei, et al.
Published: (2025)

Memoria: A Scalable Agentic Memory Framework for Personalized Conversational AI
by: Sarin, Samarth, et al.
Published: (2025)

Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
by: Miehling, Erik, et al.
Published: (2024)

Intellecta Cognitiva: A Comprehensive Dataset for Advancing Academic Knowledge and Machine Reasoning
by: PS, Ajmal, et al.
Published: (2024)

Towards Conversational Diagnostic AI
by: Tu, Tao, et al.
Published: (2024)

Goal Alignment in LLM-Based User Simulators for Conversational AI
by: Mehri, Shuhaib, et al.
Published: (2025)

Conversational Tree Search: A New Hybrid Dialog Task
by: Väth, Dirk, et al.
Published: (2023)

Semantic XPath: Structured Agentic Memory Access for Conversational AI
by: Liu, Yifan Simon, et al.
Published: (2026)

Towards Privacy-aware Mental Health AI Models: Advances, Challenges, and Opportunities
by: Mandal, Aishik, et al.
Published: (2025)

Introducing MeMo: A Multimodal Dataset for Memory Modelling in Multiparty Conversations
by: Tsfasman, Maria, et al.
Published: (2024)

Simulating User Agents for Embodied Conversational-AI
by: Philipov, Daniel, et al.
Published: (2024)

iTBLS: A Dataset of Interactive Conversations Over Tabular Information
by: Sundar, Anirudh, et al.
Published: (2024)