:: Library Catalog

Bibliographic Details
Main Author:	Yang, Xiaodong
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2504.14969

Similar Items

Evaluating LLMs on Chinese Idiom Translation
by: Yang, Cai, et al.
Published: (2025)

A Topic-aware Comparable Corpus of Chinese Variations
by: Lian, Da-Chen, et al.
Published: (2024)

The performances of the Chinese and U.S. Large Language Models on the Topic of Chinese Culture
by: Liu, Feiyan, et al.
Published: (2026)

Can LLMs Act as Historians? Evaluating Historical Research Capabilities of LLMs via the Chinese Imperial Examination
by: Gao, Lirong, et al.
Published: (2026)

TianHui: A Domain-Specific Large Language Model for Diverse Traditional Chinese Medicine Scenarios
by: Yin, Ji, et al.
Published: (2025)

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
by: Tang, Liyan, et al.
Published: (2024)

Unveiling the Competitive Dynamics: A Comparative Evaluation of American and Chinese LLMs
by: Jiang, Zhenhui, et al.
Published: (2024)

Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation
by: Xiao, Meng, et al.
Published: (2023)

Proposal Report for the 2nd SciCAP Competition 2024
by: Li, Pengpeng, et al.
Published: (2024)

Addressing Topic Leakage in Cross-Topic Evaluation for Authorship Verification
by: Sawatphol, Jitkapat, et al.
Published: (2024)

Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods
by: Passali, Tatiana, et al.
Published: (2022)

WHBench: Evaluating Frontier LLMs with Expert-in-the-Loop Validation on Women's Health Topics
by: Maurya, Sneha, et al.
Published: (2026)

LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs
by: Davoodi, Arash Gholami, et al.
Published: (2024)

Evaluating Distributed Representations for Multi-Level Lexical Semantics: A Research Proposal
by: Liu, Zhu
Published: (2024)

ClinConsensus: A Physician-Calibrated Benchmark for Evaluating Clinical Rubric Coverage in Chinese Medical LLMs
by: Zheng, Xiang, et al.
Published: (2026)

Proposing Topic Models and Evaluation Frameworks for Analyzing Associations with External Outcomes: An Application to Leadership Analysis Using Large-Scale Corporate Review Data
by: Yoshida, Yura, et al.
Published: (2026)

Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark
by: Jiang, Feng, et al.
Published: (2023)

Exploring Safety Alignment Evaluation of LLMs in Chinese Mental Health Dialogues via LLM-as-Judge
by: Cai, Yunna, et al.
Published: (2025)

CDTP: A Large-Scale Chinese Data-Text Pair Dataset for Comprehensive Evaluation of Chinese LLMs
by: Wu, Chengwei, et al.
Published: (2025)

Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media
by: Kristensen-McLachlan, Ross Deans, et al.
Published: (2024)

ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models
by: Zhang, Hengxiang, et al.
Published: (2024)

NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation
by: Wang, Xiaoyang, et al.
Published: (2021)

Holistic Evaluations of Topic Models
by: Compton, Thomas
Published: (2025)

CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns
by: Zhou, Zhenhong, et al.
Published: (2026)

Can AI Write Classical Chinese Poetry like Humans? An Empirical Study Inspired by Turing Test
by: Deng, Zekun, et al.
Published: (2024)

Assisting Research Proposal Writing with Large Language Models: Evaluation and Refinement
by: Ren, Jing, et al.
Published: (2025)

Hierarchical Graph Topic Modeling with Topic Tree-based Transformer
by: Zhang, Delvin Ce, et al.
Published: (2025)

Topic Modeling with Fine-tuning LLMs and Bag of Sentences
by: Schneider, Johannes
Published: (2024)

Topic-Guided Reinforcement Learning with LLMs for Enhancing Multi-Document Summarization
by: Li, Chuyuan, et al.
Published: (2025)

Analyzing Cancer Patients' Experiences with Embedding-based Topic Modeling and LLMs
by: Ionescu, Teodor-Călin, et al.
Published: (2026)

QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation
by: Hong, Mengze, et al.
Published: (2025)

Automatic Construction of Chinese Verb Collostruction Database
by: Tang, Xuri, et al.
Published: (2025)

Comprehensive Evaluation of Large Language Models for Topic Modeling
by: Doi, Tomoki, et al.
Published: (2024)

Are LLMs Effective Backbones for Fine-tuning? An Experimental Investigation of Supervised LLMs on Chinese Short Text Matching
by: Liu, Shulin, et al.
Published: (2024)

LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models
by: Yang, Xiaohao, et al.
Published: (2024)

TopicGPT: A Prompt-based Topic Modeling Framework
by: Pham, Chau Minh, et al.
Published: (2023)

TaxPraBen: A Scalable Benchmark for Structured Evaluation of LLMs in Chinese Real-World Tax Practice
by: Hu, Gang, et al.
Published: (2026)

A Neural Topic Method Using a Large-Language-Model-in-the-Loop for Business Research
by: Ludwig, Stephan, et al.
Published: (2026)

Automating Thematic Analysis: How LLMs Analyse Controversial Topics
by: Khan, Awais Hameed, et al.
Published: (2024)

MTCMB: A Multi-Task Benchmark Framework for Evaluating LLMs on Knowledge, Reasoning, and Safety in Traditional Chinese Medicine
by: Kong, Shufeng, et al.
Published: (2025)