Saved in:
| Main Author: | Yang, Xiaodong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.14969 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evaluating LLMs on Chinese Idiom Translation
by: Yang, Cai, et al.
Published: (2025)
by: Yang, Cai, et al.
Published: (2025)
A Topic-aware Comparable Corpus of Chinese Variations
by: Lian, Da-Chen, et al.
Published: (2024)
by: Lian, Da-Chen, et al.
Published: (2024)
The performances of the Chinese and U.S. Large Language Models on the Topic of Chinese Culture
by: Liu, Feiyan, et al.
Published: (2026)
by: Liu, Feiyan, et al.
Published: (2026)
Can LLMs Act as Historians? Evaluating Historical Research Capabilities of LLMs via the Chinese Imperial Examination
by: Gao, Lirong, et al.
Published: (2026)
by: Gao, Lirong, et al.
Published: (2026)
TianHui: A Domain-Specific Large Language Model for Diverse Traditional Chinese Medicine Scenarios
by: Yin, Ji, et al.
Published: (2025)
by: Yin, Ji, et al.
Published: (2025)
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
by: Tang, Liyan, et al.
Published: (2024)
by: Tang, Liyan, et al.
Published: (2024)
Unveiling the Competitive Dynamics: A Comparative Evaluation of American and Chinese LLMs
by: Jiang, Zhenhui, et al.
Published: (2024)
by: Jiang, Zhenhui, et al.
Published: (2024)
Interdisciplinary Fairness in Imbalanced Research Proposal Topic Inference: A Hierarchical Transformer-based Method with Selective Interpolation
by: Xiao, Meng, et al.
Published: (2023)
by: Xiao, Meng, et al.
Published: (2023)
Proposal Report for the 2nd SciCAP Competition 2024
by: Li, Pengpeng, et al.
Published: (2024)
by: Li, Pengpeng, et al.
Published: (2024)
Addressing Topic Leakage in Cross-Topic Evaluation for Authorship Verification
by: Sawatphol, Jitkapat, et al.
Published: (2024)
by: Sawatphol, Jitkapat, et al.
Published: (2024)
Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods
by: Passali, Tatiana, et al.
Published: (2022)
by: Passali, Tatiana, et al.
Published: (2022)
WHBench: Evaluating Frontier LLMs with Expert-in-the-Loop Validation on Women's Health Topics
by: Maurya, Sneha, et al.
Published: (2026)
by: Maurya, Sneha, et al.
Published: (2026)
LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs
by: Davoodi, Arash Gholami, et al.
Published: (2024)
by: Davoodi, Arash Gholami, et al.
Published: (2024)
Evaluating Distributed Representations for Multi-Level Lexical Semantics: A Research Proposal
by: Liu, Zhu
Published: (2024)
by: Liu, Zhu
Published: (2024)
ClinConsensus: A Physician-Calibrated Benchmark for Evaluating Clinical Rubric Coverage in Chinese Medical LLMs
by: Zheng, Xiang, et al.
Published: (2026)
by: Zheng, Xiang, et al.
Published: (2026)
Proposing Topic Models and Evaluation Frameworks for Analyzing Associations with External Outcomes: An Application to Leadership Analysis Using Large-Scale Corporate Review Data
by: Yoshida, Yura, et al.
Published: (2026)
by: Yoshida, Yura, et al.
Published: (2026)
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark
by: Jiang, Feng, et al.
Published: (2023)
by: Jiang, Feng, et al.
Published: (2023)
Exploring Safety Alignment Evaluation of LLMs in Chinese Mental Health Dialogues via LLM-as-Judge
by: Cai, Yunna, et al.
Published: (2025)
by: Cai, Yunna, et al.
Published: (2025)
CDTP: A Large-Scale Chinese Data-Text Pair Dataset for Comprehensive Evaluation of Chinese LLMs
by: Wu, Chengwei, et al.
Published: (2025)
by: Wu, Chengwei, et al.
Published: (2025)
Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media
by: Kristensen-McLachlan, Ross Deans, et al.
Published: (2024)
by: Kristensen-McLachlan, Ross Deans, et al.
Published: (2024)
ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models
by: Zhang, Hengxiang, et al.
Published: (2024)
by: Zhang, Hengxiang, et al.
Published: (2024)
NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation
by: Wang, Xiaoyang, et al.
Published: (2021)
by: Wang, Xiaoyang, et al.
Published: (2021)
Holistic Evaluations of Topic Models
by: Compton, Thomas
Published: (2025)
by: Compton, Thomas
Published: (2025)
CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns
by: Zhou, Zhenhong, et al.
Published: (2026)
by: Zhou, Zhenhong, et al.
Published: (2026)
Can AI Write Classical Chinese Poetry like Humans? An Empirical Study Inspired by Turing Test
by: Deng, Zekun, et al.
Published: (2024)
by: Deng, Zekun, et al.
Published: (2024)
Assisting Research Proposal Writing with Large Language Models: Evaluation and Refinement
by: Ren, Jing, et al.
Published: (2025)
by: Ren, Jing, et al.
Published: (2025)
Hierarchical Graph Topic Modeling with Topic Tree-based Transformer
by: Zhang, Delvin Ce, et al.
Published: (2025)
by: Zhang, Delvin Ce, et al.
Published: (2025)
Topic Modeling with Fine-tuning LLMs and Bag of Sentences
by: Schneider, Johannes
Published: (2024)
by: Schneider, Johannes
Published: (2024)
Topic-Guided Reinforcement Learning with LLMs for Enhancing Multi-Document Summarization
by: Li, Chuyuan, et al.
Published: (2025)
by: Li, Chuyuan, et al.
Published: (2025)
Analyzing Cancer Patients' Experiences with Embedding-based Topic Modeling and LLMs
by: Ionescu, Teodor-Călin, et al.
Published: (2026)
by: Ionescu, Teodor-Călin, et al.
Published: (2026)
QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation
by: Hong, Mengze, et al.
Published: (2025)
by: Hong, Mengze, et al.
Published: (2025)
Automatic Construction of Chinese Verb Collostruction Database
by: Tang, Xuri, et al.
Published: (2025)
by: Tang, Xuri, et al.
Published: (2025)
Comprehensive Evaluation of Large Language Models for Topic Modeling
by: Doi, Tomoki, et al.
Published: (2024)
by: Doi, Tomoki, et al.
Published: (2024)
Are LLMs Effective Backbones for Fine-tuning? An Experimental Investigation of Supervised LLMs on Chinese Short Text Matching
by: Liu, Shulin, et al.
Published: (2024)
by: Liu, Shulin, et al.
Published: (2024)
LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models
by: Yang, Xiaohao, et al.
Published: (2024)
by: Yang, Xiaohao, et al.
Published: (2024)
TopicGPT: A Prompt-based Topic Modeling Framework
by: Pham, Chau Minh, et al.
Published: (2023)
by: Pham, Chau Minh, et al.
Published: (2023)
TaxPraBen: A Scalable Benchmark for Structured Evaluation of LLMs in Chinese Real-World Tax Practice
by: Hu, Gang, et al.
Published: (2026)
by: Hu, Gang, et al.
Published: (2026)
A Neural Topic Method Using a Large-Language-Model-in-the-Loop for Business Research
by: Ludwig, Stephan, et al.
Published: (2026)
by: Ludwig, Stephan, et al.
Published: (2026)
Automating Thematic Analysis: How LLMs Analyse Controversial Topics
by: Khan, Awais Hameed, et al.
Published: (2024)
by: Khan, Awais Hameed, et al.
Published: (2024)
MTCMB: A Multi-Task Benchmark Framework for Evaluating LLMs on Knowledge, Reasoning, and Safety in Traditional Chinese Medicine
by: Kong, Shufeng, et al.
Published: (2025)
by: Kong, Shufeng, et al.
Published: (2025)
Similar Items
-
Evaluating LLMs on Chinese Idiom Translation
by: Yang, Cai, et al.
Published: (2025) -
A Topic-aware Comparable Corpus of Chinese Variations
by: Lian, Da-Chen, et al.
Published: (2024) -
The performances of the Chinese and U.S. Large Language Models on the Topic of Chinese Culture
by: Liu, Feiyan, et al.
Published: (2026) -
Can LLMs Act as Historians? Evaluating Historical Research Capabilities of LLMs via the Chinese Imperial Examination
by: Gao, Lirong, et al.
Published: (2026) -
TianHui: A Domain-Specific Large Language Model for Diverse Traditional Chinese Medicine Scenarios
by: Yin, Ji, et al.
Published: (2025)