Saved in:
| Main Authors: | Fu, Yicheng, Wang, Zikui, Yang, Liuxin, Huo, Meiqing, Dai, Zhongdongming |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.14662 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Chengyu-Bench: Benchmarking Large Language Models for Chinese Idiom Understanding and Use
by: Fu, Yicheng, et al.
Published: (2025)
by: Fu, Yicheng, et al.
Published: (2025)
Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks
by: Chernov, Andrei
Published: (2025)
by: Chernov, Andrei
Published: (2025)
The Impossible Test: A 2024 Unsolvable Dataset and A Chance for an AGI Quiz
by: Noever, David, et al.
Published: (2024)
by: Noever, David, et al.
Published: (2024)
ACE-TA: An Agentic Teaching Assistant for Grounded Q&A, Quiz Generation, and Code Tutoring
by: Tripathi, Himanshu, et al.
Published: (2026)
by: Tripathi, Himanshu, et al.
Published: (2026)
ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models
by: Karev, Alexey, et al.
Published: (2025)
by: Karev, Alexey, et al.
Published: (2025)
ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models
by: Elangovan, Aparna, et al.
Published: (2024)
by: Elangovan, Aparna, et al.
Published: (2024)
SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG
by: Si, Xiaonan, et al.
Published: (2025)
by: Si, Xiaonan, et al.
Published: (2025)
Digital Gatekeepers: Exploring Large Language Model's Role in Immigration Decisions
by: Mao, Yicheng, et al.
Published: (2025)
by: Mao, Yicheng, et al.
Published: (2025)
Constructing Cloze Questions Generatively
by: Sun, Yicheng, et al.
Published: (2024)
by: Sun, Yicheng, et al.
Published: (2024)
Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information
by: Huang, Youcheng, et al.
Published: (2025)
by: Huang, Youcheng, et al.
Published: (2025)
Lightweight Prompt Engineering for Cognitive Alignment in Educational AI: A OneClickQuiz Case Study
by: Yaacoub, Antoun, et al.
Published: (2025)
by: Yaacoub, Antoun, et al.
Published: (2025)
LIBERTy: A Causal Framework for Benchmarking Concept-Based Explanations of LLMs with Structural Counterfactuals
by: Toker, Gilat, et al.
Published: (2026)
by: Toker, Gilat, et al.
Published: (2026)
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models
by: Golchin, Shahriar, et al.
Published: (2023)
by: Golchin, Shahriar, et al.
Published: (2023)
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device
by: Fu, Yicheng, et al.
Published: (2024)
by: Fu, Yicheng, et al.
Published: (2024)
Sarc7: Evaluating Sarcasm Detection and Generation with Seven Types and Emotion-Informed Techniques
by: Xiong, Lang, et al.
Published: (2025)
by: Xiong, Lang, et al.
Published: (2025)
Do Retrieval Augmented Language Models Know When They Don't Know?
by: Zhou, Youchao, et al.
Published: (2025)
by: Zhou, Youchao, et al.
Published: (2025)
GRAM: A Generative Foundation Reward Model for Reward Generalization
by: Wang, Chenglong, et al.
Published: (2025)
by: Wang, Chenglong, et al.
Published: (2025)
Generating Concept Lexicalizations via Dictionary-Based Cross-Lingual Sense Projection
by: Basil, David, et al.
Published: (2026)
by: Basil, David, et al.
Published: (2026)
ConceptViz: A Visual Analytics Approach for Exploring Concepts in Large Language Models
by: Li, Haoxuan, et al.
Published: (2025)
by: Li, Haoxuan, et al.
Published: (2025)
Persona-Aware Alignment Framework for Personalized Dialogue Generation
by: Li, Guanrong, et al.
Published: (2025)
by: Li, Guanrong, et al.
Published: (2025)
Brilla AI: AI Contestant for the National Science and Maths Quiz
by: Boateng, George, et al.
Published: (2024)
by: Boateng, George, et al.
Published: (2024)
Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation
by: Bai, Jiaxin, et al.
Published: (2023)
by: Bai, Jiaxin, et al.
Published: (2023)
Concept-Based Interpretability for Toxicity Detection
by: Garg, Samarth, et al.
Published: (2025)
by: Garg, Samarth, et al.
Published: (2025)
A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy
by: Zhao, Yang, et al.
Published: (2025)
by: Zhao, Yang, et al.
Published: (2025)
LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
by: Xiao, Yicheng, et al.
Published: (2025)
by: Xiao, Yicheng, et al.
Published: (2025)
Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection
by: Li, Xuecong, et al.
Published: (2026)
by: Li, Xuecong, et al.
Published: (2026)
Large Language Model for Patent Concept Generation
by: Ren, Runtao, et al.
Published: (2024)
by: Ren, Runtao, et al.
Published: (2024)
ConceptPsy:A Benchmark Suite with Conceptual Comprehensiveness in Psychology
by: Zhang, Junlei, et al.
Published: (2023)
by: Zhang, Junlei, et al.
Published: (2023)
MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction
by: Sun, Qiao, et al.
Published: (2024)
by: Sun, Qiao, et al.
Published: (2024)
Pause-Tuning for Long-Context Comprehension: A Lightweight Approach to LLM Attention Recalibration
by: Begin, James, et al.
Published: (2025)
by: Begin, James, et al.
Published: (2025)
Periodic RoPE for Infinite Context LLMs
by: Huo, Simin
Published: (2026)
by: Huo, Simin
Published: (2026)
FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations
by: Wen, Athena, et al.
Published: (2025)
by: Wen, Athena, et al.
Published: (2025)
ConDABench: Interactive Evaluation of Language Models for Data Analysis
by: Dutta, Avik, et al.
Published: (2025)
by: Dutta, Avik, et al.
Published: (2025)
ConsPrompt: Exploiting Contrastive Samples for Fewshot Prompt Learning
by: Weng, Jinta, et al.
Published: (2022)
by: Weng, Jinta, et al.
Published: (2022)
ReFINE: A Reward-Based Framework for Interpretable and Nuanced Evaluation of Radiology Report Generation
by: Liu, Yunyi, et al.
Published: (2024)
by: Liu, Yunyi, et al.
Published: (2024)
Bayesian Calibration of Win Rate Estimation with LLM Evaluators
by: Gao, Yicheng, et al.
Published: (2024)
by: Gao, Yicheng, et al.
Published: (2024)
Attr-Int: A Simple and Effective Entity Alignment Framework for Heterogeneous Knowledge Graphs
by: Yang, Linyan, et al.
Published: (2024)
by: Yang, Linyan, et al.
Published: (2024)
A Concept-Based Explainability Framework for Large Multimodal Models
by: Parekh, Jayneel, et al.
Published: (2024)
by: Parekh, Jayneel, et al.
Published: (2024)
Contrastive Cross-Course Knowledge Tracing via Concept Graph Guided Knowledge Transfer
by: Han, Wenkang, et al.
Published: (2025)
by: Han, Wenkang, et al.
Published: (2025)
C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
by: Huo, Feiye, et al.
Published: (2025)
by: Huo, Feiye, et al.
Published: (2025)
Similar Items
-
Chengyu-Bench: Benchmarking Large Language Models for Chinese Idiom Understanding and Use
by: Fu, Yicheng, et al.
Published: (2025) -
Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks
by: Chernov, Andrei
Published: (2025) -
The Impossible Test: A 2024 Unsolvable Dataset and A Chance for an AGI Quiz
by: Noever, David, et al.
Published: (2024) -
ACE-TA: An Agentic Teaching Assistant for Grounded Q&A, Quiz Generation, and Code Tutoring
by: Tripathi, Himanshu, et al.
Published: (2026) -
ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models
by: Karev, Alexey, et al.
Published: (2025)