Saved in:
| Main Authors: | Zhang, Jinghao, Jiang, Sihang, Guo, Shiwei, Chen, Shisong, Xiao, Yanghua, Feng, Hongwei, Liang, Jiaqing, HE, Minggui, Tao, Shimin, Ma, Hongxia |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.16188 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Do Large Language Models Truly Understand Cross-cultural Differences?
by: Guo, Shiwei, et al.
Published: (2025)
by: Guo, Shiwei, et al.
Published: (2025)
SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment
by: Jiang, Sihang, et al.
Published: (2026)
by: Jiang, Sihang, et al.
Published: (2026)
Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception
by: Huang, Yuncheng, et al.
Published: (2023)
by: Huang, Yuncheng, et al.
Published: (2023)
VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It
by: Zhu, Xiaoxuan, et al.
Published: (2024)
by: Zhu, Xiaoxuan, et al.
Published: (2024)
Selective Expert Guidance for Effective and Diverse Exploration in Reinforcement Learning of LLMs
by: Jiang, Zishang, et al.
Published: (2025)
by: Jiang, Zishang, et al.
Published: (2025)
MIDB: Multilingual Instruction Data Booster for Enhancing Cultural Equality in Multilingual Instruction Synthesis
by: Liu, Yilun, et al.
Published: (2025)
by: Liu, Yilun, et al.
Published: (2025)
Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience
by: Han, Haixia, et al.
Published: (2024)
by: Han, Haixia, et al.
Published: (2024)
EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data
by: Chen, Xuetian, et al.
Published: (2024)
by: Chen, Xuetian, et al.
Published: (2024)
C-Mining: Unsupervised Discovery of Seeds for Cultural Data Synthesis via Geometric Misalignment
by: Zeng, Pufan, et al.
Published: (2026)
by: Zeng, Pufan, et al.
Published: (2026)
Can Pre-trained Language Models Understand Chinese Humor?
by: Chen, Yuyan, et al.
Published: (2024)
by: Chen, Yuyan, et al.
Published: (2024)
Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding
by: Li, Yanda, et al.
Published: (2024)
by: Li, Yanda, et al.
Published: (2024)
HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness
by: Wang, Xinyi, et al.
Published: (2025)
by: Wang, Xinyi, et al.
Published: (2025)
MultiLingPoT: Enhancing Mathematical Reasoning with Multilingual Program Fine-tuning
by: Li, Nianqi, et al.
Published: (2024)
by: Li, Nianqi, et al.
Published: (2024)
CultureForest: Understanding and Evaluating Cultural Norm Grounded Reasoning in LLMs
by: Ye, Yangfan, et al.
Published: (2026)
by: Ye, Yangfan, et al.
Published: (2026)
Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation
by: He, Qianxi, et al.
Published: (2025)
by: He, Qianxi, et al.
Published: (2025)
Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks
by: Huang, Yuncheng, et al.
Published: (2024)
by: Huang, Yuncheng, et al.
Published: (2024)
Adaptive Ordered Information Extraction with Deep Reinforcement Learning
by: Huang, Wenhao, et al.
Published: (2023)
by: Huang, Wenhao, et al.
Published: (2023)
Small Language Model Can Self-correct
by: Han, Haixia, et al.
Published: (2024)
by: Han, Haixia, et al.
Published: (2024)
From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models
by: He, Qianyu, et al.
Published: (2024)
by: He, Qianyu, et al.
Published: (2024)
Is There a One-Model-Fits-All Approach to Information Extraction? Revisiting Task Definition Biases
by: Huang, Wenhao, et al.
Published: (2024)
by: Huang, Wenhao, et al.
Published: (2024)
Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction
by: Ding, Zepeng, et al.
Published: (2024)
by: Ding, Zepeng, et al.
Published: (2024)
Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation
by: Han, Jinyi, et al.
Published: (2025)
by: Han, Jinyi, et al.
Published: (2025)
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning
by: He, Minggui, et al.
Published: (2025)
by: He, Minggui, et al.
Published: (2025)
The GaoYao Benchmark: A Comprehensive Framework for Evaluating Multilingual and Multicultural Abilities of Large Language Models
by: Liu, Yilun, et al.
Published: (2026)
by: Liu, Yilun, et al.
Published: (2026)
RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following
by: Pan, Tianjun, et al.
Published: (2026)
by: Pan, Tianjun, et al.
Published: (2026)
A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models
by: Han, Jinyi, et al.
Published: (2025)
by: Han, Jinyi, et al.
Published: (2025)
AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation
by: Huang, Wenhao, et al.
Published: (2024)
by: Huang, Wenhao, et al.
Published: (2024)
OVEL: Large Language Model as Memory Manager for Online Video Entity Linking
by: Zhao, Haiquan, et al.
Published: (2024)
by: Zhao, Haiquan, et al.
Published: (2024)
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation
by: Xia, Sirui, et al.
Published: (2024)
by: Xia, Sirui, et al.
Published: (2024)
Can Large Language Models Understand Real-World Complex Instructions?
by: He, Qianyu, et al.
Published: (2023)
by: He, Qianyu, et al.
Published: (2023)
LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models
by: Ma, Lipeng, et al.
Published: (2024)
by: Ma, Lipeng, et al.
Published: (2024)
Structured Reasoning for Large Language Models
by: Han, Jinyi, et al.
Published: (2026)
by: Han, Jinyi, et al.
Published: (2026)
Probing Cultural Awareness in LLMs: A Case Study of Cross-Culture Aesthetic Stylistics
by: Wang, Jiashuo, et al.
Published: (2026)
by: Wang, Jiashuo, et al.
Published: (2026)
Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models
by: Zhang, Yikai, et al.
Published: (2024)
by: Zhang, Yikai, et al.
Published: (2024)
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs
by: Zhang, Yifei, et al.
Published: (2024)
by: Zhang, Yifei, et al.
Published: (2024)
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation
by: Ran, Yiting, et al.
Published: (2025)
by: Ran, Yiting, et al.
Published: (2025)
CDS: Knowledge Component-Driven Data Synthesis Guided by Cognitive Diagnosis Theory
by: Zhao, Haokun, et al.
Published: (2025)
by: Zhao, Haokun, et al.
Published: (2025)
ConcEPT: Concept-Enhanced Pre-Training for Language Models
by: Wang, Xintao, et al.
Published: (2024)
by: Wang, Xintao, et al.
Published: (2024)
ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base
by: Yuan, Siyu, et al.
Published: (2023)
by: Yuan, Siyu, et al.
Published: (2023)
CEM: A Data-Efficient Method for Large Language Models to Continue Evolving From Mistakes
by: Zhao, Haokun, et al.
Published: (2024)
by: Zhao, Haokun, et al.
Published: (2024)
Similar Items
-
Do Large Language Models Truly Understand Cross-cultural Differences?
by: Guo, Shiwei, et al.
Published: (2025) -
SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment
by: Jiang, Sihang, et al.
Published: (2026) -
Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception
by: Huang, Yuncheng, et al.
Published: (2023) -
VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It
by: Zhu, Xiaoxuan, et al.
Published: (2024) -
Selective Expert Guidance for Effective and Diverse Exploration in Reinforcement Learning of LLMs
by: Jiang, Zishang, et al.
Published: (2025)