| Main Authors: | Zhu, Tingyuan, Liu, Shudong, Wang, Yidong, Wong, Derek F., Yu, Han, Shinozaki, Takahiro, Wang, Jindong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2411.14121 |
Similar Items
CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries
by: Liu, Shudong, et al.
Published: (2025)
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
by: Zhou, Zihao, et al.
Published: (2024)
Silly Salamanders and Other Slightly Stupid Stuff for Readers Theatre.
by: Fredericks, Anthony D.
Published: (2000)
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
by: Wang, Rui, et al.
Published: (2025)
On the Diversity of Synthetic Data and its Impact on Training Large Language Models
by: Chen, Hao, et al.
Published: (2024)
Understanding and Mitigating Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
by: Li, Miaomiao, et al.
Published: (2025)
Harnessing Temporal Databases for Systematic Evaluation of Factual Time-Sensitive Question-Answering in Large Language Models
by: Kim, Soyeon, et al.
Published: (2025)
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models
by: Yu, Zhuohao, et al.
Published: (2024)
Visual Question Decomposition on Multimodal Large Language Models
by: Zhang, Haowei, et al.
Published: (2024)
KIEval: A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
by: Yu, Zhuohao, et al.
Published: (2024)
There Are No Silly Questions: Evaluation of Offline LLM Capabilities from a Turkish Perspective
by: Yilmaz, Edibe, et al.
Published: (2026)
RAG-Boost: Retrieval-Augmented Generation Enhanced LLM-based Speech Recognition
by: Wang, Pengcheng, et al.
Published: (2025)
Supervised Knowledge Makes Large Language Models Better In-context Learners
by: Yang, Linyi, et al.
Published: (2023)
A Survey on Evaluation of Large Language Models
by: Chang, Yupeng, et al.
Published: (2023)
RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models
by: Feng, Andrew Zhuoer, et al.
Published: (2026)
PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts
by: Zhu, Kaijie, et al.
Published: (2023)
Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT
by: Komatsu, Ryota, et al.
Published: (2024)
Chain-of-Procedure: Hierarchical Visual-Language Reasoning for Procedural QA
by: Chen, Guanhua, et al.
Published: (2026)
Understanding and Mitigating Political Stance Cross-topic Generalization in Large Language Models
by: Zhang, Jiayi, et al.
Published: (2025)
StringLLM: Understanding the String Processing Capability of Large Language Models
by: Wang, Xilong, et al.
Published: (2024)
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models
by: Pang, Jianhui, et al.
Published: (2024)
LLM-MemCluster: Empowering Large Language Models with Dynamic Memory for Text Clustering
by: Zhu, Yuanjie, et al.
Published: (2025)
From Scenes to Elements: Multi-Granularity Evidence Retrieval for Verifiable Multimodal RAG
by: Chen, Guanhua, et al.
Published: (2026)
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering
by: Zhao, Jihao, et al.
Published: (2025)
Continual Learning Using Only Large Language Model Prompting
by: Qiu, Jiabao, et al.
Published: (2024)
CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task
by: Mo, Haosi, et al.
Published: (2025)
Dynamic Evaluation of Large Language Models by Meta Probing Agents
by: Zhu, Kaijie, et al.
Published: (2024)
Hey, That's My Data! Token-Only Dataset Inference in Large Language Models
by: Xiong, Chen, et al.
Published: (2025)
Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
by: Hu, Tianxiang, et al.
Published: (2024)
Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model
by: Xu, Haoyun, et al.
Published: (2024)
PromptBench: A Unified Library for Evaluation of Large Language Models
by: Zhu, Kaijie, et al.
Published: (2023)
On Fairness of Unified Multimodal Large Language Model for Image Generation
by: Liu, Ming, et al.
Published: (2025)
NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional Stimuli
by: Wang, Xu, et al.
Published: (2024)
Classroom AI: Large Language Models as Grade-Specific Teachers
by: Oh, Jio, et al.
Published: (2026)
Intrinsic Model Weaknesses: How Priming Attacks Unveil Vulnerabilities in Large Language Models
by: Huang, Yuyi, et al.
Published: (2025)
Anchor-based Large Language Models
by: Pang, Jianhui, et al.
Published: (2024)
LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models
by: Zhao, Zihan, et al.
Published: (2023)
Rethinking Prompt-based Debiasing in Large Language Models
by: Yang, Xinyi, et al.
Published: (2025)
Contrastive Learning for Knowledge-Based Question Generation in Large Language Models
by: Zhang, Zhenhong, et al.
Published: (2024)
Direct Simultaneous Translation Activation for Large Audio-Language Models
by: Zhang, Pei, et al.
Published: (2025)