Saved in:
| Main Authors: | Jiang, Yilin, Zhang, Mingzi, Jin, Sheng, Yu, Zengyi, Kong, Xiangjie, Tu, Binghao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.15250 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EduGuardBench: A Holistic Benchmark for Evaluating the Pedagogical Fidelity and Adversarial Safety of LLMs as Simulated Teachers
by: Jiang, Yilin, et al.
Published: (2025)
by: Jiang, Yilin, et al.
Published: (2025)
Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons
by: Sandan, Isik Baran, et al.
Published: (2025)
by: Sandan, Isik Baran, et al.
Published: (2025)
Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
by: Peters, Sydney, et al.
Published: (2025)
by: Peters, Sydney, et al.
Published: (2025)
Robustness of Large Language Models to Perturbations in Text
by: Singh, Ayush, et al.
Published: (2024)
by: Singh, Ayush, et al.
Published: (2024)
Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
Multi-Turn Interactions for Text-to-SQL with Large Language Models
by: Xiong, Guanming, et al.
Published: (2024)
by: Xiong, Guanming, et al.
Published: (2024)
PatentGPT: A Large Language Model for Intellectual Property
by: Bai, Zilong, et al.
Published: (2024)
by: Bai, Zilong, et al.
Published: (2024)
Large Language Models Can Better Understand Knowledge Graphs Than We Thought
by: Dai, Xinbang, et al.
Published: (2024)
by: Dai, Xinbang, et al.
Published: (2024)
Predictive Simultaneous Interpretation: Harnessing Large Language Models for Democratizing Real-Time Multilingual Communication
by: Iida, Kurando, et al.
Published: (2024)
by: Iida, Kurando, et al.
Published: (2024)
UA-Legal-Bench: A Benchmark for Evaluating Large Language Models on Ukrainian Legal Reasoning
by: Ovcharov, Volodymyr
Published: (2026)
by: Ovcharov, Volodymyr
Published: (2026)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
by: Fadli, Samih
Published: (2025)
Inference to the Best Explanation in Large Language Models
by: Dalal, Dhairya, et al.
Published: (2024)
by: Dalal, Dhairya, et al.
Published: (2024)
Streamlining Redundant Layers to Compress Large Language Models
by: Chen, Xiaodong, et al.
Published: (2024)
by: Chen, Xiaodong, et al.
Published: (2024)
A comprehensive taxonomy of hallucinations in Large Language Models
by: Cossio, Manuel
Published: (2025)
by: Cossio, Manuel
Published: (2025)
Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models
by: Chang, Edward Y.
Published: (2024)
by: Chang, Edward Y.
Published: (2024)
Bielik 11B v3: Multilingual Large Language Model for European Languages
by: Ociepa, Krzysztof, et al.
Published: (2025)
by: Ociepa, Krzysztof, et al.
Published: (2025)
Prompt-Time Symbolic Knowledge Capture with Large Language Models
by: Çöplü, Tolga, et al.
Published: (2024)
by: Çöplü, Tolga, et al.
Published: (2024)
Artificial Phantasia: Emergent Mental Imagery in Large Language Models
by: McCarty, Morgan, et al.
Published: (2025)
by: McCarty, Morgan, et al.
Published: (2025)
Argumentative Large Language Models for Explainable and Contestable Claim Verification
by: Freedman, Gabriel, et al.
Published: (2024)
by: Freedman, Gabriel, et al.
Published: (2024)
Reasoning over Uncertain Text by Generative Large Language Models
by: Nafar, Aliakbar, et al.
Published: (2024)
by: Nafar, Aliakbar, et al.
Published: (2024)
Demystifying Instruction Mixing for Fine-tuning Large Language Models
by: Wang, Renxi, et al.
Published: (2023)
by: Wang, Renxi, et al.
Published: (2023)
EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models
by: Paech, Samuel J.
Published: (2023)
by: Paech, Samuel J.
Published: (2023)
Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights from Multi-Agent Collaboration
by: Yuan, Weikang, et al.
Published: (2024)
by: Yuan, Weikang, et al.
Published: (2024)
Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models
by: Venkatasubramanian, Venkat, et al.
Published: (2024)
by: Venkatasubramanian, Venkat, et al.
Published: (2024)
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models
by: Benkirane, Kenza, et al.
Published: (2024)
by: Benkirane, Kenza, et al.
Published: (2024)
BLT: Can Large Language Models Handle Basic Legal Text?
by: Blair-Stanek, Andrew, et al.
Published: (2023)
by: Blair-Stanek, Andrew, et al.
Published: (2023)
Search-R3: Unifying Reasoning and Embedding in Large Language Models
by: Gui, Yuntao, et al.
Published: (2025)
by: Gui, Yuntao, et al.
Published: (2025)
Can Large Language Models perform Relation-based Argument Mining?
by: Gorur, Deniz, et al.
Published: (2024)
by: Gorur, Deniz, et al.
Published: (2024)
Assistive Large Language Model Agents for Socially-Aware Negotiation Dialogues
by: Hua, Yuncheng, et al.
Published: (2024)
by: Hua, Yuncheng, et al.
Published: (2024)
Partially Recentralization Softmax Loss for Vision-Language Models Robustness
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
by: Saji, Alan, et al.
Published: (2025)
by: Saji, Alan, et al.
Published: (2025)
PustakAI: Curriculum-Aligned and Interactive Textbooks Using Large Language Models
by: Sharma, Shivam, et al.
Published: (2025)
by: Sharma, Shivam, et al.
Published: (2025)
Prompt-Time Ontology-Driven Symbolic Knowledge Capture with Large Language Models
by: Çöplü, Tolga, et al.
Published: (2024)
by: Çöplü, Tolga, et al.
Published: (2024)
Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization
by: Nafar, Aliakbar, et al.
Published: (2025)
by: Nafar, Aliakbar, et al.
Published: (2025)
Direct Semantic Communication Between Large Language Models via Vector Translation
by: Yang, Fu-Chun, et al.
Published: (2025)
by: Yang, Fu-Chun, et al.
Published: (2025)
The Generation Gap: Exploring Age Bias in the Value Systems of Large Language Models
by: Liu, Siyang, et al.
Published: (2024)
by: Liu, Siyang, et al.
Published: (2024)
Learning vs Retrieval: The Role of In-Context Examples in Regression with Large Language Models
by: Nafar, Aliakbar, et al.
Published: (2024)
by: Nafar, Aliakbar, et al.
Published: (2024)
MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models
by: Xu, Jingwei, et al.
Published: (2024)
by: Xu, Jingwei, et al.
Published: (2024)
Red Teaming for Large Language Models At Scale: Tackling Hallucinations on Mathematics Tasks
by: Buszydlik, Aleksander, et al.
Published: (2023)
by: Buszydlik, Aleksander, et al.
Published: (2023)
Eliciting Problem Specifications via Large Language Models
by: Wray, Robert E., et al.
Published: (2024)
by: Wray, Robert E., et al.
Published: (2024)
Similar Items
-
EduGuardBench: A Holistic Benchmark for Evaluating the Pedagogical Fidelity and Adversarial Safety of LLMs as Simulated Teachers
by: Jiang, Yilin, et al.
Published: (2025) -
Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons
by: Sandan, Isik Baran, et al.
Published: (2025) -
Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
by: Peters, Sydney, et al.
Published: (2025) -
Robustness of Large Language Models to Perturbations in Text
by: Singh, Ayush, et al.
Published: (2024) -
Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)