| Main Authors: | Zhou, Yue; Di Eugenio, Barbara |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2505.16128 |
Similar Items
Unveiling Performance Challenges of Large Language Models in Low-Resource Healthcare: A Demographic Fairness Perspective
by: Zhou, Yue, et al.
Published: (2024)
Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning
by: Li, Alan, et al.
Published: (2025)
Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs
by: Dan, Nifu, et al.
Published: (2025)
Zero-Shot Belief: A Hard Problem for LLMs
by: Murzaku, John, et al.
Published: (2025)
Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
by: Xu, Xin, et al.
Published: (2025)
XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and Reasoning
by: Zhang, Zhihan, et al.
Published: (2025)
Reasoning Beyond Limits: Advances and Open Problems for LLMs
by: Ferrag, Mohamed Amine, et al.
Published: (2025)
RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
by: Qu, Yuxiao, et al.
Published: (2025)
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
by: Zhou, Yue, et al.
Published: (2024)
GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation
by: He, Jiashu, et al.
Published: (2024)
Investigating Bias: A Multilingual Pipeline for Generating, Solving, and Evaluating Math Problems with LLMs
by: Mahran, Mariam, et al.
Published: (2025)
Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning
by: Ma, Xuetao, et al.
Published: (2025)
Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
by: Niwa, Ayana, et al.
Published: (2025)
Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning?
by: Zhou, Ruochen, et al.
Published: (2025)
Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes
by: Myers, Quintin, et al.
Published: (2025)
Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models
by: Pang, Xinyu, et al.
Published: (2024)
Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation
by: Zhou, Yue, et al.
Published: (2024)
Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information
by: Kuznetsova, Elizaveta, et al.
Published: (2025)
Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?
by: Hase, Peter, et al.
Published: (2024)
Linear Reasoning vs. Proof by Cases: Obstacles for Large Language Models in FOL Problem Solving
by: Ji, Yuliang, et al.
Published: (2026)
Veracity: An Open-Source AI Fact-Checking System
by: Curtis, Taylor Lynn, et al.
Published: (2025)
Answer-Centric or Reasoning-Driven? Uncovering the Latent Memory Anchor in LLMs
by: Wu, Yang, et al.
Published: (2025)
Beyond Performance: Quantifying and Mitigating Label Bias in LLMs
by: Reif, Yuval, et al.
Published: (2024)
MERMAID: Memory-Enhanced Retrieval and Reasoning with Multi-Agent Iterative Knowledge Grounding for Veracity Assessment
by: Cao, Yupeng, et al.
Published: (2026)
NumPert: Numerical Perturbations to Probe Language Models for Veracity Prediction
by: Aarnes, Peter Røysland, et al.
Published: (2025)
From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning
by: Dinucu-Jianu, David, et al.
Published: (2025)
Self-consistent Reasoning For Solving Math Word Problems
by: Xiong, Jing, et al.
Published: (2022)
CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs
by: Wang, Kangsheng, et al.
Published: (2024)
Can LLMs Solve longer Math Word Problems Better?
by: Xu, Xin, et al.
Published: (2024)
Explaining Veracity Predictions with Evidence Summarization: A Multi-Task Model Approach
by: Cekinel, Recep Firat, et al.
Published: (2024)
Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models
by: Zhang, Yusen, et al.
Published: (2024)
Advances in LLM Reasoning Enable Flexibility in Clinical Problem-Solving
by: Shidara, Kie, et al.
Published: (2026)
Beyond English-Centric Training: How Reinforcement Learning Improves Cross-Lingual Reasoning in LLMs
by: Huang, Shulin, et al.
Published: (2025)
From Conversation to Automation: Leveraging LLMs for Problem-Solving Therapy Analysis
by: Aghakhani, Elham, et al.
Published: (2025)
Faithful-Patchscopes: Understanding and Mitigating Model Bias in Hidden Representations Explanation of Large Language Models
by: Gong, Xilin, et al.
Published: (2026)
Weakly Supervised Veracity Classification with LLM-Predicted Credibility Signals
by: Leite, João A., et al.
Published: (2023)
Perceived Political Bias in LLMs Reduces Persuasive Abilities
by: DiGiuseppe, Matthew, et al.
Published: (2026)
MALIBU Benchmark: Multi-Agent LLM Implicit Bias Uncovered
by: Mirza, Imran, et al.
Published: (2025)
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
by: Yue, Yang, et al.
Published: (2025)
The Hidden Puppet Master: Predicting Human Belief Change in Manipulative LLM Dialogues
by: Shen, Jocelyn, et al.
Published: (2026)