:: Library Catalog

Bibliographic Details
Main Authors:	Zhou, Yue; Di Eugenio, Barbara
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.16128

Similar Items

Unveiling Performance Challenges of Large Language Models in Low-Resource Healthcare: A Demographic Fairness Perspective
by: Zhou, Yue, et al.
Published: (2024)

Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning
by: Li, Alan, et al.
Published: (2025)

Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs
by: Dan, Nifu, et al.
Published: (2025)

Zero-Shot Belief: A Hard Problem for LLMs
by: Murzaku, John, et al.
Published: (2025)

Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
by: Xu, Xin, et al.
Published: (2025)

XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and Reasoning
by: Zhang, Zhihan, et al.
Published: (2025)

Reasoning Beyond Limits: Advances and Open Problems for LLMs
by: Ferrag, Mohamed Amine, et al.
Published: (2025)

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
by: Qu, Yuxiao, et al.
Published: (2025)

Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
by: Zhou, Yue, et al.
Published: (2024)

GIVE: Structured Reasoning of Large Language Models with Knowledge Graph Inspired Veracity Extrapolation
by: He, Jiashu, et al.
Published: (2024)

Investigating Bias: A Multilingual Pipeline for Generating, Solving, and Evaluating Math Problems with LLMs
by: Mahran, Mariam, et al.
Published: (2025)

Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning
by: Ma, Xuetao, et al.
Published: (2025)

Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
by: Niwa, Ayana, et al.
Published: (2025)

Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning?
by: Zhou, Ruochen, et al.
Published: (2025)

Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes
by: Myers, Quintin, et al.
Published: (2025)

Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models
by: Pang, Xinyu, et al.
Published: (2024)

Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation
by: Zhou, Yue, et al.
Published: (2024)

Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information
by: Kuznetsova, Elizaveta, et al.
Published: (2025)

Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?
by: Hase, Peter, et al.
Published: (2024)

Linear Reasoning vs. Proof by Cases: Obstacles for Large Language Models in FOL Problem Solving
by: Ji, Yuliang, et al.
Published: (2026)

Veracity: An Open-Source AI Fact-Checking System
by: Curtis, Taylor Lynn, et al.
Published: (2025)

Answer-Centric or Reasoning-Driven? Uncovering the Latent Memory Anchor in LLMs
by: Wu, Yang, et al.
Published: (2025)

Beyond Performance: Quantifying and Mitigating Label Bias in LLMs
by: Reif, Yuval, et al.
Published: (2024)

MERMAID: Memory-Enhanced Retrieval and Reasoning with Multi-Agent Iterative Knowledge Grounding for Veracity Assessment
by: Cao, Yupeng, et al.
Published: (2026)

NumPert: Numerical Perturbations to Probe Language Models for Veracity Prediction
by: Aarnes, Peter Røysland, et al.
Published: (2025)

From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning
by: Dinucu-Jianu, David, et al.
Published: (2025)

Self-consistent Reasoning For Solving Math Word Problems
by: Xiong, Jing, et al.
Published: (2022)

CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs
by: Wang, Kangsheng, et al.
Published: (2024)

Can LLMs Solve longer Math Word Problems Better?
by: Xu, Xin, et al.
Published: (2024)

Explaining Veracity Predictions with Evidence Summarization: A Multi-Task Model Approach
by: Cekinel, Recep Firat, et al.
Published: (2024)

Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models
by: Zhang, Yusen, et al.
Published: (2024)

Advances in LLM Reasoning Enable Flexibility in Clinical Problem-Solving
by: Shidara, Kie, et al.
Published: (2026)

Beyond English-Centric Training: How Reinforcement Learning Improves Cross-Lingual Reasoning in LLMs
by: Huang, Shulin, et al.
Published: (2025)

From Conversation to Automation: Leveraging LLMs for Problem-Solving Therapy Analysis
by: Aghakhani, Elham, et al.
Published: (2025)

Faithful-Patchscopes: Understanding and Mitigating Model Bias in Hidden Representations Explanation of Large Language Models
by: Gong, Xilin, et al.
Published: (2026)

Weakly Supervised Veracity Classification with LLM-Predicted Credibility Signals
by: Leite, João A., et al.
Published: (2023)

Perceived Political Bias in LLMs Reduces Persuasive Abilities
by: DiGiuseppe, Matthew, et al.
Published: (2026)

MALIBU Benchmark: Multi-Agent LLM Implicit Bias Uncovered
by: Mirza, Imran, et al.
Published: (2025)

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
by: Yue, Yang, et al.
Published: (2025)

The Hidden Puppet Master: Predicting Human Belief Change in Manipulative LLM Dialogues
by: Shen, Jocelyn, et al.
Published: (2026)