Saved in:
| Main Authors: | Schrader, Timo Pierre, Lange, Lukas, Kaminski, Tobias, Razniewski, Simon, Friedrich, Annemarie |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.17093 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios
by: Schrader, Timo Pierre, et al.
Published: (2024)
by: Schrader, Timo Pierre, et al.
Published: (2024)
Pap2Pat: Benchmarking Outline-Guided Long-Text Patent Generation with Patent-Paper Pairs
by: Knappich, Valentin, et al.
Published: (2024)
by: Knappich, Valentin, et al.
Published: (2024)
Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval
by: Knappich, Valentin, et al.
Published: (2026)
by: Knappich, Valentin, et al.
Published: (2026)
ASP-FZN: A Translation-based Constraint Answer Set Solver
by: Eiter, Thomas, et al.
Published: (2025)
by: Eiter, Thomas, et al.
Published: (2025)
Foundations of LLM Knowledge Materialization: Termination, Reproducibility, Robustness
by: Giordano, Luca, et al.
Published: (2025)
by: Giordano, Luca, et al.
Published: (2025)
Beyond Questions: Evaluating What Large Language Models (Actually) Know
by: Giordano, Luca, et al.
Published: (2026)
by: Giordano, Luca, et al.
Published: (2026)
Relating Answer Set Programming and Many-sorted Logics for Formal Verification
by: Hansen, Zachary
Published: (2025)
by: Hansen, Zachary
Published: (2025)
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
by: Tyagi, Nemika, et al.
Published: (2024)
by: Tyagi, Nemika, et al.
Published: (2024)
Enabling LLM Knowledge Analysis via Extensive Materialization
by: Hu, Yujia, et al.
Published: (2024)
by: Hu, Yujia, et al.
Published: (2024)
PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims
by: Knappich, Valentin, et al.
Published: (2025)
by: Knappich, Valentin, et al.
Published: (2025)
Mining the Mind: What 100M Beliefs Reveal About Frontier LLM Knowledge
by: Ghosh, Shrestha, et al.
Published: (2025)
by: Ghosh, Shrestha, et al.
Published: (2025)
Question Answering with LLMs and Learning from Answer Sets
by: Borroto, Manuel, et al.
Published: (2025)
by: Borroto, Manuel, et al.
Published: (2025)
Guiding and Diversifying LLM-Based Story Generation via Answer Set Programming
by: Wang, Phoebe J., et al.
Published: (2024)
by: Wang, Phoebe J., et al.
Published: (2024)
From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems
by: Rahman, A M Muntasir, et al.
Published: (2024)
by: Rahman, A M Muntasir, et al.
Published: (2024)
Puzzle Solving using Reasoning of Large Language Models: A Survey
by: Giadikiaroglou, Panagiotis, et al.
Published: (2024)
by: Giadikiaroglou, Panagiotis, et al.
Published: (2024)
Logic-of-Thought: Empowering Large Language Models with Logic Programs for Solving Puzzles in Natural Language
by: Li, Naiqi, et al.
Published: (2025)
by: Li, Naiqi, et al.
Published: (2025)
Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning
by: Ma, Xuetao, et al.
Published: (2025)
by: Ma, Xuetao, et al.
Published: (2025)
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
by: Chen, Jiangjie, et al.
Published: (2025)
by: Chen, Jiangjie, et al.
Published: (2025)
AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports
by: Lange, Lukas, et al.
Published: (2024)
by: Lange, Lukas, et al.
Published: (2024)
Quantifying over Optimum Answer Sets
by: Mazzotta, Giuseppe, et al.
Published: (2024)
by: Mazzotta, Giuseppe, et al.
Published: (2024)
SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas
by: Wei, Anjiang, et al.
Published: (2025)
by: Wei, Anjiang, et al.
Published: (2025)
Integrating Expert Knowledge into Logical Programs via LLMs
by: Górski, Franciszek, et al.
Published: (2025)
by: Górski, Franciszek, et al.
Published: (2025)
Evaluating Implicit Biases in LLM Reasoning through Logic Grid Puzzles
by: Jahara, Fatima, et al.
Published: (2025)
by: Jahara, Fatima, et al.
Published: (2025)
Collective Reasoning Among LLMs: A Framework for Answer Validation Without Ground Truth
by: Davoudi, Seyed Pouyan Mousavi, et al.
Published: (2025)
by: Davoudi, Seyed Pouyan Mousavi, et al.
Published: (2025)
Solving Decision Theory Problems with Probabilistic Answer Set Programming
by: Azzolini, Damiano, et al.
Published: (2024)
by: Azzolini, Damiano, et al.
Published: (2024)
A Machine Learning-based Approach for Solving Recurrence Relations and its use in Cost Analysis of Logic Programs
by: Rustenholz, Louis, et al.
Published: (2024)
by: Rustenholz, Louis, et al.
Published: (2024)
Applications of Intuitionistic Temporal Logic to Temporal Answer Set Programming
by: Cabalar, Pedro, et al.
Published: (2026)
by: Cabalar, Pedro, et al.
Published: (2026)
Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs
by: English, William, et al.
Published: (2025)
by: English, William, et al.
Published: (2025)
Computational methods for Dynamic Answer Set Programming
by: Hahn, Susana
Published: (2025)
by: Hahn, Susana
Published: (2025)
PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles
by: Long, Yitao, et al.
Published: (2025)
by: Long, Yitao, et al.
Published: (2025)
Debating with More Persuasive LLMs Leads to More Truthful Answers
by: Khan, Akbir, et al.
Published: (2024)
by: Khan, Akbir, et al.
Published: (2024)
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
by: Yao, Jihan, et al.
Published: (2024)
by: Yao, Jihan, et al.
Published: (2024)
FoodPuzzle: Developing Large Language Model Agents as Flavor Scientists
by: Huang, Tenghao, et al.
Published: (2024)
by: Huang, Tenghao, et al.
Published: (2024)
Are LLMs Good Cryptic Crossword Solvers?
by: Sadallah, Abdelrahman, et al.
Published: (2024)
by: Sadallah, Abdelrahman, et al.
Published: (2024)
SM-based Semantics for Answer Set Programs Containing Conditional Literals and Arithmetic
by: Hansen, Zachary, et al.
Published: (2025)
by: Hansen, Zachary, et al.
Published: (2025)
Integrated Framework for LLM Evaluation with Answer Generation
by: Lee, Sujeong, et al.
Published: (2025)
by: Lee, Sujeong, et al.
Published: (2025)
How Multimodal LLMs Solve Image Tasks: A Lens on Visual Grounding, Task Reasoning, and Answer Decoding
by: Yu, Zhuoran, et al.
Published: (2025)
by: Yu, Zhuoran, et al.
Published: (2025)
Context Over Compute Human-in-the-Loop Outperforms Iterative Chain-of-Thought Prompting in Interview Answer Quality
by: Zhu, Kewen, et al.
Published: (2026)
by: Zhu, Kewen, et al.
Published: (2026)
The Two Sides of the Coin: Hallucination Generation and Detection with LLMs as Evaluators for LLMs
by: Bui, Anh Thu Maria, et al.
Published: (2024)
by: Bui, Anh Thu Maria, et al.
Published: (2024)
The Potential of LLMs in Medical Education: Generating Questions and Answers for Qualification Exams
by: Zhu, Yunqi, et al.
Published: (2024)
by: Zhu, Yunqi, et al.
Published: (2024)
Similar Items
-
QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios
by: Schrader, Timo Pierre, et al.
Published: (2024) -
Pap2Pat: Benchmarking Outline-Guided Long-Text Patent Generation with Patent-Paper Pairs
by: Knappich, Valentin, et al.
Published: (2024) -
Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval
by: Knappich, Valentin, et al.
Published: (2026) -
ASP-FZN: A Translation-based Constraint Answer Set Solver
by: Eiter, Thomas, et al.
Published: (2025) -
Foundations of LLM Knowledge Materialization: Termination, Reproducibility, Robustness
by: Giordano, Luca, et al.
Published: (2025)