Saved in:
| Main Authors: | Giordano, Luca, Razniewski, Simon |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.06780 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Beyond Questions: Evaluating What Large Language Models (Actually) Know
by: Giordano, Luca, et al.
Published: (2026)
by: Giordano, Luca, et al.
Published: (2026)
Mining the Mind: What 100M Beliefs Reveal About Frontier LLM Knowledge
by: Ghosh, Shrestha, et al.
Published: (2025)
by: Ghosh, Shrestha, et al.
Published: (2025)
Enabling LLM Knowledge Analysis via Extensive Materialization
by: Hu, Yujia, et al.
Published: (2024)
by: Hu, Yujia, et al.
Published: (2024)
LLMpedia: A Transparent Framework to Materialize an LLM's Encyclopedic Knowledge at Scale
by: Saeed, Muhammed, et al.
Published: (2026)
by: Saeed, Muhammed, et al.
Published: (2026)
Pap2Pat: Benchmarking Outline-Guided Long-Text Patent Generation with Patent-Paper Pairs
by: Knappich, Valentin, et al.
Published: (2024)
by: Knappich, Valentin, et al.
Published: (2024)
Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval
by: Knappich, Valentin, et al.
Published: (2026)
by: Knappich, Valentin, et al.
Published: (2026)
A Solver-in-the-Loop Framework for Improving LLMs on Answer Set Programming for Logic Puzzle Solving
by: Schrader, Timo Pierre, et al.
Published: (2025)
by: Schrader, Timo Pierre, et al.
Published: (2025)
Lie to Me: Knowledge Graphs for Robust Hallucination Self-Detection in LLMs
by: Kale, Sahil, et al.
Published: (2025)
by: Kale, Sahil, et al.
Published: (2025)
Can Coding Agents Reproduce Findings in Computational Materials Science?
by: Huang, Ziyang, et al.
Published: (2026)
by: Huang, Ziyang, et al.
Published: (2026)
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
by: Zhao, Bingchen, et al.
Published: (2025)
by: Zhao, Bingchen, et al.
Published: (2025)
Towards Foundation Models for Knowledge Graph Reasoning
by: Galkin, Mikhail, et al.
Published: (2023)
by: Galkin, Mikhail, et al.
Published: (2023)
Terminal-World: Scaling Terminal-Agent Environments via Agent Skills
by: Cheng, Zihao, et al.
Published: (2026)
by: Cheng, Zihao, et al.
Published: (2026)
Cultural Commonsense Knowledge for Intercultural Dialogues
by: Nguyen, Tuan-Phong, et al.
Published: (2024)
by: Nguyen, Tuan-Phong, et al.
Published: (2024)
GPTKB v1.5: A Massive Knowledge Base for Exploring Factual LLM Knowledge
by: Hu, Yujia, et al.
Published: (2025)
by: Hu, Yujia, et al.
Published: (2025)
Incorporating Domain Knowledge into Materials Tokenization
by: Oh, Yerim, et al.
Published: (2025)
by: Oh, Yerim, et al.
Published: (2025)
SelfPrompt: Autonomously Evaluating LLM Robustness via Domain-Constrained Knowledge Guidelines and Refined Adversarial Prompts
by: Pei, Aihua, et al.
Published: (2024)
by: Pei, Aihua, et al.
Published: (2024)
SEMMA: A Semantic Aware Knowledge Graph Foundation Model
by: Arun, Arvindh, et al.
Published: (2025)
by: Arun, Arvindh, et al.
Published: (2025)
Beyond Completion: A Foundation Model for General Knowledge Graph Reasoning
by: Hua, Yin, et al.
Published: (2025)
by: Hua, Yin, et al.
Published: (2025)
Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model
by: Ye, Yanpeng, et al.
Published: (2024)
by: Ye, Yanpeng, et al.
Published: (2024)
Context-Robust Knowledge Editing for Language Models
by: Park, Haewon, et al.
Published: (2025)
by: Park, Haewon, et al.
Published: (2025)
Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach
by: Gundawar, Atharva, et al.
Published: (2024)
by: Gundawar, Atharva, et al.
Published: (2024)
LLM-Oriented Token-Adaptive Knowledge Distillation
by: Xie, Xurong, et al.
Published: (2025)
by: Xie, Xurong, et al.
Published: (2025)
The Knowledge-Behaviour Disconnect in LLM-based Chatbots
by: Broersen, Jan
Published: (2025)
by: Broersen, Jan
Published: (2025)
A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning
by: Cui, Yuanning, et al.
Published: (2024)
by: Cui, Yuanning, et al.
Published: (2024)
Efficient Adaptive Transformer: An Empirical Study and Reproducible Framework
by: Miller, Jan
Published: (2025)
by: Miller, Jan
Published: (2025)
LLaSO: A Foundational Framework for Reproducible Research in Large Language and Speech Model
by: Sun, Yirong, et al.
Published: (2025)
by: Sun, Yirong, et al.
Published: (2025)
TS-Reasoner: Aligning Time Series Foundation Models with LLM Reasoning
by: Yu, Fangxu, et al.
Published: (2025)
by: Yu, Fangxu, et al.
Published: (2025)
Krutrim LLM: Multilingual Foundational Model for over a Billion People
by: Kallappa, Aditya, et al.
Published: (2025)
by: Kallappa, Aditya, et al.
Published: (2025)
Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility
by: Chmura, Jacob, et al.
Published: (2025)
by: Chmura, Jacob, et al.
Published: (2025)
CTourLLM: Enhancing LLMs with Chinese Tourism Knowledge
by: Wei, Qikai, et al.
Published: (2024)
by: Wei, Qikai, et al.
Published: (2024)
Efficient Knowledge Infusion via KG-LLM Alignment
by: Jiang, Zhouyu, et al.
Published: (2024)
by: Jiang, Zhouyu, et al.
Published: (2024)
Benchmarking and Improving LLM Robustness for Personalized Generation
by: Okite, Chimaobi, et al.
Published: (2025)
by: Okite, Chimaobi, et al.
Published: (2025)
From Confidence to Collapse in LLM Factual Robustness
by: Fastowski, Alina, et al.
Published: (2025)
by: Fastowski, Alina, et al.
Published: (2025)
LLMs versus the Halting Problem: Characterizing Program Termination Reasoning
by: Sultan, Oren, et al.
Published: (2026)
by: Sultan, Oren, et al.
Published: (2026)
Reproducibility Study of "XRec: Large Language Models for Explainable Recommendation"
by: Mishra, Ranjan, et al.
Published: (2025)
by: Mishra, Ranjan, et al.
Published: (2025)
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models
by: Xu, Fangzhi, et al.
Published: (2023)
by: Xu, Fangzhi, et al.
Published: (2023)
RvLLM: LLM Runtime Verification with Domain Knowledge
by: Zhang, Yedi, et al.
Published: (2025)
by: Zhang, Yedi, et al.
Published: (2025)
Improving the Robustness of Knowledge-Grounded Dialogue via Contrastive Learning
by: Wang, Jiaan, et al.
Published: (2024)
by: Wang, Jiaan, et al.
Published: (2024)
Quantifying Self-diagnostic Atomic Knowledge in Chinese Medical Foundation Model: A Computational Analysis
by: Fan, Yaxin, et al.
Published: (2023)
by: Fan, Yaxin, et al.
Published: (2023)
Knowledge Distillation of LLM for Automatic Scoring of Science Education Assessments
by: Latif, Ehsan, et al.
Published: (2023)
by: Latif, Ehsan, et al.
Published: (2023)
Similar Items
-
Beyond Questions: Evaluating What Large Language Models (Actually) Know
by: Giordano, Luca, et al.
Published: (2026) -
Mining the Mind: What 100M Beliefs Reveal About Frontier LLM Knowledge
by: Ghosh, Shrestha, et al.
Published: (2025) -
Enabling LLM Knowledge Analysis via Extensive Materialization
by: Hu, Yujia, et al.
Published: (2024) -
LLMpedia: A Transparent Framework to Materialize an LLM's Encyclopedic Knowledge at Scale
by: Saeed, Muhammed, et al.
Published: (2026) -
Pap2Pat: Benchmarking Outline-Guided Long-Text Patent Generation with Patent-Paper Pairs
by: Knappich, Valentin, et al.
Published: (2024)