Saved in:
| Main Authors: | Rückert, Johannes, Bloch, Louise, Friedrich, Christoph M. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.19825 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains
by: Fonseca, Marcio, et al.
Published: (2023)
by: Fonseca, Marcio, et al.
Published: (2023)
Large Language Models as Evaluators for Scientific Synthesis
by: Evans, Julia, et al.
Published: (2024)
by: Evans, Julia, et al.
Published: (2024)
Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
by: Tan, Xingwei, et al.
Published: (2026)
by: Tan, Xingwei, et al.
Published: (2026)
CNS-Obsidian: A Neurosurgical Vision-Language Model Built From Scientific Publications
by: Alyakin, Anton, et al.
Published: (2025)
by: Alyakin, Anton, et al.
Published: (2025)
ChatVis: Automating Scientific Visualization with a Large Language Model
by: Mallick, Tanwi, et al.
Published: (2024)
by: Mallick, Tanwi, et al.
Published: (2024)
AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research
by: Zhao, Yilun, et al.
Published: (2025)
by: Zhao, Yilun, et al.
Published: (2025)
Draw with Thought: Unleashing Multimodal Reasoning for Scientific Diagram Generation
by: Cui, Zhiqing, et al.
Published: (2025)
by: Cui, Zhiqing, et al.
Published: (2025)
Toward Reliable Scientific Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models
by: Xiong, Guangzhi, et al.
Published: (2025)
by: Xiong, Guangzhi, et al.
Published: (2025)
Using Large Language Models for the Interpretation of Building Regulations
by: Fuchs, Stefan, et al.
Published: (2024)
by: Fuchs, Stefan, et al.
Published: (2024)
Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
by: Silvestri, Matteo, et al.
Published: (2025)
by: Silvestri, Matteo, et al.
Published: (2025)
GAPMAP: Mapping Scientific Knowledge Gaps in Biomedical Literature Using Large Language Models
by: Salem, Nourah M, et al.
Published: (2025)
by: Salem, Nourah M, et al.
Published: (2025)
Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets
by: Eiter, Thomas, et al.
Published: (2024)
by: Eiter, Thomas, et al.
Published: (2024)
Scientific Computing with Large Language Models
by: Culver, Christopher, et al.
Published: (2024)
by: Culver, Christopher, et al.
Published: (2024)
Word-level Annotation of GDPR Transparency Compliance in Privacy Policies using Large Language Models
by: Cory, Thomas, et al.
Published: (2025)
by: Cory, Thomas, et al.
Published: (2025)
Experiments or Outcomes? Probing Scientific Feasibility in Large Language Models
by: Mohammadi, Seyedali, et al.
Published: (2026)
by: Mohammadi, Seyedali, et al.
Published: (2026)
The Model Agreed, But Didn't Learn: Diagnosing Surface Compliance in Large Language Models
by: Gu, Xiaojie, et al.
Published: (2026)
by: Gu, Xiaojie, et al.
Published: (2026)
CityLens: Evaluating Large Vision-Language Models for Urban Socioeconomic Sensing
by: Liu, Tianhui, et al.
Published: (2025)
by: Liu, Tianhui, et al.
Published: (2025)
Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models
by: Chang, Edward Y.
Published: (2024)
by: Chang, Edward Y.
Published: (2024)
ArxEval: Evaluating Retrieval and Generation in Language Models for Scientific Literature
by: Sinha, Aarush, et al.
Published: (2025)
by: Sinha, Aarush, et al.
Published: (2025)
ChatSR: Multimodal Large Language Models for Scientific Formula Discovery
by: Li, Yanjie, et al.
Published: (2024)
by: Li, Yanjie, et al.
Published: (2024)
Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models
by: Xiong, Guangzhi, et al.
Published: (2024)
by: Xiong, Guangzhi, et al.
Published: (2024)
Towards Efficient Large Language Models for Scientific Text: A Review
by: To, Huy Quoc, et al.
Published: (2024)
by: To, Huy Quoc, et al.
Published: (2024)
Large Language Models for Automated Open-domain Scientific Hypotheses Discovery
by: Yang, Zonglin, et al.
Published: (2023)
by: Yang, Zonglin, et al.
Published: (2023)
Towards Automated Regulatory Compliance Verification in Financial Auditing with Large Language Models
by: Berger, Armin, et al.
Published: (2025)
by: Berger, Armin, et al.
Published: (2025)
Enhancing Large Language Models for Clinical Decision Support by Incorporating Clinical Practice Guidelines
by: Oniani, David, et al.
Published: (2024)
by: Oniani, David, et al.
Published: (2024)
To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models
by: Luo, Jiayun, et al.
Published: (2025)
by: Luo, Jiayun, et al.
Published: (2025)
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
by: Wang, Xiaoxuan, et al.
Published: (2023)
by: Wang, Xiaoxuan, et al.
Published: (2023)
SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning
by: Xu, Tianyang, et al.
Published: (2025)
by: Xu, Tianyang, et al.
Published: (2025)
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
by: Yan, Yibo, et al.
Published: (2025)
by: Yan, Yibo, et al.
Published: (2025)
MAC: A Live Benchmark for Multimodal Large Language Models in Scientific Understanding
by: Jiang, Mohan, et al.
Published: (2025)
by: Jiang, Mohan, et al.
Published: (2025)
Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?
by: Fonseca, Marcio, et al.
Published: (2024)
by: Fonseca, Marcio, et al.
Published: (2024)
Inclusivity in Large Language Models: Personality Traits and Gender Bias in Scientific Abstracts
by: Pervez, Naseela, et al.
Published: (2024)
by: Pervez, Naseela, et al.
Published: (2024)
ChemVA: Advancing Large Language Models on Chemical Reaction Diagrams Understanding
by: Rao, Mingyang, et al.
Published: (2026)
by: Rao, Mingyang, et al.
Published: (2026)
CausalVLBench: Benchmarking Visual Causal Reasoning in Large Vision-Language Models
by: Komanduri, Aneesh, et al.
Published: (2025)
by: Komanduri, Aneesh, et al.
Published: (2025)
Using Counterfactual Tasks to Evaluate the Generality of Analogical Reasoning in Large Language Models
by: Lewis, Martha, et al.
Published: (2024)
by: Lewis, Martha, et al.
Published: (2024)
Multilingual Training and Evaluation Resources for Vision-Language Models
by: Baiamonte, Daniela, et al.
Published: (2026)
by: Baiamonte, Daniela, et al.
Published: (2026)
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
by: Hu, Ming, et al.
Published: (2025)
by: Hu, Ming, et al.
Published: (2025)
On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models
by: Seo, Hoigi, et al.
Published: (2025)
by: Seo, Hoigi, et al.
Published: (2025)
Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models
by: Lee, Yi-Lun, et al.
Published: (2024)
by: Lee, Yi-Lun, et al.
Published: (2024)
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models
by: Jiang, Chaoya, et al.
Published: (2024)
by: Jiang, Chaoya, et al.
Published: (2024)
Similar Items
-
Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains
by: Fonseca, Marcio, et al.
Published: (2023) -
Large Language Models as Evaluators for Scientific Synthesis
by: Evans, Julia, et al.
Published: (2024) -
Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
by: Tan, Xingwei, et al.
Published: (2026) -
CNS-Obsidian: A Neurosurgical Vision-Language Model Built From Scientific Publications
by: Alyakin, Anton, et al.
Published: (2025) -
ChatVis: Automating Scientific Visualization with a Large Language Model
by: Mallick, Tanwi, et al.
Published: (2024)