:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Rose, Michael E., Herrmann, Nils A., Erhardt, Sebastian
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2512.24459
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Logic Mill -- A Knowledge Navigation System
by: Erhardt, Sebastian, et al.
Published: (2022)

PaECTER: Patent-level Representation Learning using Citation-informed Transformers
by: Ghosh, Mainak, et al.
Published: (2024)

Tracing the Flow of Knowledge From Science to Technology Using Deep Learning
by: Rose, Michael E., et al.
Published: (2025)

LOCA: Logical Chain Augmentation for Scientific Corpus Cleaning
by: Fang, You-Le, et al.
Published: (2025)

Detection of Fake Generated Scientific Abstracts
by: Theocharopoulos, Panagiotis C., et al.
Published: (2023)

Does Scientific Writing Converge to U.S. English? Evidence from Generative AI-Assisted Publications
by: Filimonovic, Dragan, et al.
Published: (2025)

DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation
by: Ning, Xinyu, et al.
Published: (2024)

NUTSHELL: A Dataset for Abstract Generation from Scientific Talks
by: Züfle, Maike, et al.
Published: (2025)

Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data
by: Borisova, Ekaterina, et al.
Published: (2025)

Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation
by: Herrmann, Nils A., et al.
Published: (2026)

Classifying Graphemes in English Words Through the Application of a Fuzzy Inference System
by: Rose, Samuel, et al.
Published: (2024)

AINL-Eval 2025 Shared Task: Detection of AI-Generated Scientific Abstracts in Russian
by: Batura, Tatiana, et al.
Published: (2025)

LSTM-based Deep Neural Network With A Focus on Sentence Representation for Sequential Sentence Classification in Medical Scientific Abstracts
by: Lam, Phat, et al.
Published: (2024)

Grounding Fallacies Misrepresenting Scientific Publications in Evidence
by: Glockner, Max, et al.
Published: (2024)

HyperPIE: Hyperparameter Information Extraction from Scientific Publications
by: Saier, Tarek, et al.
Published: (2023)

MathClean: A Benchmark for Synthetic Mathematical Data Cleaning
by: Liang, Hao, et al.
Published: (2025)

Inclusivity in Large Language Models: Personality Traits and Gender Bias in Scientific Abstracts
by: Pervez, Naseela, et al.
Published: (2024)

Automatic Detection of Research Values from Scientific Abstracts Across Computer Science Subfields
by: Jiang, Hang, et al.
Published: (2025)

Automated Generation of Multiple-Choice Cloze Questions for Assessing English Vocabulary Using GPT-turbo 3.5
by: Wang, Qiao, et al.
Published: (2024)

Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection
by: Gamba, Federica, et al.
Published: (2025)

DynClean: Training Dynamics-based Label Cleaning for Distantly-Supervised Named Entity Recognition
by: Zhang, Qi, et al.
Published: (2025)

ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications
by: Takeshita, Sotaro, et al.
Published: (2024)

Language-agnostic, automated assessment of listeners' speech recall using large language models
by: Herrmann, Björn
Published: (2025)

Validation of the Scientific Literature via Chemputation Augmented by Large Language Models
by: Pagel, Sebastian, et al.
Published: (2024)

Data Augmentation Techniques for Process Extraction from Scientific Publications
by: Susanti, Yuni
Published: (2024)

CleanGraph: Human-in-the-loop Knowledge Graph Refinement and Completion
by: Bikaun, Tyler, et al.
Published: (2024)

SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts
by: Brinner, Marc, et al.
Published: (2025)

Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment
by: Yeh, Samuel, et al.
Published: (2025)

SimulRAG: Simulator-based RAG for Grounding LLMs in Long-form Scientific QA
by: Xu, Haozhou, et al.
Published: (2025)

Clean & Clear: Feasibility of Safe LLM Clinical Guidance
by: Ive, Julia, et al.
Published: (2025)

Delving into the Utilisation of ChatGPT in Scientific Publications in Astronomy
by: Astarita, Simone, et al.
Published: (2024)

Do Multilingual LLMs Think In English?
by: Schut, Lisa, et al.
Published: (2025)

CNS-Obsidian: A Neurosurgical Vision-Language Model Built From Scientific Publications
by: Alyakin, Anton, et al.
Published: (2025)

Clean Evaluations on Contaminated Visual Language Models
by: Lu, Hongyuan, et al.
Published: (2024)

AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations
by: Zhu, Minjun, et al.
Published: (2026)

Contrastive Learning with Enhanced Abstract Representations using Grouped Loss of Abstract Semantic Supervision
by: Suissa, Omri, et al.
Published: (2025)

HalluClean: A Unified Framework to Combat Hallucinations in LLMs
by: Zhao, Yaxin, et al.
Published: (2025)

CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models
by: Zhu, Wenhong, et al.
Published: (2023)

CleanComedy: Creating Friendly Humor through Generative Techniques
by: Vikhorev, Dmitry, et al.
Published: (2024)

Proceedings of the ISCA/ITG Workshop on Diversity in Large Speech and Language Models
by: Möller, Sebastian, et al.
Published: (2025)