Saved in:
| Main Authors: | Rose, Michael E., Herrmann, Nils A., Erhardt, Sebastian |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.24459 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Logic Mill -- A Knowledge Navigation System
by: Erhardt, Sebastian, et al.
Published: (2022)
by: Erhardt, Sebastian, et al.
Published: (2022)
PaECTER: Patent-level Representation Learning using Citation-informed Transformers
by: Ghosh, Mainak, et al.
Published: (2024)
by: Ghosh, Mainak, et al.
Published: (2024)
Tracing the Flow of Knowledge From Science to Technology Using Deep Learning
by: Rose, Michael E., et al.
Published: (2025)
by: Rose, Michael E., et al.
Published: (2025)
LOCA: Logical Chain Augmentation for Scientific Corpus Cleaning
by: Fang, You-Le, et al.
Published: (2025)
by: Fang, You-Le, et al.
Published: (2025)
Detection of Fake Generated Scientific Abstracts
by: Theocharopoulos, Panagiotis C., et al.
Published: (2023)
by: Theocharopoulos, Panagiotis C., et al.
Published: (2023)
Does Scientific Writing Converge to U.S. English? Evidence from Generative AI-Assisted Publications
by: Filimonovic, Dragan, et al.
Published: (2025)
by: Filimonovic, Dragan, et al.
Published: (2025)
DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation
by: Ning, Xinyu, et al.
Published: (2024)
by: Ning, Xinyu, et al.
Published: (2024)
NUTSHELL: A Dataset for Abstract Generation from Scientific Talks
by: Züfle, Maike, et al.
Published: (2025)
by: Züfle, Maike, et al.
Published: (2025)
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data
by: Borisova, Ekaterina, et al.
Published: (2025)
by: Borisova, Ekaterina, et al.
Published: (2025)
Beyond Hate: Differentiating Uncivil and Intolerant Speech in Multimodal Content Moderation
by: Herrmann, Nils A., et al.
Published: (2026)
by: Herrmann, Nils A., et al.
Published: (2026)
Classifying Graphemes in English Words Through the Application of a Fuzzy Inference System
by: Rose, Samuel, et al.
Published: (2024)
by: Rose, Samuel, et al.
Published: (2024)
AINL-Eval 2025 Shared Task: Detection of AI-Generated Scientific Abstracts in Russian
by: Batura, Tatiana, et al.
Published: (2025)
by: Batura, Tatiana, et al.
Published: (2025)
LSTM-based Deep Neural Network With A Focus on Sentence Representation for Sequential Sentence Classification in Medical Scientific Abstracts
by: Lam, Phat, et al.
Published: (2024)
by: Lam, Phat, et al.
Published: (2024)
Grounding Fallacies Misrepresenting Scientific Publications in Evidence
by: Glockner, Max, et al.
Published: (2024)
by: Glockner, Max, et al.
Published: (2024)
HyperPIE: Hyperparameter Information Extraction from Scientific Publications
by: Saier, Tarek, et al.
Published: (2023)
by: Saier, Tarek, et al.
Published: (2023)
MathClean: A Benchmark for Synthetic Mathematical Data Cleaning
by: Liang, Hao, et al.
Published: (2025)
by: Liang, Hao, et al.
Published: (2025)
Inclusivity in Large Language Models: Personality Traits and Gender Bias in Scientific Abstracts
by: Pervez, Naseela, et al.
Published: (2024)
by: Pervez, Naseela, et al.
Published: (2024)
Automatic Detection of Research Values from Scientific Abstracts Across Computer Science Subfields
by: Jiang, Hang, et al.
Published: (2025)
by: Jiang, Hang, et al.
Published: (2025)
Automated Generation of Multiple-Choice Cloze Questions for Assessing English Vocabulary Using GPT-turbo 3.5
by: Wang, Qiao, et al.
Published: (2024)
by: Wang, Qiao, et al.
Published: (2024)
Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection
by: Gamba, Federica, et al.
Published: (2025)
by: Gamba, Federica, et al.
Published: (2025)
DynClean: Training Dynamics-based Label Cleaning for Distantly-Supervised Named Entity Recognition
by: Zhang, Qi, et al.
Published: (2025)
by: Zhang, Qi, et al.
Published: (2025)
ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications
by: Takeshita, Sotaro, et al.
Published: (2024)
by: Takeshita, Sotaro, et al.
Published: (2024)
Language-agnostic, automated assessment of listeners' speech recall using large language models
by: Herrmann, Björn
Published: (2025)
by: Herrmann, Björn
Published: (2025)
Validation of the Scientific Literature via Chemputation Augmented by Large Language Models
by: Pagel, Sebastian, et al.
Published: (2024)
by: Pagel, Sebastian, et al.
Published: (2024)
Data Augmentation Techniques for Process Extraction from Scientific Publications
by: Susanti, Yuni
Published: (2024)
by: Susanti, Yuni
Published: (2024)
CleanGraph: Human-in-the-loop Knowledge Graph Refinement and Completion
by: Bikaun, Tyler, et al.
Published: (2024)
by: Bikaun, Tyler, et al.
Published: (2024)
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts
by: Brinner, Marc, et al.
Published: (2025)
by: Brinner, Marc, et al.
Published: (2025)
Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment
by: Yeh, Samuel, et al.
Published: (2025)
by: Yeh, Samuel, et al.
Published: (2025)
SimulRAG: Simulator-based RAG for Grounding LLMs in Long-form Scientific QA
by: Xu, Haozhou, et al.
Published: (2025)
by: Xu, Haozhou, et al.
Published: (2025)
Clean & Clear: Feasibility of Safe LLM Clinical Guidance
by: Ive, Julia, et al.
Published: (2025)
by: Ive, Julia, et al.
Published: (2025)
Delving into the Utilisation of ChatGPT in Scientific Publications in Astronomy
by: Astarita, Simone, et al.
Published: (2024)
by: Astarita, Simone, et al.
Published: (2024)
Do Multilingual LLMs Think In English?
by: Schut, Lisa, et al.
Published: (2025)
by: Schut, Lisa, et al.
Published: (2025)
CNS-Obsidian: A Neurosurgical Vision-Language Model Built From Scientific Publications
by: Alyakin, Anton, et al.
Published: (2025)
by: Alyakin, Anton, et al.
Published: (2025)
Clean Evaluations on Contaminated Visual Language Models
by: Lu, Hongyuan, et al.
Published: (2024)
by: Lu, Hongyuan, et al.
Published: (2024)
AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations
by: Zhu, Minjun, et al.
Published: (2026)
by: Zhu, Minjun, et al.
Published: (2026)
Contrastive Learning with Enhanced Abstract Representations using Grouped Loss of Abstract Semantic Supervision
by: Suissa, Omri, et al.
Published: (2025)
by: Suissa, Omri, et al.
Published: (2025)
HalluClean: A Unified Framework to Combat Hallucinations in LLMs
by: Zhao, Yaxin, et al.
Published: (2025)
by: Zhao, Yaxin, et al.
Published: (2025)
CLEAN-EVAL: Clean Evaluation on Contaminated Large Language Models
by: Zhu, Wenhong, et al.
Published: (2023)
by: Zhu, Wenhong, et al.
Published: (2023)
CleanComedy: Creating Friendly Humor through Generative Techniques
by: Vikhorev, Dmitry, et al.
Published: (2024)
by: Vikhorev, Dmitry, et al.
Published: (2024)
Proceedings of the ISCA/ITG Workshop on Diversity in Large Speech and Language Models
by: Möller, Sebastian, et al.
Published: (2025)
by: Möller, Sebastian, et al.
Published: (2025)
Similar Items
-
Logic Mill -- A Knowledge Navigation System
by: Erhardt, Sebastian, et al.
Published: (2022) -
PaECTER: Patent-level Representation Learning using Citation-informed Transformers
by: Ghosh, Mainak, et al.
Published: (2024) -
Tracing the Flow of Knowledge From Science to Technology Using Deep Learning
by: Rose, Michael E., et al.
Published: (2025) -
LOCA: Logical Chain Augmentation for Scientific Corpus Cleaning
by: Fang, You-Le, et al.
Published: (2025) -
Detection of Fake Generated Scientific Abstracts
by: Theocharopoulos, Panagiotis C., et al.
Published: (2023)