:: Library Catalog

Bibliographic Details
Main Authors:	Gromadzki, Michał; Wróblewska, Anna; Kaliska, Agnieszka
Format:	Preprint
Published:	2026
Subjects:	Computation and Language; Artificial Intelligence; I.2.7
Online Access:	https://arxiv.org/abs/2601.20006

Similar Items

Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification
by: Bucher, Martin Juan José, et al.
Published: (2024)

Detecting AI-Generated Texts in Cross-Domains
by: Zhou, You, et al.
Published: (2024)

Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)

AEyeDE: An Attention-Based Attribution Framework for AI-Generated Text Detection
by: Nourbakhsh, Aria, et al.
Published: (2026)

A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text
by: Sagae, Alicia, et al.
Published: (2025)

PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of LLM-generated Text?
by: Petukhova, Kseniia, et al.
Published: (2024)

Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs
by: Tereshchenko, Yehor, et al.
Published: (2025)

Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
by: Peters, Sydney, et al.
Published: (2025)

Spotlights and Blindspots: Evaluating Machine-Generated Text Detection
by: Stowe, Kevin, et al.
Published: (2026)

A Lightweight Approach to Detection of AI-Generated Texts Using Stylometric Features
by: Aityan, Sergey K., et al.
Published: (2025)

mEdIT: Multilingual Text Editing via Instruction Tuning
by: Raheja, Vipul, et al.
Published: (2024)

Understanding the Effects of RLHF on the Quality and Detectability of LLM-Generated Texts
by: Xu, Beining, et al.
Published: (2025)

Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic
by: Pan, Muyu, et al.
Published: (2025)

LLMs and Memorization: On Quality and Specificity of Copyright Compliance
by: Mueller, Felix B, et al.
Published: (2024)

Ontology-Constrained Generation of Domain-Specific Clinical Summaries
by: Mehenni, Gaya, et al.
Published: (2024)

Intent Classification for Bank Chatbots through LLM Fine-Tuning
by: Lajčinová, Bibiána, et al.
Published: (2024)

Identifying Bias in Machine-generated Text Detection
by: Stowe, Kevin, et al.
Published: (2025)

Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models
by: Darm, Paul, et al.
Published: (2025)

Detecting Data Contamination in LLMs via In-Context Learning
by: Zawalski, Michał, et al.
Published: (2025)

A Comparative Study of DSL Code Generation: Fine-Tuning vs. Optimized Retrieval Augmentation
by: Bassamzadeh, Nastaran, et al.
Published: (2024)

Dual Debiasing for Noisy In-Context Learning for Text Generation
by: Liang, Siqi, et al.
Published: (2025)

RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
by: Saji, Alan, et al.
Published: (2025)

Seeing Through the Fog: A Cost-Effectiveness Analysis of Hallucination Detection Systems
by: Thomas, Alexander, et al.
Published: (2024)

StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis
by: Opara, Chidimma
Published: (2024)

Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models
by: Hawkins, John
Published: (2025)

Reasoning over Uncertain Text by Generative Large Language Models
by: Nafar, Aliakbar, et al.
Published: (2024)

Beyond the Mean: Within-Model Reliable Change Detection for LLM Evaluation
by: Cacioli, Jon-Paul
Published: (2026)

Technical Report on the Pangram AI-Generated Text Classifier
by: Emi, Bradley, et al.
Published: (2024)

Xinyu: An Efficient LLM-based System for Commentary Generation
by: Wu, Yiquan, et al.
Published: (2024)

Exploiting LLM-as-a-Judge Disposition on Free Text Legal QA via Prompt Optimization
by: Elganayni, Mohamed Hesham, et al.
Published: (2026)

Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)

Robustness of Large Language Models to Perturbations in Text
by: Singh, Ayush, et al.
Published: (2024)

Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters
by: Shah, Aaryan, et al.
Published: (2026)

Hallucination or Creativity: How to Evaluate AI-Generated Scientific Stories?
by: Argese, Alex, et al.
Published: (2026)

Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches
by: Mishra, Saurabh, et al.
Published: (2026)

Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2026)

Blessing or curse? A survey on the Impact of Generative AI on Fake News
by: Loth, Alexander, et al.
Published: (2024)

SPRInG: Continual LLM Personalization via Selective Parametric Adaptation and Retrieval-Interpolated Generation
by: Kim, Seoyeon, et al.
Published: (2026)

On Preserving the Knowledge of Long Clinical Texts
by: Hasan, Mohammad Junayed, et al.
Published: (2023)

Demystifying Instruction Mixing for Fine-tuning Large Language Models
by: Wang, Renxi, et al.
Published: (2023)