| Main Authors: | Gromadzki, Michał, Wróblewska, Anna, Kaliska, Agnieszka |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.20006 |
Similar Items
Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification
by: Bucher, Martin Juan José, et al.
Published: (2024)
Detecting AI-Generated Texts in Cross-Domains
by: Zhou, You, et al.
Published: (2024)
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)
AEyeDE: An Attention-Based Attribution Framework for AI-Generated Text Detection
by: Nourbakhsh, Aria, et al.
Published: (2026)
A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text
by: Sagae, Alicia, et al.
Published: (2025)
PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of LLM-generated Text?
by: Petukhova, Kseniia, et al.
Published: (2024)
Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs
by: Tereshchenko, Yehor, et al.
Published: (2025)
Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
by: Peters, Sydney, et al.
Published: (2025)
Spotlights and Blindspots: Evaluating Machine-Generated Text Detection
by: Stowe, Kevin, et al.
Published: (2026)
A Lightweight Approach to Detection of AI-Generated Texts Using Stylometric Features
by: Aityan, Sergey K., et al.
Published: (2025)
mEdIT: Multilingual Text Editing via Instruction Tuning
by: Raheja, Vipul, et al.
Published: (2024)
Understanding the Effects of RLHF on the Quality and Detectability of LLM-Generated Texts
by: Xu, Beining, et al.
Published: (2025)
Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic
by: Pan, Muyu, et al.
Published: (2025)
LLMs and Memorization: On Quality and Specificity of Copyright Compliance
by: Mueller, Felix B, et al.
Published: (2024)
Ontology-Constrained Generation of Domain-Specific Clinical Summaries
by: Mehenni, Gaya, et al.
Published: (2024)
Intent Classification for Bank Chatbots through LLM Fine-Tuning
by: Lajčinová, Bibiána, et al.
Published: (2024)
Identifying Bias in Machine-generated Text Detection
by: Stowe, Kevin, et al.
Published: (2025)
Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models
by: Darm, Paul, et al.
Published: (2025)
Detecting Data Contamination in LLMs via In-Context Learning
by: Zawalski, Michał, et al.
Published: (2025)
A Comparative Study of DSL Code Generation: Fine-Tuning vs. Optimized Retrieval Augmentation
by: Bassamzadeh, Nastaran, et al.
Published: (2024)
Dual Debiasing for Noisy In-Context Learning for Text Generation
by: Liang, Siqi, et al.
Published: (2025)
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
by: Saji, Alan, et al.
Published: (2025)
Seeing Through the Fog: A Cost-Effectiveness Analysis of Hallucination Detection Systems
by: Thomas, Alexander, et al.
Published: (2024)
StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis
by: Opara, Chidimma
Published: (2024)
Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models
by: Hawkins, John
Published: (2025)
Reasoning over Uncertain Text by Generative Large Language Models
by: Nafar, Aliakbar, et al.
Published: (2024)
Beyond the Mean: Within-Model Reliable Change Detection for LLM Evaluation
by: Cacioli, Jon-Paul
Published: (2026)
Technical Report on the Pangram AI-Generated Text Classifier
by: Emi, Bradley, et al.
Published: (2024)
Xinyu: An Efficient LLM-based System for Commentary Generation
by: Wu, Yiquan, et al.
Published: (2024)
Exploiting LLM-as-a-Judge Disposition on Free Text Legal QA via Prompt Optimization
by: Elganayni, Mohamed Hesham, et al.
Published: (2026)
Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
Robustness of Large Language Models to Perturbations in Text
by: Singh, Ayush, et al.
Published: (2024)
Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters
by: Shah, Aaryan, et al.
Published: (2026)
Hallucination or Creativity: How to Evaluate AI-Generated Scientific Stories?
by: Argese, Alex, et al.
Published: (2026)
Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches
by: Mishra, Saurabh, et al.
Published: (2026)
Generative Active Testing: Efficient LLM Evaluation via Proxy Task Adaptation
by: Ramakrishnan, Aashish Anantha, et al.
Published: (2026)
Blessing or curse? A survey on the Impact of Generative AI on Fake News
by: Loth, Alexander, et al.
Published: (2024)
SPRInG: Continual LLM Personalization via Selective Parametric Adaptation and Retrieval-Interpolated Generation
by: Kim, Seoyeon, et al.
Published: (2026)
On Preserving the Knowledge of Long Clinical Texts
by: Hasan, Mohammad Junayed, et al.
Published: (2023)
Demystifying Instruction Mixing for Fine-tuning Large Language Models
by: Wang, Renxi, et al.
Published: (2023)