:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sagae, Alicia, Lee, Chia-Jung, Avula, Sandeep, Dang, Brandon, Murdock, Vanessa
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence I.2.7
Online Access:	https://arxiv.org/abs/2510.20782
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of LLM-generated Text?
by: Petukhova, Kseniia, et al.
Published: (2024)

Evaluating the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset
by: Palit, Sayon, et al.
Published: (2025)

On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text
by: Gromadzki, Michał, et al.
Published: (2026)

SEED: Enhancing Text-to-SQL Performance and Practical Usability Through Automatic Evidence Generation
by: Yun, Janghyeon, et al.
Published: (2025)

Layer-Aware Embedding Fusion for LLMs in Text Classifications
by: Gwak, Jiho, et al.
Published: (2025)

Text-Based Approaches to Item Difficulty Modeling in Large-Scale Assessments: A Systematic Review
by: Peters, Sydney, et al.
Published: (2025)

LLM generated responses to mitigate the impact of hate speech
by: Podolak, Jakub, et al.
Published: (2023)

SAGE: Hierarchical LLM-Based Literary Evaluation through Ontology-Grounded Interpretive Dimensions
by: Wang, Tianyu, et al.
Published: (2026)

Toward Architecture-Aware Evaluation Metrics for LLM Agents
by: Souza, Débora, et al.
Published: (2026)

SeLeRoSa: Sentence-Level Romanian Satire Detection Dataset
by: Smădu, Răzvan-Alexandru, et al.
Published: (2025)

Fuzzy, Symbolic, and Contextual: Enhancing LLM Instruction via Cognitive Scaffolding
by: Figueiredo, Vanessa
Published: (2025)

Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)

Mitigating Trojanized Prompt Chains in Educational LLM Use Cases: Experimental Findings and Detection Tool Design
by: Charles, Richard M., et al.
Published: (2025)

A Fuzzy Logic Prompting Framework for Large Language Models in Adaptive and Uncertain Tasks
by: Figueiredo, Vanessa
Published: (2025)

Identifying Bias in Machine-generated Text Detection
by: Stowe, Kevin, et al.
Published: (2025)

CRISP: Persistent Concept Unlearning via Sparse Autoencoders
by: Ashuach, Tomer, et al.
Published: (2025)

The Curious Case of Visual Grounding: Different Effects for Speech- and Text-based Language Encoders
by: Sauter, Adrian, et al.
Published: (2025)

LLM Uncertainty Quantification through Directional Entailment Graph and Claim Level Response Augmentation
by: Da, Longchao, et al.
Published: (2024)

Emergent Lexical Semantics in Neural Language Models: Testing Martin's Law on LLM-Generated Text
by: Kugler, Kai
Published: (2025)

Decoding-Free Sampling Strategies for LLM Marginalization
by: Pohl, David, et al.
Published: (2025)

Extreme AutoML: Analysis of Classification, Regression, and NLP Performance
by: Ratner, Edward, et al.
Published: (2024)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)

ML-Promise: A Multilingual Dataset for Corporate Promise Verification
by: Seki, Yohei, et al.
Published: (2024)

Mobile Phone Sensor-based Nigerian Driving Dataset to Detect Alcohol-influenced Behaviours
by: Thompson, Iniakpokeikiye Peter, et al.
Published: (2025)

Introducing Three New Benchmark Datasets for Hierarchical Text Classification
by: Toit, Jaco du, et al.
Published: (2024)

Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)

Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
by: Dang, Thao Anh, et al.
Published: (2024)

Robustness of Large Language Models to Perturbations in Text
by: Singh, Ayush, et al.
Published: (2024)

Knowledge Graphs as the Missing Data Layer for LLM-Based Industrial Asset Operations
by: Mandarapu, Madhulatha, et al.
Published: (2026)

FC-TTS: Style and Timbre Control in Zero-Shot Text-to-Speech with Disentangled Speech Representations
by: Lee, Yoonhyung, et al.
Published: (2026)

Blocks Architecture (BloArk): Efficient, Cost-Effective, and Incremental Dataset Architecture for Wikipedia Revision History
by: Li, Lingxi, et al.
Published: (2024)

LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
by: Hashemi, Helia, et al.
Published: (2024)

DimStance: Multilingual Datasets for Dimensional Stance Analysis
by: Becker, Jonas, et al.
Published: (2026)

Contrasting Linguistic Patterns in Human and LLM-Generated News Text
by: Muñoz-Ortiz, Alberto, et al.
Published: (2023)

RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
by: Saji, Alan, et al.
Published: (2025)

Exploiting LLM-as-a-Judge Disposition on Free Text Legal QA via Prompt Optimization
by: Elganayni, Mohamed Hesham, et al.
Published: (2026)

Heimdall: test-time scaling on the generative verification
by: Shi, Wenlei, et al.
Published: (2025)

Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters
by: Shah, Aaryan, et al.
Published: (2026)

Credit C-GPT: A Domain-Specialized Large Language Model for Conversational Understanding in Vietnamese Debt Collection
by: Hong, Nhung Nguyen Thi, et al.
Published: (2026)

PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions
by: Dai, Song, et al.
Published: (2025)