:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Mayilvaghanan, Kawin, Gupta, Siddhant, Kumar, Ayush
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Computation and Language Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2508.13124
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Counterfactual Fairness Evaluation of LLM-Based Contact Center Agent Quality Assurance System
di: Mayilvaghanan, Kawin, et al.
Pubblicazione: (2026)

Linguistic Blind Spots in Clinical Decision Extraction
di: Elgaar, Mohamed, et al.
Pubblicazione: (2026)

Fluent but Unfeeling: The Emotional Blind Spots of Language Models
di: Shu, Bangzhao, et al.
Pubblicazione: (2025)

Simple Linguistic Inferences of Large Language Models (LLMs): Blind Spots and Blinds
di: Basmov, Victoria, et al.
Pubblicazione: (2023)

Linguistic Blind Spots of Large Language Models
di: Cheng, Jiali, et al.
Pubblicazione: (2025)

Belief in the Machine: Investigating Epistemological Blind Spots of Language Models
di: Suzgun, Mirac, et al.
Pubblicazione: (2024)

AraSpot: Arabic Spoken Command Spotting
di: Salhab, Mahmoud, et al.
Pubblicazione: (2023)

Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation
di: Devanathan, Rishikesh, et al.
Pubblicazione: (2025)

ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
di: Dong, Wenhan, et al.
Pubblicazione: (2025)

Learning to Verify Summary Facts with Fine-Grained LLM Feedback
di: Oh, Jihwan, et al.
Pubblicazione: (2024)

A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting
di: Li, Yuang, et al.
Pubblicazione: (2023)

Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text
di: Li, Yafu, et al.
Pubblicazione: (2024)

Question Answering on Patient Medical Records with Private Fine-Tuned LLMs
di: Kothari, Sara, et al.
Pubblicazione: (2025)

Self-Correction Bench: Uncovering and Addressing the Self-Correction Blind Spot in Large Language Models
di: Tsui, Ken
Pubblicazione: (2025)

Systematic Biases in LLM Simulations of Debates
di: Taubenfeld, Amir, et al.
Pubblicazione: (2024)

Blind Spots in the Guard: How Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems
di: Pai, Aaditya
Pubblicazione: (2026)

Positional Failures in Long-Context LLMs: A Blind Spot in Reasoning Benchmarks
di: Zhang, Chuyifei, et al.
Pubblicazione: (2026)

Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP
di: Gautam, Sanjana, et al.
Pubblicazione: (2024)

Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots
di: Shayegani, Erfan, et al.
Pubblicazione: (2025)

DOCE: Finding the Sweet Spot for Execution-Based Code Generation
di: Li, Haau-Sing, et al.
Pubblicazione: (2024)

Signs of Struggle: Spotting Cognitive Distortions across Language and Register
di: Kuber, Abhishek, et al.
Pubblicazione: (2025)

Biases in the Blind Spot: Detecting What LLMs Fail to Mention
di: Arcuschin, Iván, et al.
Pubblicazione: (2026)

LUDOBENCH: Evaluating LLM Behavioural Decision-Making Through Spot-Based Board Game Scenarios in Ludo
di: Jain, Ojas, et al.
Pubblicazione: (2026)

Fine-grained Narrative Classification in Biased News Articles
di: Afroz, Zeba, et al.
Pubblicazione: (2025)

Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs
di: Singh, Sanjeet, et al.
Pubblicazione: (2024)

Robust and Fine-Grained Detection of AI Generated Texts
di: Kadiyala, Ram Mohan Rao, et al.
Pubblicazione: (2025)

Simulation, Modelling and Classification of Wiki Contributors: Spotting The Good, The Bad, and The Ugly
di: Méndez, Silvia García, et al.
Pubblicazione: (2024)

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
di: Hans, Abhimanyu, et al.
Pubblicazione: (2024)

Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning
di: Pang, Jinlong, et al.
Pubblicazione: (2025)

Catching The Correct Answer Trap: Characterising AI Tutor Blind Spots When Analysing Student Reasoning
di: Imran, Moiz, et al.
Pubblicazione: (2026)

Do Cognitively Interpretable Reasoning Traces Improve LLM Performance?
di: Bhambri, Siddhant, et al.
Pubblicazione: (2025)

Bona fide Cross Testing Reveals Weak Spot in Audio Deepfake Detection Systems
di: Kwok, Chin Yuen, et al.
Pubblicazione: (2025)

On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
di: Norouzi, Sanaz Saki, et al.
Pubblicazione: (2025)

To Err Is Human: Systematic Quantification of Errors in Published AI Papers via LLM Analysis
di: Bianchi, Federico, et al.
Pubblicazione: (2025)

Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information
di: Ethayarajh, Kawin, et al.
Pubblicazione: (2021)

SpotIt+: Verification-based Text-to-SQL Evaluation with Database Constraints
di: Tremante, Andrew, et al.
Pubblicazione: (2026)

Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation
di: Shin, Jisu, et al.
Pubblicazione: (2025)

Self-Blinding and Counterfactual Self-Simulation Mitigate Biases and Sycophancy in Large Language Models
di: Christian, Brian, et al.
Pubblicazione: (2026)

SpotIt: Evaluating Text-to-SQL Evaluation with Formal Verification
di: Klopfenstein, Rocky, et al.
Pubblicazione: (2025)

SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning
di: Peng, Jinjun, et al.
Pubblicazione: (2026)