:: Library Catalog

Bibliographic Details
Main Authors:	Plunkett, Dillon; Morris, Adam; Reddy, Keerthi; Morales, Jorge
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.17120

Similar Items

SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
by: Kirchhof, Michael, et al.
Published: (2025)

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits
by: Ghasemabadi, Amirhosein, et al.
Published: (2025)

Can Large Language Models (LLMs) Describe Pictures Like Children? A Comparative Corpus Study
by: Woloszyn, Hanna, et al.
Published: (2025)

Naming, Describing, and Quantifying Visual Objects in Humans and LLMs
by: Testoni, Alberto, et al.
Published: (2024)

Toward Cultural Interpretability: A Linguistic Anthropological Framework for Describing and Evaluating Large Language Models (LLMs)
by: Jones, Graham M., et al.
Published: (2024)

Interpretable dimensions support an effect of agentivity and telicity on split intransitivity
by: Neu, Eva, et al.
Published: (2025)

Can LLMs Estimate Cognitive Complexity of Reading Comprehension Items?
by: Hwang, Seonjeong, et al.
Published: (2025)

Comprehensive Modeling and Question Answering of Cancer Clinical Practice Guidelines using LLMs
by: Gupta, Bhumika, et al.
Published: (2025)

Dispersion Measures as Predictors of Lexical Decision Time, Word Familiarity, and Lexical Complexity
by: Nohejl, Adam, et al.
Published: (2025)

Argumentation for Explainable and Globally Contestable Decision Support with LLMs
by: Dejl, Adam, et al.
Published: (2026)

Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs
by: Raut, Ankush, et al.
Published: (2025)

Can LLMs Translate Human Instructions into a Reinforcement Learning Agent's Internal Emergent Symbolic Representation?
by: Ma, Ziqi, et al.
Published: (2025)

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
by: Tie, Guiyao, et al.
Published: (2025)

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
by: Ghosh, Sreyan, et al.
Published: (2024)

Rational Decision-Making Agent with Internalized Utility Judgment
by: Ye, Yining, et al.
Published: (2023)

LLMs Don't Know Their Own Decision Boundaries: The Unreliability of Self-Generated Counterfactual Explanations
by: Mayne, Harry, et al.
Published: (2025)

Legal Minds, Algorithmic Decisions: How LLMs Apply Constitutional Principles in Complex Scenarios
by: Bignotti, Camilla, et al.
Published: (2024)

InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
by: Beigi, Mohammad, et al.
Published: (2024)

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
by: Kamoi, Ryo, et al.
Published: (2024)

Crafting Interpretable Embeddings by Asking LLMs Questions
by: Benara, Vinamra, et al.
Published: (2024)

Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)

Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs
by: Hu, Nan, et al.
Published: (2024)

Hallucination Detection with the Internal Layers of LLMs
by: Preiß, Martin
Published: (2025)

Can AI Validate Science? Benchmarking LLMs for Accurate Scientific Claim $\rightarrow$ Evidence Reasoning
by: Javaji, Shashidhar Reddy, et al.
Published: (2025)

Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion
by: Vu, Tuan-Anh, et al.
Published: (2023)

Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making
by: Nguyen, Sang Quang, et al.
Published: (2025)

BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?
by: Chambon, Pierre, et al.
Published: (2025)

Probing the Lack of Stable Internal Beliefs in LLMs
by: Luo, Yifan, et al.
Published: (2026)

Mechanistic Interpretability of Cognitive Complexity in LLMs via Linear Probing using Bloom's Taxonomy
by: Raimondi, Bianca, et al.
Published: (2026)

Do Internal Layers of LLMs Reveal Patterns for Jailbreak Detection?
by: Kadali, Sri Durga Sai Sowmya, et al.
Published: (2025)

Enhancing Uncertainty Estimation in LLMs with Expectation of Aggregated Internal Belief
by: Xiao, Zeguan, et al.
Published: (2025)

Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation
by: Shen, Jiajun, et al.
Published: (2025)

The Straight and Narrow: Do LLMs Possess an Internal Moral Path?
by: Hu, Luoming, et al.
Published: (2026)

INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection
by: Chen, Chao, et al.
Published: (2024)

Talking Points: Describing and Localizing Pixels
by: Rusanovsky, Matan, et al.
Published: (2025)

Break Me If You Can: Self-Jailbreaking of Aligned LLMs via Lexical Insertion Prompting
by: Kulshreshtha, Devang, et al.
Published: (2026)

Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above
by: Balepur, Nishant, et al.
Published: (2025)

The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality
by: Newman, Benjamin, et al.
Published: (2025)

Retracing the Past: LLMs Emit Training Data When They Get Lost
by: Ko, Myeongseob, et al.
Published: (2025)

SceneGram: Conceptualizing and Describing Tangrams in Scene Context
by: Junker, Simeon, et al.
Published: (2025)