| Main Authors: | Plunkett, Dillon; Morris, Adam; Reddy, Keerthi; Morales, Jorge |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.17120 |
Similar Items
SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
by: Kirchhof, Michael, et al.
Published: (2025)
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits
by: Ghasemabadi, Amirhosein, et al.
Published: (2025)
Can Large Language Models (LLMs) Describe Pictures Like Children? A Comparative Corpus Study
by: Woloszyn, Hanna, et al.
Published: (2025)
Naming, Describing, and Quantifying Visual Objects in Humans and LLMs
by: Testoni, Alberto, et al.
Published: (2024)
Toward Cultural Interpretability: A Linguistic Anthropological Framework for Describing and Evaluating Large Language Models (LLMs)
by: Jones, Graham M., et al.
Published: (2024)
Interpretable dimensions support an effect of agentivity and telicity on split intransitivity
by: Neu, Eva, et al.
Published: (2025)
Can LLMs Estimate Cognitive Complexity of Reading Comprehension Items?
by: Hwang, Seonjeong, et al.
Published: (2025)
Comprehensive Modeling and Question Answering of Cancer Clinical Practice Guidelines using LLMs
by: Gupta, Bhumika, et al.
Published: (2025)
Dispersion Measures as Predictors of Lexical Decision Time, Word Familiarity, and Lexical Complexity
by: Nohejl, Adam, et al.
Published: (2025)
Argumentation for Explainable and Globally Contestable Decision Support with LLMs
by: Dejl, Adam, et al.
Published: (2026)
Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs
by: Raut, Ankush, et al.
Published: (2025)
Can LLMs Translate Human Instructions into a Reinforcement Learning Agent's Internal Emergent Symbolic Representation?
by: Ma, Ziqi, et al.
Published: (2025)
Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
by: Tie, Guiyao, et al.
Published: (2025)
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
by: Ghosh, Sreyan, et al.
Published: (2024)
Rational Decision-Making Agent with Internalized Utility Judgment
by: Ye, Yining, et al.
Published: (2023)
LLMs Don't Know Their Own Decision Boundaries: The Unreliability of Self-Generated Counterfactual Explanations
by: Mayne, Harry, et al.
Published: (2025)
Legal Minds, Algorithmic Decisions: How LLMs Apply Constitutional Principles in Complex Scenarios
by: Bignotti, Camilla, et al.
Published: (2024)
InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
by: Beigi, Mohammad, et al.
Published: (2024)
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
by: Kamoi, Ryo, et al.
Published: (2024)
Crafting Interpretable Embeddings by Asking LLMs Questions
by: Benara, Vinamra, et al.
Published: (2024)
Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)
Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs
by: Hu, Nan, et al.
Published: (2024)
Hallucination Detection with the Internal Layers of LLMs
by: Preiß, Martin
Published: (2025)
Can AI Validate Science? Benchmarking LLMs for Accurate Scientific Claim $\rightarrow$ Evidence Reasoning
by: Javaji, Shashidhar Reddy, et al.
Published: (2025)
Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion
by: Vu, Tuan-Anh, et al.
Published: (2023)
Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making
by: Nguyen, Sang Quang, et al.
Published: (2025)
BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?
by: Chambon, Pierre, et al.
Published: (2025)
Probing the Lack of Stable Internal Beliefs in LLMs
by: Luo, Yifan, et al.
Published: (2026)
Mechanistic Interpretability of Cognitive Complexity in LLMs via Linear Probing using Bloom's Taxonomy
by: Raimondi, Bianca, et al.
Published: (2026)
Do Internal Layers of LLMs Reveal Patterns for Jailbreak Detection?
by: Kadali, Sri Durga Sai Sowmya, et al.
Published: (2025)
Enhancing Uncertainty Estimation in LLMs with Expectation of Aggregated Internal Belief
by: Xiao, Zeguan, et al.
Published: (2025)
Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation
by: Shen, Jiajun, et al.
Published: (2025)
The Straight and Narrow: Do LLMs Possess an Internal Moral Path?
by: Hu, Luoming, et al.
Published: (2026)
INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection
by: Chen, Chao, et al.
Published: (2024)
Talking Points: Describing and Localizing Pixels
by: Rusanovsky, Matan, et al.
Published: (2025)
Break Me If You Can: Self-Jailbreaking of Aligned LLMs via Lexical Insertion Prompting
by: Kulshreshtha, Devang, et al.
Published: (2026)
Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above
by: Balepur, Nishant, et al.
Published: (2025)
The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality
by: Newman, Benjamin, et al.
Published: (2025)
Retracing the Past: LLMs Emit Training Data When They Get Lost
by: Ko, Myeongseob, et al.
Published: (2025)
SceneGram: Conceptualizing and Describing Tangrams in Scene Context
by: Junker, Simeon, et al.
Published: (2025)