Saved in:
| Main Author: | Coronado-Blázquez, Javier |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.21613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Deterministic or probabilistic? The psychology of LLMs as random number generators
by: Coronado-Blázquez, Javier
Published: (2025)
by: Coronado-Blázquez, Javier
Published: (2025)
A NLP Approach to "Review Bombing" in Metacritic PC Videogames User Ratings
by: Coronado-Blázquez, Javier
Published: (2024)
by: Coronado-Blázquez, Javier
Published: (2024)
Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?
by: Fu, Tairan, et al.
Published: (2026)
by: Fu, Tairan, et al.
Published: (2026)
Montague semantics and modifier consistency measurement in neural language models
by: Carvalho, Danilo S., et al.
Published: (2022)
by: Carvalho, Danilo S., et al.
Published: (2022)
A statistically consistent measure of semantic uncertainty using Language Models
by: Liu, Yi
Published: (2025)
by: Liu, Yi
Published: (2025)
Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency
by: Goel, Aman, et al.
Published: (2025)
by: Goel, Aman, et al.
Published: (2025)
Lost in Sampling: Assessing Lexical Reachability in LLMs via the Word Coverage Score (WCS)
by: Awad, Samer, et al.
Published: (2026)
by: Awad, Samer, et al.
Published: (2026)
Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As
by: Avnat, Eden, et al.
Published: (2024)
by: Avnat, Eden, et al.
Published: (2024)
Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
by: Tamayo, Daniel, et al.
Published: (2025)
by: Tamayo, Daniel, et al.
Published: (2025)
Is Self-knowledge and Action Consistent or Not: Investigating Large Language Model's Personality
by: Ai, Yiming, et al.
Published: (2024)
by: Ai, Yiming, et al.
Published: (2024)
Evaluating Large Language Models with Tests of Spanish as a Foreign Language: Pass or Fail?
by: Mayor-Rocher, Marina, et al.
Published: (2024)
by: Mayor-Rocher, Marina, et al.
Published: (2024)
Establishing Vocabulary Tests as a Benchmark for Evaluating Large Language Models
by: Martínez, Gonzalo, et al.
Published: (2023)
by: Martínez, Gonzalo, et al.
Published: (2023)
Information Extraction from Electricity Invoices with General-Purpose Large Language Models
by: Gómez, Javier, et al.
Published: (2026)
by: Gómez, Javier, et al.
Published: (2026)
Empirical study of pretrained multilingual language models for zero-shot cross-lingual knowledge transfer in generation
by: Chirkova, Nadezhda, et al.
Published: (2023)
by: Chirkova, Nadezhda, et al.
Published: (2023)
Isolating authorship from content with semantic embeddings and contrastive learning
by: Huertas-Tato, Javier, et al.
Published: (2024)
by: Huertas-Tato, Javier, et al.
Published: (2024)
Knowledge prompt chaining for semantic modeling
by: Ding, Ning Pei, et al.
Published: (2025)
by: Ding, Ning Pei, et al.
Published: (2025)
Hint-enhanced In-Context Learning wakes Large Language Models up for knowledge-intensive tasks
by: Wang, Yifan, et al.
Published: (2023)
by: Wang, Yifan, et al.
Published: (2023)
Extracting books from production language models
by: Ahmed, Ahmed, et al.
Published: (2026)
by: Ahmed, Ahmed, et al.
Published: (2026)
Geo-Semantic-Parsing: AI-powered geoparsing by traversing semantic knowledge graphs
by: Nizzoli, Leonardo, et al.
Published: (2025)
by: Nizzoli, Leonardo, et al.
Published: (2025)
Neurosymbolic AI approach to Attribution in Large Language Models
by: Tilwani, Deepa, et al.
Published: (2024)
by: Tilwani, Deepa, et al.
Published: (2024)
Evaluating Large Language Models with Psychometrics
by: Li, Yuan, et al.
Published: (2024)
by: Li, Yuan, et al.
Published: (2024)
What is the best model? Application-driven Evaluation for Large Language Models
by: Lian, Shiguo, et al.
Published: (2024)
by: Lian, Shiguo, et al.
Published: (2024)
FABLES: Evaluating faithfulness and content selection in book-length summarization
by: Kim, Yekyung, et al.
Published: (2024)
by: Kim, Yekyung, et al.
Published: (2024)
Comprehensive Evaluation of Large Language Models for Topic Modeling
by: Doi, Tomoki, et al.
Published: (2024)
by: Doi, Tomoki, et al.
Published: (2024)
A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding
by: Laperrière, Gaëlle, et al.
Published: (2024)
by: Laperrière, Gaëlle, et al.
Published: (2024)
Evaluating Large Language Models as Expert Annotators
by: Tseng, Yu-Min, et al.
Published: (2025)
by: Tseng, Yu-Min, et al.
Published: (2025)
Evaluating the Deductive Competence of Large Language Models
by: Seals, Spencer M., et al.
Published: (2023)
by: Seals, Spencer M., et al.
Published: (2023)
Evaluating Large Language Models for Material Selection
by: Grandi, Daniele, et al.
Published: (2024)
by: Grandi, Daniele, et al.
Published: (2024)
Evaluating Deep Unlearning in Large Language Models
by: Wu, Ruihan, et al.
Published: (2024)
by: Wu, Ruihan, et al.
Published: (2024)
Evaluating Gender Bias in Large Language Models
by: Döll, Michael, et al.
Published: (2024)
by: Döll, Michael, et al.
Published: (2024)
Mitigating the Bias of Large Language Model Evaluation
by: Zhou, Hongli, et al.
Published: (2024)
by: Zhou, Hongli, et al.
Published: (2024)
Learning Evaluation Models from Large Language Models for Sequence Generation
by: Wang, Chenglong, et al.
Published: (2023)
by: Wang, Chenglong, et al.
Published: (2023)
Disentangling Language and Culture for Evaluating Multilingual Large Language Models
by: Ying, Jiahao, et al.
Published: (2025)
by: Ying, Jiahao, et al.
Published: (2025)
Evaluating Large Language Models for Radiology Natural Language Processing
by: Liu, Zhengliang, et al.
Published: (2023)
by: Liu, Zhengliang, et al.
Published: (2023)
Performance Evaluation of Tokenizers in Large Language Models for the Assamese Language
by: Tamang, Sagar, et al.
Published: (2024)
by: Tamang, Sagar, et al.
Published: (2024)
Pragmatic Competence Evaluation of Large Language Models for the Korean Language
by: Park, Dojun, et al.
Published: (2024)
by: Park, Dojun, et al.
Published: (2024)
Language Shapes Mental Health Evaluations in Large Language Models
by: Xu, Jiayi, et al.
Published: (2026)
by: Xu, Jiayi, et al.
Published: (2026)
A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science
by: Yang, Zonglin, et al.
Published: (2026)
by: Yang, Zonglin, et al.
Published: (2026)
Large Language Models, scientific knowledge and factuality: A framework to streamline human expert evaluation
by: Wysocka, Magdalena, et al.
Published: (2023)
by: Wysocka, Magdalena, et al.
Published: (2023)
On convexity and efficiency in semantic systems
by: Imel, Nathaniel, et al.
Published: (2026)
by: Imel, Nathaniel, et al.
Published: (2026)
Similar Items
-
Deterministic or probabilistic? The psychology of LLMs as random number generators
by: Coronado-Blázquez, Javier
Published: (2025) -
A NLP Approach to "Review Bombing" in Metacritic PC Videogames User Ratings
by: Coronado-Blázquez, Javier
Published: (2024) -
Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?
by: Fu, Tairan, et al.
Published: (2026) -
Montague semantics and modifier consistency measurement in neural language models
by: Carvalho, Danilo S., et al.
Published: (2022) -
A statistically consistent measure of semantic uncertainty using Language Models
by: Liu, Yi
Published: (2025)