Guardado en:
| Autores principales: | Kostiuk, Yevhen, Enevoldsen, Kenneth, Vahlstrup, Peter Bjerregaard, Kardos, Márton, Nielbo, Kristoffer |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2605.23420 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
topicwizard -- a Modern, Model-agnostic Framework for Topic Model Visualization and Interpretation
por: Kardos, Márton, et al.
Publicado: (2025)
por: Kardos, Márton, et al.
Publicado: (2025)
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
por: Enevoldsen, Kenneth, et al.
Publicado: (2024)
por: Enevoldsen, Kenneth, et al.
Publicado: (2024)
$S^3$ -- Semantic Signal Separation
por: Kardos, Márton, et al.
Publicado: (2024)
por: Kardos, Márton, et al.
Publicado: (2024)
Improving reasoning at inference time via uncertainty minimisation
por: Legrand, Nicolas, et al.
Publicado: (2026)
por: Legrand, Nicolas, et al.
Publicado: (2026)
One prompt is not enough: Instruction Sensitivity Undermines Embedding Model Evaluation
por: Kostiuk, Yevhen, et al.
Publicado: (2026)
por: Kostiuk, Yevhen, et al.
Publicado: (2026)
Continuous sentiment scores for literary and multilingual contexts
por: Lyngbaek, Laurits, et al.
Publicado: (2025)
por: Lyngbaek, Laurits, et al.
Publicado: (2025)
Is Sentiment Banana-Shaped? Exploring the Geometry and Portability of Sentiment Concept Vectors
por: Lyngbaek, Laurits, et al.
Publicado: (2026)
por: Lyngbaek, Laurits, et al.
Publicado: (2026)
Dynaword: From One-shot to Continuously Developed Datasets
por: Enevoldsen, Kenneth, et al.
Publicado: (2025)
por: Enevoldsen, Kenneth, et al.
Publicado: (2025)
Maintaining MTEB: Towards Long Term Usability and Reproducibility of Embedding Benchmarks
por: Chung, Isaac, et al.
Publicado: (2025)
por: Chung, Isaac, et al.
Publicado: (2025)
The Coverage Illusion: From Pre-retrieval Routing Failure to Post-retrieval Cascades in a Production RAG System
por: Hussain, Zafar, et al.
Publicado: (2026)
por: Hussain, Zafar, et al.
Publicado: (2026)
Topeax -- An Improved Clustering Topic Model with Density Peak Detection and Lexical-Semantic Term Importance
por: Kardos, Márton
Publicado: (2026)
por: Kardos, Márton
Publicado: (2026)
MIEB: Massive Image Embedding Benchmark
por: Xiao, Chenghao, et al.
Publicado: (2025)
por: Xiao, Chenghao, et al.
Publicado: (2025)
Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History
por: Kostiuk, Yevhen, et al.
Publicado: (2025)
por: Kostiuk, Yevhen, et al.
Publicado: (2025)
The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching
por: Kostiuk, Yevhen, et al.
Publicado: (2025)
por: Kostiuk, Yevhen, et al.
Publicado: (2025)
Good Books are Complex Matters: Gauging Complexity Profiles Across Diverse Categories of Perceived Literary Quality
por: Bizzoni, Yuri, et al.
Publicado: (2024)
por: Bizzoni, Yuri, et al.
Publicado: (2024)
Exposing Assumptions in AI Benchmarks through Cognitive Modelling
por: Rystrøm, Jonathan H., et al.
Publicado: (2024)
por: Rystrøm, Jonathan H., et al.
Publicado: (2024)
DANSK and DaCy 2.6.0: Domain Generalization of Danish Named Entity Recognition
por: Enevoldsen, Kenneth, et al.
Publicado: (2024)
por: Enevoldsen, Kenneth, et al.
Publicado: (2024)
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
por: Nielsen, Dan Saattrup, et al.
Publicado: (2024)
por: Nielsen, Dan Saattrup, et al.
Publicado: (2024)
Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media
por: Kristensen-McLachlan, Ross Deans, et al.
Publicado: (2024)
por: Kristensen-McLachlan, Ross Deans, et al.
Publicado: (2024)
Are Chatbots Reliable Text Annotators? Sometimes
por: Kristensen-McLachlan, Ross Deans, et al.
Publicado: (2023)
por: Kristensen-McLachlan, Ross Deans, et al.
Publicado: (2023)
HUME: Measuring the Human-Model Performance Gap in Text Embedding Tasks
por: Assadi, Adnan El, et al.
Publicado: (2025)
por: Assadi, Adnan El, et al.
Publicado: (2025)
Grounding Text Embeddings in Stakeholder Associations
por: Rystrøm, Jonathan, et al.
Publicado: (2026)
por: Rystrøm, Jonathan, et al.
Publicado: (2026)
Culmination phenomena across languages
por: Éva Kardos
Publicado: (2024)
por: Éva Kardos
Publicado: (2024)
PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus
por: Jakobi, Deborah N., et al.
Publicado: (2024)
por: Jakobi, Deborah N., et al.
Publicado: (2024)
Cross-linguistic disagreement as a conflict of semantic alignment norms in multilingual AI~Linguistic Diversity as a Problem for Philosophy, Cognitive Science, and AI~
por: Mizumoto, Masaharu, et al.
Publicado: (2025)
por: Mizumoto, Masaharu, et al.
Publicado: (2025)
Naturalistic Language-related Movie-Watching fMRI Task for Detecting Neurocognitive Decline and Disorder
por: Wang, Yuejiao, et al.
Publicado: (2025)
por: Wang, Yuejiao, et al.
Publicado: (2025)
From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages
por: Kiulian, Artur, et al.
Publicado: (2024)
por: Kiulian, Artur, et al.
Publicado: (2024)
MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic Conditions
por: Chatzichristodoulou, Georgios, et al.
Publicado: (2025)
por: Chatzichristodoulou, Georgios, et al.
Publicado: (2025)
Are language models rational? The case of coherence norms and belief revision
por: Hofweber, Thomas, et al.
Publicado: (2024)
por: Hofweber, Thomas, et al.
Publicado: (2024)
Compute-Accuracy Pareto Frontiers for Open-Source Reasoning Large Language Models
por: Prucs, Ákos, et al.
Publicado: (2025)
por: Prucs, Ákos, et al.
Publicado: (2025)
A Hierarchical and Attentional Analysis of Argument Structure Constructions in BERT Using Naturalistic Corpora
por: Kaipeng, Liu, et al.
Publicado: (2026)
por: Kaipeng, Liu, et al.
Publicado: (2026)
Shifting social norms as a driving force for linguistic change: Struggles about language and gender in the German Bundestag
por: Müller-Spitzer, Carolin, et al.
Publicado: (2024)
por: Müller-Spitzer, Carolin, et al.
Publicado: (2024)
Are aligned neural networks adversarially aligned?
por: Carlini, Nicholas, et al.
Publicado: (2023)
por: Carlini, Nicholas, et al.
Publicado: (2023)
Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation
por: Alshemali, Safeyah Khaled, et al.
Publicado: (2024)
por: Alshemali, Safeyah Khaled, et al.
Publicado: (2024)
Evolution and compression in LLMs: On the emergence of human-aligned categorization
por: Imel, Nathaniel, et al.
Publicado: (2025)
por: Imel, Nathaniel, et al.
Publicado: (2025)
Neural paraphrasing by automatically crawled and aligned sentence pairs
por: Globo, Achille, et al.
Publicado: (2024)
por: Globo, Achille, et al.
Publicado: (2024)
Learning or Self-aligning? Rethinking Instruction Fine-tuning
por: Ren, Mengjie, et al.
Publicado: (2024)
por: Ren, Mengjie, et al.
Publicado: (2024)
EgoBabyVLM: Benchmarking Cross-Modal Learning from Naturalistic Egocentric Video Data
por: Lin, Dongyan, et al.
Publicado: (2026)
por: Lin, Dongyan, et al.
Publicado: (2026)
Compositional preference models for aligning LMs
por: Go, Dongyoung, et al.
Publicado: (2023)
por: Go, Dongyoung, et al.
Publicado: (2023)
Chinese sensorimotor and embodiment norms for 3,000 lexicalized concepts
por: Chen, Jing, et al.
Publicado: (2026)
por: Chen, Jing, et al.
Publicado: (2026)
Ejemplares similares
-
topicwizard -- a Modern, Model-agnostic Framework for Topic Model Visualization and Interpretation
por: Kardos, Márton, et al.
Publicado: (2025) -
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
por: Enevoldsen, Kenneth, et al.
Publicado: (2024) -
$S^3$ -- Semantic Signal Separation
por: Kardos, Márton, et al.
Publicado: (2024) -
Improving reasoning at inference time via uncertainty minimisation
por: Legrand, Nicolas, et al.
Publicado: (2026) -
One prompt is not enough: Instruction Sensitivity Undermines Embedding Model Evaluation
por: Kostiuk, Yevhen, et al.
Publicado: (2026)