:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Kostiuk, Yevhen, Enevoldsen, Kenneth, Vahlstrup, Peter Bjerregaard, Kardos, Márton, Nielbo, Kristoffer
Formato:	Preprint
Publicado:	2026
Materias:	Computation and Language
Acceso en línea:	https://arxiv.org/abs/2605.23420
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

topicwizard -- a Modern, Model-agnostic Framework for Topic Model Visualization and Interpretation
por: Kardos, Márton, et al.
Publicado: (2025)

The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
por: Enevoldsen, Kenneth, et al.
Publicado: (2024)

$S^3$ -- Semantic Signal Separation
por: Kardos, Márton, et al.
Publicado: (2024)

Improving reasoning at inference time via uncertainty minimisation
por: Legrand, Nicolas, et al.
Publicado: (2026)

One prompt is not enough: Instruction Sensitivity Undermines Embedding Model Evaluation
por: Kostiuk, Yevhen, et al.
Publicado: (2026)

Continuous sentiment scores for literary and multilingual contexts
por: Lyngbaek, Laurits, et al.
Publicado: (2025)

Is Sentiment Banana-Shaped? Exploring the Geometry and Portability of Sentiment Concept Vectors
por: Lyngbaek, Laurits, et al.
Publicado: (2026)

Dynaword: From One-shot to Continuously Developed Datasets
por: Enevoldsen, Kenneth, et al.
Publicado: (2025)

Maintaining MTEB: Towards Long Term Usability and Reproducibility of Embedding Benchmarks
por: Chung, Isaac, et al.
Publicado: (2025)

The Coverage Illusion: From Pre-retrieval Routing Failure to Post-retrieval Cascades in a Production RAG System
por: Hussain, Zafar, et al.
Publicado: (2026)

Topeax -- An Improved Clustering Topic Model with Density Peak Detection and Lexical-Semantic Term Importance
por: Kardos, Márton
Publicado: (2026)

MIEB: Massive Image Embedding Benchmark
por: Xiao, Chenghao, et al.
Publicado: (2025)

Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History
por: Kostiuk, Yevhen, et al.
Publicado: (2025)

The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching
por: Kostiuk, Yevhen, et al.
Publicado: (2025)

Good Books are Complex Matters: Gauging Complexity Profiles Across Diverse Categories of Perceived Literary Quality
por: Bizzoni, Yuri, et al.
Publicado: (2024)

Exposing Assumptions in AI Benchmarks through Cognitive Modelling
por: Rystrøm, Jonathan H., et al.
Publicado: (2024)

DANSK and DaCy 2.6.0: Domain Generalization of Danish Named Entity Recognition
por: Enevoldsen, Kenneth, et al.
Publicado: (2024)

Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
por: Nielsen, Dan Saattrup, et al.
Publicado: (2024)

Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media
por: Kristensen-McLachlan, Ross Deans, et al.
Publicado: (2024)

Are Chatbots Reliable Text Annotators? Sometimes
por: Kristensen-McLachlan, Ross Deans, et al.
Publicado: (2023)

HUME: Measuring the Human-Model Performance Gap in Text Embedding Tasks
por: Assadi, Adnan El, et al.
Publicado: (2025)

Grounding Text Embeddings in Stakeholder Associations
por: Rystrøm, Jonathan, et al.
Publicado: (2026)

Culmination phenomena across languages
por: Éva Kardos
Publicado: (2024)

PoTeC: A German Naturalistic Eye-tracking-while-reading Corpus
por: Jakobi, Deborah N., et al.
Publicado: (2024)

Cross-linguistic disagreement as a conflict of semantic alignment norms in multilingual AI~Linguistic Diversity as a Problem for Philosophy, Cognitive Science, and AI~
por: Mizumoto, Masaharu, et al.
Publicado: (2025)

Naturalistic Language-related Movie-Watching fMRI Task for Detecting Neurocognitive Decline and Disorder
por: Wang, Yuejiao, et al.
Publicado: (2025)

From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages
por: Kiulian, Artur, et al.
Publicado: (2024)

MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic Conditions
por: Chatzichristodoulou, Georgios, et al.
Publicado: (2025)

Are language models rational? The case of coherence norms and belief revision
por: Hofweber, Thomas, et al.
Publicado: (2024)

Compute-Accuracy Pareto Frontiers for Open-Source Reasoning Large Language Models
por: Prucs, Ákos, et al.
Publicado: (2025)

A Hierarchical and Attentional Analysis of Argument Structure Constructions in BERT Using Naturalistic Corpora
por: Kaipeng, Liu, et al.
Publicado: (2026)

Shifting social norms as a driving force for linguistic change: Struggles about language and gender in the German Bundestag
por: Müller-Spitzer, Carolin, et al.
Publicado: (2024)

Are aligned neural networks adversarially aligned?
por: Carlini, Nicholas, et al.
Publicado: (2023)

Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation
por: Alshemali, Safeyah Khaled, et al.
Publicado: (2024)

Evolution and compression in LLMs: On the emergence of human-aligned categorization
por: Imel, Nathaniel, et al.
Publicado: (2025)

Neural paraphrasing by automatically crawled and aligned sentence pairs
por: Globo, Achille, et al.
Publicado: (2024)

Learning or Self-aligning? Rethinking Instruction Fine-tuning
por: Ren, Mengjie, et al.
Publicado: (2024)

EgoBabyVLM: Benchmarking Cross-Modal Learning from Naturalistic Egocentric Video Data
por: Lin, Dongyan, et al.
Publicado: (2026)

Compositional preference models for aligning LMs
por: Go, Dongyoung, et al.
Publicado: (2023)

Chinese sensorimotor and embodiment norms for 3,000 lexicalized concepts
por: Chen, Jing, et al.
Publicado: (2026)