Library Catalog

Bibliographic Details
Main Authors:	Šindelář, Pavel; Slivka, Dávid; Bouma, Christopher; Prášil, Filip; Bojar, Ondřej
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2603.28537

Similar Items

Overview of the Sensemaking Task at the ELOQUENT 2025 Lab: LLMs as Teachers, Students and Evaluators
by: Šindelář, Pavel, et al.
Published: (2025)

Finetuning LLMs for EvaCun 2025 token prediction shared task
by: Jon, Josef, et al.
Published: (2025)

Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation
by: Barančíková, Petra, et al.
Published: (2025)

Understanding the role of FFNs in driving multilingual behaviour in LLMs
by: Bhattacharya, Sunit, et al.
Published: (2024)

Quality and Quantity of Machine Translation References for Automatic Metrics
by: Zouhar, Vilém, et al.
Published: (2024)

End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs
by: Luu, Nam, et al.
Published: (2025)

Continuous Rating as Reliable Human Evaluation of Simultaneous Speech Translation
by: Javorský, Dávid, et al.
Published: (2022)

Prompting LLMs: Length Control for Isometric Machine Translation
by: Javorský, Dávid, et al.
Published: (2025)

MockConf: A Student Interpretation Dataset: Analysis, Word- and Span-level Alignment and Baselines
by: Javorský, Dávid, et al.
Published: (2025)

Long-Form End-to-End Speech Translation via Latent Alignment Segmentation
by: Polák, Peter, et al.
Published: (2023)

ParCzech4Speech: A New Speech Corpus Derived from Czech Parliamentary Data
by: Stankov, Vladislav, et al.
Published: (2025)

Multimodal Shannon Game with Images
by: Zouhar, Vilém, et al.
Published: (2023)

ChatGPT for automated grading of short answer questions in mechanical ventilation
by: Jade, Tejas, et al.
Published: (2025)

Evaluating Optimal Reference Translations
by: Zouhar, Vilém, et al.
Published: (2023)

Better Late Than Never: Meta-Evaluation of Latency Metrics for Simultaneous Speech-to-Text Translation
by: Polák, Peter, et al.
Published: (2025)

Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings
by: Čechovič, Marko, et al.
Published: (2025)

Findings of the Third Automatic Minuting (AutoMin) Challenge
by: Shinde, Kartik, et al.
Published: (2025)

How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?
by: Papi, Sara, et al.
Published: (2024)

Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams
by: Safilian, Masoud, et al.
Published: (2025)

FusionMind -- Improving question and answering with external context fusion
by: Verma, Shreyas, et al.
Published: (2023)

ConSens: Assessing context grounding in open-book question answering
by: Vankov, Ivan, et al.
Published: (2025)

Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks
by: Šmíd, Jakub, et al.
Published: (2025)

LLM Compression: How Far Can We Go in Balancing Size and Performance?
by: Sk, Sahil, et al.
Published: (2025)

Extract, Match, and Score: An Evaluation Paradigm for Long Question-context-answer Triplets in Financial Analysis
by: Hu, Bo, et al.
Published: (2025)

Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation
by: Sperber, Matthias, et al.
Published: (2024)

How effective are VLMs in assisting humans in inferring the quality of mental models from Multimodal short answers?
by: Sil, Pritam, et al.
Published: (2026)

CLIPPER: Compression enables long-context synthetic data generation
by: Pham, Chau Minh, et al.
Published: (2025)

Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records
by: Ziletti, Angelo, et al.
Published: (2024)

MEEDAV: A Synchronous Web Viewer for EEG, Eye-Tracking and Speech Data
by: Pijálek, Jan, et al.
Published: (2026)

SRS-Stories: Vocabulary-constrained multilingual story generation for language learning
by: Kamzela, Wiktor, et al.
Published: (2025)

From text to multimodal: a survey of adversarial example generation in question answering systems
by: Yigit, Gulsum, et al.
Published: (2023)

Enhancing textual textbook question answering with large language models and retrieval augmented generation
by: Alawwad, Hessa Abdulrahman, et al.
Published: (2024)

Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD
by: Pražák, Ondřej, et al.
Published: (2024)

PIAST: Rapid Prompting with In-context Augmentation for Scarce Training data
by: Batorski, Pawel, et al.
Published: (2025)

CMRAG: Co-modality-based visual document retrieval and question answering
by: Chen, Wang, et al.
Published: (2025)

What's the plan? Metrics for implicit planning in LLMs and their application to rhyme generation and question answering
by: Maar, Jim, et al.
Published: (2026)

factgenie: A Framework for Span-based Evaluation of Generated Texts
by: Kasner, Zdeněk, et al.
Published: (2024)

Can MLLMs generate human-like feedback in grading multimodal short answers?
by: Sil, Pritam, et al.
Published: (2024)

LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
by: Chen, Jianghao, et al.
Published: (2025)

TANQ: An open domain dataset of table answered questions
by: Akhtar, Mubashara, et al.
Published: (2024)