| Main Authors: | Šindelář, Pavel, Slivka, Dávid, Bouma, Christopher, Prášil, Filip, Bojar, Ondřej |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.28537 |
Similar Items
Overview of the Sensemaking Task at the ELOQUENT 2025 Lab: LLMs as Teachers, Students and Evaluators
by: Šindelář, Pavel, et al.
Published: (2025)
Finetuning LLMs for EvaCun 2025 token prediction shared task
by: Jon, Josef, et al.
Published: (2025)
Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation
by: Barančíková, Petra, et al.
Published: (2025)
Understanding the role of FFNs in driving multilingual behaviour in LLMs
by: Bhattacharya, Sunit, et al.
Published: (2024)
Quality and Quantity of Machine Translation References for Automatic Metrics
by: Zouhar, Vilém, et al.
Published: (2024)
End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs
by: Luu, Nam, et al.
Published: (2025)
Continuous Rating as Reliable Human Evaluation of Simultaneous Speech Translation
by: Javorský, Dávid, et al.
Published: (2022)
Prompting LLMs: Length Control for Isometric Machine Translation
by: Javorský, Dávid, et al.
Published: (2025)
MockConf: A Student Interpretation Dataset: Analysis, Word- and Span-level Alignment and Baselines
by: Javorský, Dávid, et al.
Published: (2025)
Long-Form End-to-End Speech Translation via Latent Alignment Segmentation
by: Polák, Peter, et al.
Published: (2023)
ParCzech4Speech: A New Speech Corpus Derived from Czech Parliamentary Data
by: Stankov, Vladislav, et al.
Published: (2025)
Multimodal Shannon Game with Images
by: Zouhar, Vilém, et al.
Published: (2023)
ChatGPT for automated grading of short answer questions in mechanical ventilation
by: Jade, Tejas, et al.
Published: (2025)
Evaluating Optimal Reference Translations
by: Zouhar, Vilém, et al.
Published: (2023)
Better Late Than Never: Meta-Evaluation of Latency Metrics for Simultaneous Speech-to-Text Translation
by: Polák, Peter, et al.
Published: (2025)
Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings
by: Čechovič, Marko, et al.
Published: (2025)
Findings of the Third Automatic Minuting (AutoMin) Challenge
by: Shinde, Kartik, et al.
Published: (2025)
How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?
by: Papi, Sara, et al.
Published: (2024)
Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams
by: Safilian, Masoud, et al.
Published: (2025)
FusionMind -- Improving question and answering with external context fusion
by: Verma, Shreyas, et al.
Published: (2023)
ConSens: Assessing context grounding in open-book question answering
by: Vankov, Ivan, et al.
Published: (2025)
Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks
by: Šmíd, Jakub, et al.
Published: (2025)
LLM Compression: How Far Can We Go in Balancing Size and Performance?
by: Sk, Sahil, et al.
Published: (2025)
Extract, Match, and Score: An Evaluation Paradigm for Long Question-context-answer Triplets in Financial Analysis
by: Hu, Bo, et al.
Published: (2025)
Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation
by: Sperber, Matthias, et al.
Published: (2024)
How effective are VLMs in assisting humans in inferring the quality of mental models from Multimodal short answers?
by: Sil, Pritam, et al.
Published: (2026)
CLIPPER: Compression enables long-context synthetic data generation
by: Pham, Chau Minh, et al.
Published: (2025)
Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records
by: Ziletti, Angelo, et al.
Published: (2024)
MEEDAV: A Synchronous Web Viewer for EEG, Eye-Tracking and Speech Data
by: Pijálek, Jan, et al.
Published: (2026)
SRS-Stories: Vocabulary-constrained multilingual story generation for language learning
by: Kamzela, Wiktor, et al.
Published: (2025)
From text to multimodal: a survey of adversarial example generation in question answering systems
by: Yigit, Gulsum, et al.
Published: (2023)
Enhancing textual textbook question answering with large language models and retrieval augmented generation
by: Alawwad, Hessa Abdulrahman, et al.
Published: (2024)
Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD
by: Pražák, Ondřej, et al.
Published: (2024)
PIAST: Rapid Prompting with In-context Augmentation for Scarce Training data
by: Batorski, Pawel, et al.
Published: (2025)
CMRAG: Co-modality-based visual document retrieval and question answering
by: Chen, Wang, et al.
Published: (2025)
What's the plan? Metrics for implicit planning in LLMs and their application to rhyme generation and question answering
by: Maar, Jim, et al.
Published: (2026)
factgenie: A Framework for Span-based Evaluation of Generated Texts
by: Kasner, Zdeněk, et al.
Published: (2024)
Can MLLMs generate human-like feedback in grading multimodal short answers?
by: Sil, Pritam, et al.
Published: (2024)
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
by: Chen, Jianghao, et al.
Published: (2025)
TANQ: An open domain dataset of table answered questions
by: Akhtar, Mubashara, et al.
Published: (2024)