:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Vintar, Špela, Pungeršek, Taja Kuzman, Brglez, Mojca, Ljubešić, Nikola
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Computation and Language Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2510.24450
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

From Polyester Girlfriends to Blind Mice: Creating the First Pragmatics Understanding Benchmarks for Slovene
di: Brglez, Mojca, et al.
Pubblicazione: (2025)

Supercharging Agenda Setting Research: The ParlaCAP Dataset of 28 European Parliaments and a Scalable Multilingual LLM-Based Classification
di: Pungeršek, Taja Kuzman, et al.
Pubblicazione: (2026)

State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?
di: Pungeršek, Taja Kuzman, et al.
Pubblicazione: (2025)

LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification
di: Kuzman, Taja, et al.
Pubblicazione: (2024)

ParlaSpeech 3.0: Richly Annotated Spoken Parliamentary Corpora of Croatian, Czech, Polish, and Serbian
di: Ljubešić, Nikola, et al.
Pubblicazione: (2025)

The Growing Gains and Pains of Iterative Web Corpora Crawling: Insights from South Slavic CLASSLA-web 2.0 Corpora
di: Pungeršek, Taja Kuzman, et al.
Pubblicazione: (2026)

CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation
di: Ljubešić, Nikola, et al.
Pubblicazione: (2024)

CLASSLA-Express: a Train of CLARIN.SI Workshops on Language Resources and Tools with Easily Expanding Route
di: Ljubešić, Nikola, et al.
Pubblicazione: (2024)

Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining
di: Ljubešić, Nikola, et al.
Pubblicazione: (2024)

The truth is no diaper: Human and AI-generated associations to emotional words
di: Vintar, Špela, et al.
Pubblicazione: (2025)

Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages
di: van Noord, Rik, et al.
Pubblicazione: (2024)

Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts
di: Chen, Xiangnan, et al.
Pubblicazione: (2025)

A Computational Analysis of the Dehumanisation of Migrants from Syria and Ukraine in Slovene News Media
di: Caporusso, Jaya, et al.
Pubblicazione: (2024)

ErrorMap and ErrorAtlas: Charting the Failure Landscape of Large Language Models
di: Ashury-Tahan, Shir, et al.
Pubblicazione: (2026)

Towards Best Practices for Open Datasets for LLM Training
di: Baack, Stefan, et al.
Pubblicazione: (2025)

ChartCards: A Chart-Metadata Generation Framework for Multi-Task Chart Understanding
di: Wu, Yifan, et al.
Pubblicazione: (2025)

A Taxonomy of Prompt Defects in LLM Systems
di: Tian, Haoye, et al.
Pubblicazione: (2025)

Enhancing Retrieval-Augmented Generation: A Study of Best Practices
di: Li, Siran, et al.
Pubblicazione: (2025)

COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
di: Guldimann, Philipp, et al.
Pubblicazione: (2024)

Automated Benchmark Generation from Domain Guidelines Informed by Bloom's Taxonomy
di: Chen, Si, et al.
Pubblicazione: (2026)

Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
di: Cui, Tianyu, et al.
Pubblicazione: (2024)

Overhearing LLM Agents: A Survey, Taxonomy, and Roadmap
di: Zhu, Andrew, et al.
Pubblicazione: (2025)

Affective Polarization across European Parliaments
di: Evkoski, Bojan, et al.
Pubblicazione: (2025)

Maintaining Journalistic Integrity in the Digital Age: A Comprehensive NLP Framework for Evaluating Online News Content
di: Bojic, Ljubisa, et al.
Pubblicazione: (2024)

Beyond SELECT: A Comprehensive Taxonomy-Guided Benchmark for Real-World Text-to-SQL Translation
di: Wang, Hao, et al.
Pubblicazione: (2025)

Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings
di: Yang, Shujian, et al.
Pubblicazione: (2025)

CHARTOM: A Visual Theory-of-Mind Benchmark for LLMs on Misleading Charts
di: Bharti, Shubham, et al.
Pubblicazione: (2024)

Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering
di: Chen, Zixin, et al.
Pubblicazione: (2025)

LELA: An End-to-end LLM-based Entity Linking Framework with Zero-shot Domain Adaptation
di: Haffoudhi, Samy, et al.
Pubblicazione: (2026)

Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation
di: Wu, Zhanglin, et al.
Pubblicazione: (2025)

ChartCitor: Multi-Agent Framework for Fine-Grained Chart Visual Attribution
di: Goswami, Kanika, et al.
Pubblicazione: (2025)

Consistent and Distinctive: LLM Benchmark Efficiency via Maximum Independent Set Prompt Selection on Similarity Graphs
di: Kjorvezir, Denica, et al.
Pubblicazione: (2026)

RAGTurk: Best Practices for Retrieval Augmented Generation in Turkish
di: Köse, Süha Kağan, et al.
Pubblicazione: (2026)

WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts
di: Foroutan, Negar, et al.
Pubblicazione: (2025)

LiveFact: A Dynamic, Time-Aware Benchmark for LLM-Driven Fake News Detection
di: Xu, Cheng, et al.
Pubblicazione: (2026)

ChartEditBench: Evaluating Grounded Multi-Turn Chart Editing in Multimodal Language Models
di: Kapadnis, Manav Nitin, et al.
Pubblicazione: (2026)

The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings
di: Mochtak, Michal, et al.
Pubblicazione: (2023)

A Geometric Taxonomy of Hallucinations in LLMs
di: Marín, Javier
Pubblicazione: (2026)

$C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
di: Koh, Woosung, et al.
Pubblicazione: (2024)

LLM-Generated Negative News Headlines Dataset: Creation and Benchmarking Against Real Journalism
di: Babalola, Olusola, et al.
Pubblicazione: (2025)