Saved in:
| Main Authors: | Batura, Tatiana, Bruches, Elena, Shvenk, Milana, Malykh, Valentin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.09622 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers
by: Tsanda, Alena, et al.
Published: (2024)
by: Tsanda, Alena, et al.
Published: (2024)
StRuCom: A Novel Dataset of Structured Code Comments in Russian
by: Dziuba, Maria, et al.
Published: (2025)
by: Dziuba, Maria, et al.
Published: (2025)
Low-resource Machine Translation for Code-switched Kazakh-Russian Language Pair
by: Borisov, Maksim, et al.
Published: (2025)
by: Borisov, Maksim, et al.
Published: (2025)
Data filtering methods for training language models
by: Shevchenko, Egor, et al.
Published: (2026)
by: Shevchenko, Egor, et al.
Published: (2026)
Hierarchical Embedding Fusion for Retrieval-Augmented Code Generation
by: Sorokin, Nikita, et al.
Published: (2026)
by: Sorokin, Nikita, et al.
Published: (2026)
Iterative Self-Training for Code Generation via Reinforced Re-Ranking
by: Sorokin, Nikita, et al.
Published: (2025)
by: Sorokin, Nikita, et al.
Published: (2025)
CIDRe: A Reference-Free Multi-Aspect Criterion for Code Comment Quality Measurement
by: Dziuba, Maria, et al.
Published: (2025)
by: Dziuba, Maria, et al.
Published: (2025)
SumHiS: Extractive Summarization Exploiting Hidden Structure
by: Pavel, Tikhonov, et al.
Published: (2024)
by: Pavel, Tikhonov, et al.
Published: (2024)
Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models
by: Sedykh, Ivan, et al.
Published: (2026)
by: Sedykh, Ivan, et al.
Published: (2026)
Detection of Fake Generated Scientific Abstracts
by: Theocharopoulos, Panagiotis C., et al.
Published: (2023)
by: Theocharopoulos, Panagiotis C., et al.
Published: (2023)
AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training
by: Thelen, Christian Rene, et al.
Published: (2025)
by: Thelen, Christian Rene, et al.
Published: (2025)
SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes
by: Vázquez, Raúl, et al.
Published: (2025)
by: Vázquez, Raúl, et al.
Published: (2025)
SemEval-2025 Task 9: The Food Hazard Detection Challenge
by: Randl, Korbinian, et al.
Published: (2025)
by: Randl, Korbinian, et al.
Published: (2025)
MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks
by: Chervyakov, Artem, et al.
Published: (2025)
by: Chervyakov, Artem, et al.
Published: (2025)
BUSTED at AraGenEval Shared Task: A Comparative Study of Transformer-Based Models for Arabic AI-Generated Text Detection
by: Zain, Ali, et al.
Published: (2025)
by: Zain, Ali, et al.
Published: (2025)
CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search
by: Tikhonov, Anton, et al.
Published: (2023)
by: Tikhonov, Anton, et al.
Published: (2023)
SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)
by: Muhammad, Shamsuddeen Hassan, et al.
Published: (2025)
HausaNLP at SemEval-2025 Task 11: Hausa Text Emotion Detection
by: Sani, Sani Abdullahi, et al.
Published: (2025)
by: Sani, Sani Abdullahi, et al.
Published: (2025)
Team Anotheroption at SemEval-2025 Task 8: Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA
by: Evkarpidi, Nikolas, et al.
Published: (2025)
by: Evkarpidi, Nikolas, et al.
Published: (2025)
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets
by: Sedykh, Ivan, et al.
Published: (2023)
by: Sedykh, Ivan, et al.
Published: (2023)
ATLANTIS at SemEval-2025 Task 3: Detecting Hallucinated Text Spans in Question Answering
by: Kobus, Catherine, et al.
Published: (2025)
by: Kobus, Catherine, et al.
Published: (2025)
GATech at AbjadGenEval Shared Task: Multilingual Embeddings for Arabic Machine-Generated Text Classification
by: Khamis, Ahmed Khaled
Published: (2026)
by: Khamis, Ahmed Khaled
Published: (2026)
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
by: Mickus, Timothee, et al.
Published: (2024)
by: Mickus, Timothee, et al.
Published: (2024)
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
by: Xue, Jieying, et al.
Published: (2025)
by: Xue, Jieying, et al.
Published: (2025)
AIpom at SemEval-2024 Task 8: Detecting AI-produced Outputs in M4
by: Shirnin, Alexander, et al.
Published: (2024)
by: Shirnin, Alexander, et al.
Published: (2024)
DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation
by: Ning, Xinyu, et al.
Published: (2024)
by: Ning, Xinyu, et al.
Published: (2024)
Overview of the 2024 ALTA Shared Task: Detect Automatic AI-Generated Sentences for Human-AI Hybrid Articles
by: Mollá, Diego, et al.
Published: (2024)
by: Mollá, Diego, et al.
Published: (2024)
Team A at SemEval-2025 Task 11: Breaking Language Barriers in Emotion Detection with Multilingual Models
by: Sahil, P Sam, et al.
Published: (2025)
by: Sahil, P Sam, et al.
Published: (2025)
M-DAIGT: A Shared Task on Multi-Domain Detection of AI-Generated Text
by: Lamsiyah, Salima, et al.
Published: (2025)
by: Lamsiyah, Salima, et al.
Published: (2025)
NUTSHELL: A Dataset for Abstract Generation from Scientific Talks
by: Züfle, Maike, et al.
Published: (2025)
by: Züfle, Maike, et al.
Published: (2025)
SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
by: Wang, Yuxia, et al.
Published: (2024)
by: Wang, Yuxia, et al.
Published: (2024)
UoB-NLP at SemEval-2025 Task 11: Leveraging Adapters for Multilingual and Cross-Lingual Emotion Detection
by: De Leon, Frances Laureano, et al.
Published: (2025)
by: De Leon, Frances Laureano, et al.
Published: (2025)
UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output
by: Huang, Sicong, et al.
Published: (2025)
by: Huang, Sicong, et al.
Published: (2025)
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks
by: Adamenko, Pavel, et al.
Published: (2025)
by: Adamenko, Pavel, et al.
Published: (2025)
ArxEval: Evaluating Retrieval and Generation in Language Models for Scientific Literature
by: Sinha, Aarush, et al.
Published: (2025)
by: Sinha, Aarush, et al.
Published: (2025)
Findings of the BEA 2025 Shared Task on Pedagogical Ability Assessment of AI-powered Tutors
by: Kochmar, Ekaterina, et al.
Published: (2025)
by: Kochmar, Ekaterina, et al.
Published: (2025)
LTG at SemEval-2025 Task 10: Optimizing Context for Classification of Narrative Roles
by: Rønningstad, Egil, et al.
Published: (2025)
by: Rønningstad, Egil, et al.
Published: (2025)
Cleaning English Abstracts of Scientific Publications
by: Rose, Michael E., et al.
Published: (2025)
by: Rose, Michael E., et al.
Published: (2025)
REFIND at SemEval-2025 Task 3: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models
by: Lee, DongGeon, et al.
Published: (2025)
by: Lee, DongGeon, et al.
Published: (2025)
keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection
by: Vemula, Saketh Reddy, et al.
Published: (2025)
by: Vemula, Saketh Reddy, et al.
Published: (2025)
Similar Items
-
Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers
by: Tsanda, Alena, et al.
Published: (2024) -
StRuCom: A Novel Dataset of Structured Code Comments in Russian
by: Dziuba, Maria, et al.
Published: (2025) -
Low-resource Machine Translation for Code-switched Kazakh-Russian Language Pair
by: Borisov, Maksim, et al.
Published: (2025) -
Data filtering methods for training language models
by: Shevchenko, Egor, et al.
Published: (2026) -
Hierarchical Embedding Fusion for Retrieval-Augmented Code Generation
by: Sorokin, Nikita, et al.
Published: (2026)