Saved in:
| Main Authors: | Sanchez-Bayona, Elisa, Agerri, Rodrigo |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.07053 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Metaphor and Large Language Models: When Surface Features Matter More than Deep Understanding
by: Sanchez-Bayona, Elisa, et al.
Published: (2025)
by: Sanchez-Bayona, Elisa, et al.
Published: (2025)
Crosslingual Reasoning through Test-Time Scaling
by: Yong, Zheng-Xin, et al.
Published: (2025)
by: Yong, Zheng-Xin, et al.
Published: (2025)
NepTam: A Nepali-Tamang Parallel Corpus and Baseline Machine Translation Experiments
by: Ghimire, Rupak Raj, et al.
Published: (2026)
by: Ghimire, Rupak Raj, et al.
Published: (2026)
Interpretability of the Intent Detection Problem: A New Approach
by: Sanchez-Karhunen, Eduardo, et al.
Published: (2026)
by: Sanchez-Karhunen, Eduardo, et al.
Published: (2026)
VietMix: A Naturally-Occurring Parallel Corpus and Augmentation Framework for Vietnamese-English Code-Mixed Machine Translation
by: Tran, Hieu, et al.
Published: (2025)
by: Tran, Hieu, et al.
Published: (2025)
Language Independent Stance Detection: Social Interaction-based Embeddings and Large Language Models
by: de Landa, Joseba Fernandez, et al.
Published: (2022)
by: de Landa, Joseba Fernandez, et al.
Published: (2022)
NSINA: A News Corpus for Sinhala
by: Hettiarachchi, Hansi, et al.
Published: (2024)
by: Hettiarachchi, Hansi, et al.
Published: (2024)
MultiMind at SemEval-2025 Task 7: Crosslingual Fact-Checked Claim Retrieval via Multi-Source Alignment
by: Abootorabi, Mohammad Mahdi, et al.
Published: (2025)
by: Abootorabi, Mohammad Mahdi, et al.
Published: (2025)
Health Insurance Coverage Rule Interpretation Corpus: Law, Policy, and Medical Guidance for Health Insurance Coverage Understanding
by: Gartner, Mike
Published: (2025)
by: Gartner, Mike
Published: (2025)
HLDC: Hindi Legal Documents Corpus
by: Kapoor, Arnav, et al.
Published: (2022)
by: Kapoor, Arnav, et al.
Published: (2022)
AlbNews: A Corpus of Headlines for Topic Modeling in Albanian
by: Çano, Erion, et al.
Published: (2024)
by: Çano, Erion, et al.
Published: (2024)
Towards Scalable Meta-Learning of near-optimal Interpretable Models via Synthetic Model Generations
by: Myint, Kyaw Hpone, et al.
Published: (2025)
by: Myint, Kyaw Hpone, et al.
Published: (2025)
MathPile: A Billion-Token-Scale Pretraining Corpus for Math
by: Wang, Zengzhi, et al.
Published: (2023)
by: Wang, Zengzhi, et al.
Published: (2023)
WorldSpeech: A Multilingual Speech Corpus from Around the World
by: Asonitis, Antonis, et al.
Published: (2026)
by: Asonitis, Antonis, et al.
Published: (2026)
Interpretable Predictability-Based AI Text Detection: A Replication Study
by: Skurla, Adam, et al.
Published: (2026)
by: Skurla, Adam, et al.
Published: (2026)
Cross-lingual Named Entity Corpus for Slavic Languages
by: Piskorski, Jakub, et al.
Published: (2024)
by: Piskorski, Jakub, et al.
Published: (2024)
Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain
by: García-Ferrero, Iker, et al.
Published: (2024)
by: García-Ferrero, Iker, et al.
Published: (2024)
Argument Mining in Data Scarce Settings: Cross-lingual Transfer and Few-shot Techniques
by: Yeginbergen, Anar, et al.
Published: (2024)
by: Yeginbergen, Anar, et al.
Published: (2024)
Dynamic Knowledge Integration for Evidence-Driven Counter-Argument Generation with Large Language Models
by: Yeginbergen, Anar, et al.
Published: (2025)
by: Yeginbergen, Anar, et al.
Published: (2025)
Multilingual Medical Reasoning for Question Answering with Large Language Models
by: Ferrazzi, Pietro, et al.
Published: (2025)
by: Ferrazzi, Pietro, et al.
Published: (2025)
Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair
by: Sakai, Yusuke, et al.
Published: (2024)
by: Sakai, Yusuke, et al.
Published: (2024)
Retrieve, Then Classify: Corpus-Grounded Automation of Clinical Value Set Authoring
by: Mukherjee, Sumit, et al.
Published: (2026)
by: Mukherjee, Sumit, et al.
Published: (2026)
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection
by: Hu, Mengya, et al.
Published: (2024)
by: Hu, Mengya, et al.
Published: (2024)
IPAD: Inverse Prompt for AI Detection - A Robust and Interpretable LLM-Generated Text Detector
by: Chen, Zheng, et al.
Published: (2025)
by: Chen, Zheng, et al.
Published: (2025)
Optimizing Large Language Models for Turkish: New Methodologies in Corpus Selection and Training
by: Kesgin, H. Toprak, et al.
Published: (2024)
by: Kesgin, H. Toprak, et al.
Published: (2024)
Detecting Suicidal Ideation in Text with Interpretable Deep Learning: A CNN-BiGRU with Attention Mechanism
by: Bhuiyan, Mohaiminul Islam, et al.
Published: (2025)
by: Bhuiyan, Mohaiminul Islam, et al.
Published: (2025)
PMOA-TTS: Introducing the PubMed Open Access Textual Times Series Corpus
by: Noroozizadeh, Shahriar, et al.
Published: (2025)
by: Noroozizadeh, Shahriar, et al.
Published: (2025)
Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales
by: Nirmal, Ayushi, et al.
Published: (2024)
by: Nirmal, Ayushi, et al.
Published: (2024)
UQA: Corpus for Urdu Question Answering
by: Arif, Samee, et al.
Published: (2024)
by: Arif, Samee, et al.
Published: (2024)
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation
by: Ziegler, Ingo, et al.
Published: (2024)
by: Ziegler, Ingo, et al.
Published: (2024)
Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning
by: Rahman, Md Mushfiqur, et al.
Published: (2024)
by: Rahman, Md Mushfiqur, et al.
Published: (2024)
Metaphors are a Source of Cross-Domain Misalignment of Large Reasoning Models
by: Hu, Zhibo, et al.
Published: (2026)
by: Hu, Zhibo, et al.
Published: (2026)
MIB: A Mechanistic Interpretability Benchmark
by: Mueller, Aaron, et al.
Published: (2025)
by: Mueller, Aaron, et al.
Published: (2025)
Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
by: Patil, Abhinav, et al.
Published: (2024)
by: Patil, Abhinav, et al.
Published: (2024)
MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into $LaTeX$ Formulas for Improved Readability
by: Jung, Kyudan, et al.
Published: (2024)
by: Jung, Kyudan, et al.
Published: (2024)
A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus
by: Poesina, Eduard, et al.
Published: (2024)
by: Poesina, Eduard, et al.
Published: (2024)
Communication Compression for Tensor Parallel LLM Inference
by: Hansen-Palmus, Jan, et al.
Published: (2024)
by: Hansen-Palmus, Jan, et al.
Published: (2024)
Exploring and Improving Drafts in Blockwise Parallel Decoding
by: Kim, Taehyeon, et al.
Published: (2024)
by: Kim, Taehyeon, et al.
Published: (2024)
Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
by: Chuang, Yun-Yen, et al.
Published: (2024)
by: Chuang, Yun-Yen, et al.
Published: (2024)
MetaScale: Test-Time Scaling with Evolving Meta-Thoughts
by: Liu, Qin, et al.
Published: (2025)
by: Liu, Qin, et al.
Published: (2025)
Similar Items
-
Metaphor and Large Language Models: When Surface Features Matter More than Deep Understanding
by: Sanchez-Bayona, Elisa, et al.
Published: (2025) -
Crosslingual Reasoning through Test-Time Scaling
by: Yong, Zheng-Xin, et al.
Published: (2025) -
NepTam: A Nepali-Tamang Parallel Corpus and Baseline Machine Translation Experiments
by: Ghimire, Rupak Raj, et al.
Published: (2026) -
Interpretability of the Intent Detection Problem: A New Approach
by: Sanchez-Karhunen, Eduardo, et al.
Published: (2026) -
VietMix: A Naturally-Occurring Parallel Corpus and Augmentation Framework for Vietnamese-English Code-Mixed Machine Translation
by: Tran, Hieu, et al.
Published: (2025)