| Main authors: | Jung, Haeji; Kim, Jinju; Kim, Kyungjin; Roh, Youjeong; Mortensen, David R. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2510.10827 |
Similar items
Harnessing Linguistic Dissimilarity for Language Generalization on Unseen Low-Resource Varieties
by: Kim, Jinju, et al.
Published: (2026)
Mitigating the Linguistic Gap with Phonemic Representations for Robust Cross-lingual Transfer
by: Jung, Haeji, et al.
Published: (2024)
Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages
by: Sohn, Jimin, et al.
Published: (2024)
AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer
by: Lauc, Davor, et al.
Published: (2024)
Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages
by: Azam, Gulfarogh, et al.
Published: (2025)
Cross-Lingual IPA Contrastive Learning for Zero-Shot NER
by: Sohn, Jimin, et al.
Published: (2025)
How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
by: Yuan, Fei, et al.
Published: (2023)
NADIR: Differential Attention Flow for Non-Autoregressive Transliteration in Indic Languages
by: Tomar, Lakshya, et al.
Published: (2026)
We Can't Understand AI Using our Existing Vocabulary
by: Hewitt, John, et al.
Published: (2025)
Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
by: Butt, Umer, et al.
Published: (2025)
Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models
by: Kim, Seungduk, et al.
Published: (2024)
CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents
by: Roh, Taeyun, et al.
Published: (2026)
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
by: Namgoong, Hyuk, et al.
Published: (2024)
Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts
by: Ma, Chunlan, et al.
Published: (2024)
Mining Social Determinants of Health for Heart Failure Patient 30-Day Readmission via Large Language Model
by: Shao, Mingchen, et al.
Published: (2025)
1 Trillion Token (1TT) Platform: A Novel Framework for Efficient Data Sharing and Compensation in Large Language Models
by: Park, Chanjun, et al.
Published: (2024)
TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
by: Jung, Chanjoo, et al.
Published: (2025)
Beyond Learning: A Training-Free Alternative to Model Adaptation
by: Yoon, Namkyung, et al.
Published: (2026)
LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization
by: Jo, Daejin, et al.
Published: (2025)
Discrete Prompt Compression with Reinforcement Learning
by: Jung, Hoyoun, et al.
Published: (2023)
IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method
by: Kim, Mihyeon, et al.
Published: (2025)
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning
by: Naeem, Sameea, et al.
Published: (2022)
ReaComp: Compiling LLM Reasoning into Symbolic Solvers for Efficient Program Synthesis
by: Naik, Atharva, et al.
Published: (2026)
Exploiting the Potential of Seq2Seq Models as Robust Few-Shot Learners
by: Lee, Jihyeon, et al.
Published: (2023)
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
by: Tao, Chaofan, et al.
Published: (2024)
Natural Language Declarative Prompting (NLD-P): A Modular Governance Method for Prompt Design Under Model Drift
by: Kim, Hyunwoo, et al.
Published: (2026)
PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning
by: Jung, Min Jae, et al.
Published: (2024)
Hierarchical Retrieval with Out-Of-Vocabulary Queries: A Case Study on SNOMED CT
by: Dilworth, Jonathon, et al.
Published: (2025)
Safeguarding RAG Pipelines with GMTP: A Gradient-based Masked Token Probability Method for Poisoned Document Detection
by: Kim, San, et al.
Published: (2025)
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling
by: Shin, Haebin, et al.
Published: (2025)
Can LLMs Recognize Toxicity? A Structured Investigation Framework and Toxicity Metric
by: Koh, Hyukhun, et al.
Published: (2024)
Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction
by: Naik, Atharva, et al.
Published: (2024)
Theme-Explanation Structure for Table Summarization using Large Language Models: A Case Study on Korean Tabular Data
by: Kwack, TaeYoon, et al.
Published: (2025)
ACoRN: Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models
by: Kim, Singon, et al.
Published: (2025)
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding
by: Kim, Suyoung, et al.
Published: (2024)
UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
by: Choi, Juhwan, et al.
Published: (2024)
Carrot and Stick: Inducing Self-Motivation with Positive & Negative Feedback
by: Sohn, Jimin, et al.
Published: (2024)
Dagger Behind Smile: Fool LLMs with a Happy Ending Story
by: Song, Xurui, et al.
Published: (2025)
KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language
by: Kim, Yoonshik, et al.
Published: (2025)
An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Language Model Inference
by: Yamaguchi, Atsuki, et al.
Published: (2024)