:: Library Catalog

Bibliographic Details
Main Authors:	Jung, Haeji; Kim, Jinju; Kim, Kyungjin; Roh, Youjeong; Mortensen, David R.
Format:	Preprint
Published:	2025
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.10827

Similar Items

Harnessing Linguistic Dissimilarity for Language Generalization on Unseen Low-Resource Varieties
by: Kim, Jinju, et al.
Published: (2026)

Mitigating the Linguistic Gap with Phonemic Representations for Robust Cross-lingual Transfer
by: Jung, Haeji, et al.
Published: (2024)

Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages
by: Sohn, Jimin, et al.
Published: (2024)

AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer
by: Lauc, Davor, et al.
Published: (2024)

Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages
by: Azam, Gulfarogh, et al.
Published: (2025)

Cross-Lingual IPA Contrastive Learning for Zero-Shot NER
by: Sohn, Jimin, et al.
Published: (2025)

How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
by: Yuan, Fei, et al.
Published: (2023)

NADIR: Differential Attention Flow for Non-Autoregressive Transliteration in Indic Languages
by: Tomar, Lakshya, et al.
Published: (2026)

We Can't Understand AI Using our Existing Vocabulary
by: Hewitt, John, et al.
Published: (2025)

Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
by: Butt, Umer, et al.
Published: (2025)

Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models
by: Kim, Seungduk, et al.
Published: (2024)

CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents
by: Roh, Taeyun, et al.
Published: (2026)

Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
by: Namgoong, Hyuk, et al.
Published: (2024)

Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts
by: Ma, Chunlan, et al.
Published: (2024)

Mining Social Determinants of Health for Heart Failure Patient 30-Day Readmission via Large Language Model
by: Shao, Mingchen, et al.
Published: (2025)

1 Trillion Token (1TT) Platform: A Novel Framework for Efficient Data Sharing and Compensation in Large Language Models
by: Park, Chanjun, et al.
Published: (2024)

TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
by: Jung, Chanjoo, et al.
Published: (2025)

Beyond Learning: A Training-Free Alternative to Model Adaptation
by: Yoon, Namkyung, et al.
Published: (2026)

LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization
by: Jo, Daejin, et al.
Published: (2025)

Discrete Prompt Compression with Reinforcement Learning
by: Jung, Hoyoun, et al.
Published: (2023)

IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method
by: Kim, Mihyeon, et al.
Published: (2025)

Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning
by: Naeem, Sameea, et al.
Published: (2022)

ReaComp: Compiling LLM Reasoning into Symbolic Solvers for Efficient Program Synthesis
by: Naik, Atharva, et al.
Published: (2026)

Exploiting the Potential of Seq2Seq Models as Robust Few-Shot Learners
by: Lee, Jihyeon, et al.
Published: (2023)

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
by: Tao, Chaofan, et al.
Published: (2024)

Natural Language Declarative Prompting (NLD-P): A Modular Governance Method for Prompt Design Under Model Drift
by: Kim, Hyunwoo, et al.
Published: (2026)

PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning
by: Jung, Min Jae, et al.
Published: (2024)

Hierarchical Retrieval with Out-Of-Vocabulary Queries: A Case Study on SNOMED CT
by: Dilworth, Jonathon, et al.
Published: (2025)

Safeguarding RAG Pipelines with GMTP: A Gradient-based Masked Token Probability Method for Poisoned Document Detection
by: Kim, San, et al.
Published: (2025)

Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling
by: Shin, Haebin, et al.
Published: (2025)

Can LLMs Recognize Toxicity? A Structured Investigation Framework and Toxicity Metric
by: Koh, Hyukhun, et al.
Published: (2024)

Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction
by: Naik, Atharva, et al.
Published: (2024)

Theme-Explanation Structure for Table Summarization using Large Language Models: A Case Study on Korean Tabular Data
by: Kwack, TaeYoon, et al.
Published: (2025)

ACoRN: Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models
by: Kim, Singon, et al.
Published: (2025)

Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding
by: Kim, Suyoung, et al.
Published: (2024)

UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
by: Choi, Juhwan, et al.
Published: (2024)

Carrot and Stick: Inducing Self-Motivation with Positive & Negative Feedback
by: Sohn, Jimin, et al.
Published: (2024)

Dagger Behind Smile: Fool LLMs with a Happy Ending Story
by: Song, Xurui, et al.
Published: (2025)

KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language
by: Kim, Yoonshik, et al.
Published: (2025)

An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Language Model Inference
by: Yamaguchi, Atsuki, et al.
Published: (2024)