Saved in:
| Main Authors: | Kim, Dongjun, Kim, Minhyuk, Chun, YongChan, Park, Chanjun, Lim, Heuiseok |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.07113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval
by: Chun, Yongchan, et al.
Published: (2025)
by: Chun, Yongchan, et al.
Published: (2025)
Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks
by: Kim, Dongjun, et al.
Published: (2025)
by: Kim, Dongjun, et al.
Published: (2025)
KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models
by: Kim, Dongjun, et al.
Published: (2025)
by: Kim, Dongjun, et al.
Published: (2025)
LANGSAE EDITING: Improving Multilingual Information Retrieval via Post-hoc Language Identity Removal
by: Kim, Dongjun, et al.
Published: (2026)
by: Kim, Dongjun, et al.
Published: (2026)
ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction
by: Park, Jeiyoon, et al.
Published: (2024)
by: Park, Jeiyoon, et al.
Published: (2024)
CharacterGPT: A Persona Reconstruction Framework for Role-Playing Agents
by: Park, Jeiyoon, et al.
Published: (2024)
by: Park, Jeiyoon, et al.
Published: (2024)
MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation
by: Park, Chanhee, et al.
Published: (2025)
by: Park, Chanhee, et al.
Published: (2025)
Understanding LLM Development Through Longitudinal Study: Insights from the Open Ko-LLM Leaderboard
by: Park, Chanjun, et al.
Published: (2024)
by: Park, Chanjun, et al.
Published: (2024)
MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents
by: Shin, Joongmin, et al.
Published: (2026)
by: Shin, Joongmin, et al.
Published: (2026)
Who speaks like a style of Vitamin: Towards Syntax-Aware DialogueSummarization using Multi-task Learning
by: Lee, Seolhwa, et al.
Published: (2021)
by: Lee, Seolhwa, et al.
Published: (2021)
Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching
by: Kim, Seoyeon, et al.
Published: (2024)
by: Kim, Seoyeon, et al.
Published: (2024)
Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline
by: Koo, Seonmin, et al.
Published: (2024)
by: Koo, Seonmin, et al.
Published: (2024)
Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse
by: Lee, Seungyoon, et al.
Published: (2024)
by: Lee, Seungyoon, et al.
Published: (2024)
CoME: An Unlearning-based Approach to Conflict-free Model Editing
by: Jung, Dahyun, et al.
Published: (2025)
by: Jung, Dahyun, et al.
Published: (2025)
FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models
by: Jung, Dahyun, et al.
Published: (2025)
by: Jung, Dahyun, et al.
Published: (2025)
From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems
by: Jang, Youngjoon, et al.
Published: (2025)
by: Jang, Youngjoon, et al.
Published: (2025)
Translation of Multifaceted Data without Re-Training of Machine Translation Systems
by: Moon, Hyeonseok, et al.
Published: (2024)
by: Moon, Hyeonseok, et al.
Published: (2024)
CLEAR: Cross-Lingual Enhancement in Alignment via Reverse-training
by: Lee, Seungyoon, et al.
Published: (2026)
by: Lee, Seungyoon, et al.
Published: (2026)
InstaTrans: An Instruction-Aware Translation Framework for Non-English Instruction Datasets
by: Kim, Yungi, et al.
Published: (2024)
by: Kim, Yungi, et al.
Published: (2024)
Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models
by: Moon, Hyeonseok, et al.
Published: (2025)
by: Moon, Hyeonseok, et al.
Published: (2025)
No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand
by: Jung, Jimin, et al.
Published: (2026)
by: Jung, Jimin, et al.
Published: (2026)
Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs
by: Kim, Hyeonwoo, et al.
Published: (2024)
by: Kim, Hyeonwoo, et al.
Published: (2024)
Model-Based Data-Centric AI: Bridging the Divide Between Academic Ideals and Industrial Pragmatism
by: Park, Chanjun, et al.
Published: (2024)
by: Park, Chanjun, et al.
Published: (2024)
TORSO: Template-Oriented Reasoning Towards General Tasks
by: Kim, Minhyuk, et al.
Published: (2025)
by: Kim, Minhyuk, et al.
Published: (2025)
Sensory-Aware Sequential Recommendation via Review-Distilled Representations
by: Yoon, Yeo Chan, et al.
Published: (2026)
by: Yoon, Yeo Chan, et al.
Published: (2026)
Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark
by: Park, Chanjun, et al.
Published: (2024)
by: Park, Chanjun, et al.
Published: (2024)
Mind the Blind Spots: A Focus-Level Evaluation Framework for LLM Reviews
by: Shin, Hyungyu, et al.
Published: (2025)
by: Shin, Hyungyu, et al.
Published: (2025)
NeedleChain: Measuring Intact Context Comprehension Capability of Large Language Models
by: Moon, Hyeonseok, et al.
Published: (2025)
by: Moon, Hyeonseok, et al.
Published: (2025)
Unveiling the Limits of Large Language Models in Inferring Pragmatic Meaning from Non-Verbal Responses
by: Eo, Sugyeong, et al.
Published: (2026)
by: Eo, Sugyeong, et al.
Published: (2026)
Evalverse: Unified and Accessible Library for Large Language Model Evaluation
by: Kim, Jihoo, et al.
Published: (2024)
by: Kim, Jihoo, et al.
Published: (2024)
Representing the Under-Represented: Cultural and Core Capability Benchmarks for Developing Thai Large Language Models
by: Kim, Dahyun, et al.
Published: (2024)
by: Kim, Dahyun, et al.
Published: (2024)
Don't Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation
by: Moon, Jiwon, et al.
Published: (2025)
by: Moon, Jiwon, et al.
Published: (2025)
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language Models
by: Park, Hyunbyung, et al.
Published: (2024)
by: Park, Hyunbyung, et al.
Published: (2024)
MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation
by: Lee, Dongjun, et al.
Published: (2024)
by: Lee, Dongjun, et al.
Published: (2024)
Analysis of Utterance Embeddings and Clustering Methods Related to Intent Induction for Task-Oriented Dialogue
by: Park, Jeiyoon, et al.
Published: (2022)
by: Park, Jeiyoon, et al.
Published: (2022)
sDPO: Don't Use Your Data All at Once
by: Kim, Dahyun, et al.
Published: (2024)
by: Kim, Dahyun, et al.
Published: (2024)
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models
by: Kim, Hyeonwoo, et al.
Published: (2024)
by: Kim, Hyeonwoo, et al.
Published: (2024)
Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
by: Kim, Geonmin, et al.
Published: (2024)
by: Kim, Geonmin, et al.
Published: (2024)
LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models
by: Kim, Yungi, et al.
Published: (2024)
by: Kim, Yungi, et al.
Published: (2024)
Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora
by: Kim, Yungi, et al.
Published: (2024)
by: Kim, Yungi, et al.
Published: (2024)
Similar Items
-
Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval
by: Chun, Yongchan, et al.
Published: (2025) -
Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks
by: Kim, Dongjun, et al.
Published: (2025) -
KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models
by: Kim, Dongjun, et al.
Published: (2025) -
LANGSAE EDITING: Improving Multilingual Information Retrieval via Post-hoc Language Identity Removal
by: Kim, Dongjun, et al.
Published: (2026) -
ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction
by: Park, Jeiyoon, et al.
Published: (2024)