| Main Authors: | Kim, Dongseok, Choi, Hyoungsun, Rasool, Mohamed Jismy Aashik, Oh, Gisung |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.12688 |
Similar Items
CUBE: Contrastive Understanding by Balanced Experiments
by: Kim, Dongseok, et al.
Published: (2025)
Gaming and Cooperation in Federated Learning: What Can Happen and How to Monitor It
by: Kim, Dongseok, et al.
Published: (2025)
CLAPS: Aleatoric-Epistemic Scaling via Last-Layer Laplace for Conformal Regression
by: Kim, Dongseok, et al.
Published: (2025)
$ϕ$-Table: A Statistical Explanation for Global SHAP
by: Kim, Dongseok, et al.
Published: (2025)
A Ridge Too Far: Correcting Over-Shrinkage via Negative Regularization
by: Kim, Dongseok, et al.
Published: (2025)
SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models
by: Jeong, Wonjun, et al.
Published: (2025)
Interactive Prompt Debugging with Sequence Salience
by: Tenney, Ian, et al.
Published: (2024)
Expanding Foundational Language Capabilities in Open-Source LLMs through a Korean Case Study
by: Lim, Junghwan, et al.
Published: (2025)
The Amazing Agent Race: Strong Tool Users, Weak Navigators
by: Kim, Zae Myung, et al.
Published: (2026)
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
by: Sclar, Melanie, et al.
Published: (2023)
KL for a KL: On-Policy Distillation with Control Variate Baseline
by: Oh, Minjae, et al.
Published: (2026)
PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning
by: Kim, Gyeongman, et al.
Published: (2024)
Offline Learning and Forgetting for Reasoning with Large Language Models
by: Ni, Tianwei, et al.
Published: (2025)
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
by: Puerto, Haritz, et al.
Published: (2024)
Promptception: How Sensitive Are Large Multimodal Models to Prompts?
by: Ismithdeen, Mohamed Insaf, et al.
Published: (2025)
Sem-DPO: Mitigating Semantic Inconsistency in Preference Optimization for Prompt Engineering
by: Mohamed, Anas, et al.
Published: (2025)
How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities
by: Xu, Ziwen, et al.
Published: (2026)
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
by: Choi, Yumin, et al.
Published: (2025)
Unfamiliar Finetuning Examples Control How Language Models Hallucinate
by: Kang, Katie, et al.
Published: (2024)
GFlowPO: Generative Flow Network as a Language Model Prompt Optimizer
by: Cho, Junmo, et al.
Published: (2026)
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts
by: Dong, Honghua, et al.
Published: (2024)
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
by: Choi, Yunho, et al.
Published: (2026)
Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models
by: Zollo, Thomas P., et al.
Published: (2023)
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
by: Botev, Aleksandar, et al.
Published: (2024)
How Contaminated Is Your Benchmark? Quantifying Dataset Leakage in Large Language Models with Kernel Divergence
by: Choi, Hyeong Kyu, et al.
Published: (2025)
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
by: Loula, João, et al.
Published: (2025)
Tabular Transfer Learning via Prompting LLMs
by: Nam, Jaehyun, et al.
Published: (2024)
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
by: Choi, Yunseon, et al.
Published: (2024)
BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization
by: Lee, Gihun, et al.
Published: (2024)
How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse
by: Seddik, Mohamed El Amine, et al.
Published: (2024)
How Alignment Routes: Localizing, Scaling, and Controlling Policy Circuits in Language Models
by: Frank, Gregory N.
Published: (2026)
How Susceptible are LLMs to Influence in Prompts?
by: Anagnostidis, Sotiris, et al.
Published: (2024)
RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents
by: Chen, Weizhe, et al.
Published: (2024)
DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues
by: Jang, Kyochul, et al.
Published: (2025)
Prompt-Response Semantic Divergence Metrics for Faithfulness Hallucination and Misalignment Detection in Large Language Models
by: Halperin, Igor
Published: (2025)
On Prompt-Driven Safeguarding for Large Language Models
by: Zheng, Chujie, et al.
Published: (2024)
Graph Neural Prompting with Large Language Models
by: Tian, Yijun, et al.
Published: (2023)
Soft Prompting for Unlearning in Large Language Models
by: Bhaila, Karuna, et al.
Published: (2024)
Prompt Optimization Via Diffusion Language Models
by: Wang, Shiyu, et al.
Published: (2026)
Structured Prompts Improve Evaluation of Language Models
by: Aali, Asad, et al.
Published: (2025)