:: Library Catalog

Bibliographic Details
Main authors:	Lee, Seanie; Park, Sangwoo; Choi, Yumin; Kim, Gyeongman; Kang, Minki; Yun, Jihun; Park, Dongmin; Park, Jongho; Hwang, Sung Ju
Type:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online access:	https://arxiv.org/abs/2601.23143

Similar items

PREPING: Building Agent Memory without Tasks
by: Choi, Yumin, et al.
Published: (2026)

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
by: Lee, Hyomin, et al.
Published: (2026)

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR
by: Lee, Chanuk, et al.
Published: (2026)

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs
by: Park, Sangwoo, et al.
Published: (2026)

FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates
by: Park, Sangwoo, et al.
Published: (2025)

Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess
by: Hwang, Dongyoon, et al.
Published: (2025)

Distilling LLM Agent into Small Models with Retrieval and Code Tools
by: Kang, Minki, et al.
Published: (2025)

Lookahead Unmasking Elicits Accurate Decoding in Diffusion Language Models
by: Lee, Sanghyun, et al.
Published: (2025)

HoliSafe: Holistic Safety Benchmarking and Modeling for Vision-Language Model
by: Lee, Youngwan, et al.
Published: (2025)

Effective Test-Time Scaling of Discrete Diffusion through Iterative Refinement
by: Lee, Sanghyun, et al.
Published: (2025)

FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
by: Lee, Seanie, et al.
Published: (2025)

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
by: Baek, Jinheon, et al.
Published: (2026)

Rethinking Reward Models for Multi-Domain Test-Time Scaling
by: Lee, Dong Bok, et al.
Published: (2025)

Reasoning Structure Matters for Safety Alignment of Reasoning Models
by: In, Yeonjun, et al.
Published: (2026)

SAGE: Shaping Anchors for Guided Exploration in RLVR of LLMs
by: Lee, Chanuk, et al.
Published: (2026)

Chain of Retrieval: Multi-Aspect Iterative Search Expansion and Post-Order Search Aggregation for Full Paper Retrieval
by: Park, Sangwoo, et al.
Published: (2025)

R1-ACT: Efficient Reasoning Model Safety Alignment by Activating Safety Knowledge
by: In, Yeonjun, et al.
Published: (2025)

Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models
by: Kang, Minki, et al.
Published: (2024)

Test-time Alignment of Diffusion Models without Reward Over-optimization
by: Kim, Sunwoo, et al.
Published: (2025)

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models
by: Lee, Seanie, et al.
Published: (2025)

VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning?
by: Kim, Minkyu, et al.
Published: (2026)

Pruning and Distilling Mixture-of-Experts into Dense Language Models
by: Kim, Junhyuck, et al.
Published: (2026)

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents
by: Yeo, Woongyeng, et al.
Published: (2026)

System Prompt Optimization with Meta-Learning
by: Choi, Yumin, et al.
Published: (2025)

HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
by: Lee, Seanie, et al.
Published: (2024)

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
by: Choi, Yumin, et al.
Published: (2025)

Set-based Meta-Interpolation for Few-Task Meta-Learning
by: Lee, Seanie, et al.
Published: (2022)

Multi-View Node Pruning for Accurate Graph Representation
by: Kim, Hanjin, et al.
Published: (2025)

CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
by: Choi, Donghee, et al.
Published: (2024)

Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning
by: Lee, Gisang, et al.
Published: (2024)

Beyond RLHF: A Unified Theoretical Framework of Alignment
by: Yun, Jihun, et al.
Published: (2025)

Active Learning for Continual Learning: Keeping the Past Alive in the Present
by: Park, Jaehyun, et al.
Published: (2025)

CompoDistill: Attention Distillation for Compositional Reasoning in Multimodal LLMs
by: Kim, Jiwan, et al.
Published: (2025)

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents
by: Kim, Kangsan, et al.
Published: (2026)

KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis
by: Lee, Youngwan, et al.
Published: (2023)

A Unified Framework for Motion Reasoning and Generation in Human Interaction
by: Park, Jeongeun, et al.
Published: (2024)

When Model Meets New Normals: Test-time Adaptation for Unsupervised Time-series Anomaly Detection
by: Kim, Dongmin, et al.
Published: (2023)

Beyond Attack Success Rate: Temporal Logit Observability for LLM Safety Failures
by: Park, Junyoung, et al.
Published: (2026)

Raon-Speech Technical Report
by: Kim, Beomsoo, et al.
Published: (2026)

VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
by: Kim, Kangsan, et al.
Published: (2024)