Saved in:
| Main Authors: | Cho, Geonwoo, Im, Jaegyun, Lee, Jihwan, Yi, Hojun, Kim, Sejin, Kim, Sundong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.19997 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Causal-Paced Deep Reinforcement Learning
by: Cho, Geonwoo, et al.
Published: (2025)
by: Cho, Geonwoo, et al.
Published: (2025)
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
by: Cho, Geonwoo, et al.
Published: (2025)
by: Cho, Geonwoo, et al.
Published: (2025)
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning
by: Lee, Hosung, et al.
Published: (2024)
by: Lee, Hosung, et al.
Published: (2024)
Adversarial Environment Design via Regret-Guided Diffusion Models
by: Chung, Hojun, et al.
Published: (2024)
by: Chung, Hojun, et al.
Published: (2024)
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
by: Park, Jaehyun, et al.
Published: (2024)
by: Park, Jaehyun, et al.
Published: (2024)
Enhancing Analogical Reasoning in the Abstraction and Reasoning Corpus via Model-Based RL
by: Lee, Jihwan, et al.
Published: (2024)
by: Lee, Jihwan, et al.
Published: (2024)
System 2 Reasoning for Human-AI Alignment: Generality and Adaptivity via ARC-AGI
by: Kim, Sejin, et al.
Published: (2024)
by: Kim, Sejin, et al.
Published: (2024)
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task
by: Kim, Yunho, et al.
Published: (2024)
by: Kim, Yunho, et al.
Published: (2024)
Addressing and Visualizing Misalignments in Human Task-Solving Trajectories
by: Kim, Sejin, et al.
Published: (2024)
by: Kim, Sejin, et al.
Published: (2024)
Partial Inverse Design of High-Performance Concrete Using Cooperative Neural Networks for Constraint-Aware Mix Generation
by: Nugraha, Agung, et al.
Published: (2025)
by: Nugraha, Agung, et al.
Published: (2025)
ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving
by: Kim, Sejin, et al.
Published: (2025)
by: Kim, Sejin, et al.
Published: (2025)
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
by: Kim, Dohyeong, et al.
Published: (2024)
by: Kim, Dohyeong, et al.
Published: (2024)
BoA: Attention-aware Post-training Quantization without Backpropagation
by: Kim, Junhan, et al.
Published: (2024)
by: Kim, Junhan, et al.
Published: (2024)
Refining Minimax Regret for Unsupervised Environment Design
by: Beukman, Michael, et al.
Published: (2024)
by: Beukman, Michael, et al.
Published: (2024)
Backdoor defense, learnability and obfuscation
by: Christiano, Paul, et al.
Published: (2024)
by: Christiano, Paul, et al.
Published: (2024)
No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery
by: Rutherford, Alexander, et al.
Published: (2024)
by: Rutherford, Alexander, et al.
Published: (2024)
Progressive Weight Loading: Accelerating Initial Inference and Gradually Boosting Performance on Resource-Constrained Environments
by: Kim, Hyunwoo, et al.
Published: (2025)
by: Kim, Hyunwoo, et al.
Published: (2025)
Causal Disentanglement Learning for Accurate Anomaly Detection in Multivariate Time Series
by: Kim, Wonah, et al.
Published: (2025)
by: Kim, Wonah, et al.
Published: (2025)
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
by: Kim, Seyeon, et al.
Published: (2024)
by: Kim, Seyeon, et al.
Published: (2024)
Scaling Laws of SignSGD in Linear Regression: When Does It Outperform SGD?
by: Kim, Jihwan, et al.
Published: (2026)
by: Kim, Jihwan, et al.
Published: (2026)
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
by: Cho, Taehyun, et al.
Published: (2024)
by: Cho, Taehyun, et al.
Published: (2024)
Subspace-based Approximate Hessian Method for Zeroth-Order Optimization
by: Kim, Dongyoon, et al.
Published: (2025)
by: Kim, Dongyoon, et al.
Published: (2025)
The Othello AI Arena: Evaluating Intelligent Systems Through Limited-Time Adaptation to Unseen Boards
by: Kim, Sundong
Published: (2025)
by: Kim, Sundong
Published: (2025)
Attention-aware Semantic Communications for Collaborative Inference
by: Im, Jiwoong, et al.
Published: (2024)
by: Im, Jiwoong, et al.
Published: (2024)
Objectomaly: Objectness-Aware Refinement for OoD Segmentation with Structural Consistency and Boundary Precision
by: Song, Jeonghoon, et al.
Published: (2025)
by: Song, Jeonghoon, et al.
Published: (2025)
Offline Reinforcement Learning with Universal Horizon Models
by: Chung, Hojun, et al.
Published: (2026)
by: Chung, Hojun, et al.
Published: (2026)
Hierarchical and Modular Network on Non-prehensile Manipulation in General Environments
by: Cho, Yoonyoung, et al.
Published: (2025)
by: Cho, Yoonyoung, et al.
Published: (2025)
SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation
by: Shin, Dahun, et al.
Published: (2025)
by: Shin, Dahun, et al.
Published: (2025)
TransPL: VQ-Code Transition Matrices for Pseudo-Labeling of Time Series Unsupervised Domain Adaptation
by: Kim, Jaeho, et al.
Published: (2025)
by: Kim, Jaeho, et al.
Published: (2025)
Bridging the Gap Between Molecule and Textual Descriptions via Substructure-aware Alignment
by: Park, Hyuntae, et al.
Published: (2025)
by: Park, Hyuntae, et al.
Published: (2025)
Explicit Feature Interaction-aware Graph Neural Networks
by: Kim, Minkyu, et al.
Published: (2022)
by: Kim, Minkyu, et al.
Published: (2022)
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference
by: Kim, Taesu, et al.
Published: (2024)
by: Kim, Taesu, et al.
Published: (2024)
Learning-augmented smooth integer programs with PAC-learnable oracles
by: He, Hao-Yuan, et al.
Published: (2026)
by: He, Hao-Yuan, et al.
Published: (2026)
CVA: Context-aware Video-text Alignment for Video Temporal Grounding
by: Moon, Sungho, et al.
Published: (2026)
by: Moon, Sungho, et al.
Published: (2026)
Lipschitz-aware Linearity Grafting for Certified Robustness
by: Han, Yongjin, et al.
Published: (2025)
by: Han, Yongjin, et al.
Published: (2025)
Transferable Model-agnostic Vision-Language Model Adaptation for Efficient Weak-to-Strong Generalization
by: Park, Jihwan, et al.
Published: (2025)
by: Park, Jihwan, et al.
Published: (2025)
An End-to-End Approach for Korean Wakeword Systems with Speaker Authentication
by: Seo, Geonwoo
Published: (2025)
by: Seo, Geonwoo
Published: (2025)
Thickness-aware E(3)-Equivariant 3D Mesh Neural Networks
by: Kim, Sungwon, et al.
Published: (2025)
by: Kim, Sungwon, et al.
Published: (2025)
Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm
by: Vakili, Sattar, et al.
Published: (2024)
by: Vakili, Sattar, et al.
Published: (2024)
Vocabulary shapes cross-lingual variation of word-order learnability in language models
by: Martins, Jonas Mayer, et al.
Published: (2026)
by: Martins, Jonas Mayer, et al.
Published: (2026)
Similar Items
-
Causal-Paced Deep Reinforcement Learning
by: Cho, Geonwoo, et al.
Published: (2025) -
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
by: Cho, Geonwoo, et al.
Published: (2025) -
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning
by: Lee, Hosung, et al.
Published: (2024) -
Adversarial Environment Design via Regret-Guided Diffusion Models
by: Chung, Hojun, et al.
Published: (2024) -
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
by: Park, Jaehyun, et al.
Published: (2024)