Saved in:
| Main Authors: | Yoo, Jaesung, de la Torre, Fernanda, Yang, Guangyu Robert |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2306.04440 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains
by: Bae, Jaesung, et al.
Published: (2026)
by: Bae, Jaesung, et al.
Published: (2026)
Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech
by: Bae, Jaesung, et al.
Published: (2026)
by: Bae, Jaesung, et al.
Published: (2026)
On Evaluation of Unsupervised Feature Selection for Pattern Classification
by: Kim, Gyu-Il, et al.
Published: (2026)
by: Kim, Gyu-Il, et al.
Published: (2026)
ODIM: Outlier Detection via Likelihood of Under-Fitted Generative Models
by: Kim, Dongha, et al.
Published: (2023)
by: Kim, Dongha, et al.
Published: (2023)
Primal-Dual Spectral Representation for Off-policy Evaluation
by: Hu, Yang, et al.
Published: (2024)
by: Hu, Yang, et al.
Published: (2024)
Review learning: Real world validation of privacy preserving continual learning across medical institutions
by: Yoo, Jaesung, et al.
Published: (2022)
by: Yoo, Jaesung, et al.
Published: (2022)
What type of inference is planning?
by: Lázaro-Gredilla, Miguel, et al.
Published: (2024)
by: Lázaro-Gredilla, Miguel, et al.
Published: (2024)
Value-guided action planning with JEPA world models
by: Destrade, Matthieu, et al.
Published: (2025)
by: Destrade, Matthieu, et al.
Published: (2025)
Advanced Torrential Loss Function for Precipitation Forecasting
by: Choi, Jaeho, et al.
Published: (2025)
by: Choi, Jaeho, et al.
Published: (2025)
Fast and Effective On-policy Distillation from Reasoning Prefixes
by: Zhang, Dongxu, et al.
Published: (2026)
by: Zhang, Dongxu, et al.
Published: (2026)
Implicit Hypergraph Neural Networks: A Stable Framework for Higher-Order Relational Learning with Provable Guarantees
by: Li, Xiaoyu, et al.
Published: (2025)
by: Li, Xiaoyu, et al.
Published: (2025)
Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning
by: Wang, Qingqing, et al.
Published: (2024)
by: Wang, Qingqing, et al.
Published: (2024)
Automated ICD Classification of Psychiatric Diagnoses: From Classical NLP to Large Language Models
by: Ortega, Fernando, et al.
Published: (2026)
by: Ortega, Fernando, et al.
Published: (2026)
Global Spatio-Temporal Fusion-based Traffic Prediction Algorithm with Anomaly Aware
by: Liu, Chaoqun, et al.
Published: (2024)
by: Liu, Chaoqun, et al.
Published: (2024)
Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
by: Mu, Zhancun, et al.
Published: (2026)
by: Mu, Zhancun, et al.
Published: (2026)
Using large language models for embodied planning introduces systematic safety risks
by: Zhang, Tao, et al.
Published: (2026)
by: Zhang, Tao, et al.
Published: (2026)
Epic-Sounds: A Large-scale Dataset of Actions That Sound
by: Huh, Jaesung, et al.
Published: (2023)
by: Huh, Jaesung, et al.
Published: (2023)
ExO-PPO: an Extended Off-policy Proximal Policy Optimization Algorithm
by: Wang, Hanyong, et al.
Published: (2026)
by: Wang, Hanyong, et al.
Published: (2026)
Retro-fallback: retrosynthetic planning in an uncertain world
by: Tripp, Austin, et al.
Published: (2023)
by: Tripp, Austin, et al.
Published: (2023)
Rubric-based On-policy Distillation
by: Fang, Junfeng, et al.
Published: (2026)
by: Fang, Junfeng, et al.
Published: (2026)
MambaSL: Exploring Single-Layer Mamba for Time Series Classification
by: Jung, Yoo-Min, et al.
Published: (2026)
by: Jung, Yoo-Min, et al.
Published: (2026)
A Flat Minima Perspective on Understanding Augmentations and Model Robustness
by: Yoo, Weebum, et al.
Published: (2025)
by: Yoo, Weebum, et al.
Published: (2025)
ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning
by: Liang, Kun, et al.
Published: (2026)
by: Liang, Kun, et al.
Published: (2026)
Physics-informed self-supervised learning for predictive modeling of coronary artery digital twins
by: Sun, Xiaowu, et al.
Published: (2025)
by: Sun, Xiaowu, et al.
Published: (2025)
DeepLogit: A sequentially constrained explainable deep learning modeling approach for transport policy analysis
by: Oon, Jeremy, et al.
Published: (2025)
by: Oon, Jeremy, et al.
Published: (2025)
2DXformer: Dual Transformers for Wind Power Forecasting with Dual Exogenous Variables
by: Zhang, Yajuan, et al.
Published: (2025)
by: Zhang, Yajuan, et al.
Published: (2025)
Markov flow policy -- deep MC
by: Soffair, Nitsan, et al.
Published: (2024)
by: Soffair, Nitsan, et al.
Published: (2024)
Semantic Geometry for policy-constrained interpretation
by: Phadke, Nikit
Published: (2025)
by: Phadke, Nikit
Published: (2025)
Dynamic planning in hierarchical active inference
by: Priorelli, Matteo, et al.
Published: (2024)
by: Priorelli, Matteo, et al.
Published: (2024)
ReDi: Rectified Discrete Flow
by: Yoo, Jaehoon, et al.
Published: (2025)
by: Yoo, Jaehoon, et al.
Published: (2025)
Offline Policy Learning via Skill-step Abstraction for Long-horizon Goal-Conditioned Tasks
by: Kim, Donghoon, et al.
Published: (2024)
by: Kim, Donghoon, et al.
Published: (2024)
NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache
by: Son, Donghyun, et al.
Published: (2025)
by: Son, Donghyun, et al.
Published: (2025)
PIMSM: Physics-Informed Multi-Scale Mamba for Stable Neural Representations under Distribution Shift
by: Bae, Sangyoon, et al.
Published: (2026)
by: Bae, Sangyoon, et al.
Published: (2026)
Sophisticated Learning: A novel algorithm for active learning during model-based planning
by: Hodson, Rowan, et al.
Published: (2023)
by: Hodson, Rowan, et al.
Published: (2023)
Multiple-policy Evaluation via Density Estimation
by: Chen, Yilei, et al.
Published: (2024)
by: Chen, Yilei, et al.
Published: (2024)
That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation
by: Bae, Jaesung, et al.
Published: (2025)
by: Bae, Jaesung, et al.
Published: (2025)
Explainable deep learning improves human mental models of self-driving cars
by: Kenny, Eoin M., et al.
Published: (2024)
by: Kenny, Eoin M., et al.
Published: (2024)
What's the plan? Metrics for implicit planning in LLMs and their application to rhyme generation and question answering
by: Maar, Jim, et al.
Published: (2026)
by: Maar, Jim, et al.
Published: (2026)
Improving planning and MBRL with temporally-extended actions
by: Chatterjee, Palash, et al.
Published: (2025)
by: Chatterjee, Palash, et al.
Published: (2025)
On Predictive planning and counterfactual learning in active inference
by: Paul, Aswin, et al.
Published: (2024)
by: Paul, Aswin, et al.
Published: (2024)
Similar Items
-
Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains
by: Bae, Jaesung, et al.
Published: (2026) -
Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech
by: Bae, Jaesung, et al.
Published: (2026) -
On Evaluation of Unsupervised Feature Selection for Pattern Classification
by: Kim, Gyu-Il, et al.
Published: (2026) -
ODIM: Outlier Detection via Likelihood of Under-Fitted Generative Models
by: Kim, Dongha, et al.
Published: (2023) -
Primal-Dual Spectral Representation for Off-policy Evaluation
by: Hu, Yang, et al.
Published: (2024)