:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yoo, Jaesung, de la Torre, Fernanda, Yang, Guangyu Robert
Format:	Preprint
Published:	2023
Subjects:	Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2306.04440
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains
by: Bae, Jaesung, et al.
Published: (2026)

Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech
by: Bae, Jaesung, et al.
Published: (2026)

On Evaluation of Unsupervised Feature Selection for Pattern Classification
by: Kim, Gyu-Il, et al.
Published: (2026)

ODIM: Outlier Detection via Likelihood of Under-Fitted Generative Models
by: Kim, Dongha, et al.
Published: (2023)

Primal-Dual Spectral Representation for Off-policy Evaluation
by: Hu, Yang, et al.
Published: (2024)

Review learning: Real world validation of privacy preserving continual learning across medical institutions
by: Yoo, Jaesung, et al.
Published: (2022)

What type of inference is planning?
by: Lázaro-Gredilla, Miguel, et al.
Published: (2024)

Value-guided action planning with JEPA world models
by: Destrade, Matthieu, et al.
Published: (2025)

Advanced Torrential Loss Function for Precipitation Forecasting
by: Choi, Jaeho, et al.
Published: (2025)

Fast and Effective On-policy Distillation from Reasoning Prefixes
by: Zhang, Dongxu, et al.
Published: (2026)

Implicit Hypergraph Neural Networks: A Stable Framework for Higher-Order Relational Learning with Provable Guarantees
by: Li, Xiaoyu, et al.
Published: (2025)

Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning
by: Wang, Qingqing, et al.
Published: (2024)

Automated ICD Classification of Psychiatric Diagnoses: From Classical NLP to Large Language Models
by: Ortega, Fernando, et al.
Published: (2026)

Global Spatio-Temporal Fusion-based Traffic Prediction Algorithm with Anomaly Aware
by: Liu, Chaoqun, et al.
Published: (2024)

Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
by: Mu, Zhancun, et al.
Published: (2026)

Using large language models for embodied planning introduces systematic safety risks
by: Zhang, Tao, et al.
Published: (2026)

Epic-Sounds: A Large-scale Dataset of Actions That Sound
by: Huh, Jaesung, et al.
Published: (2023)

ExO-PPO: an Extended Off-policy Proximal Policy Optimization Algorithm
by: Wang, Hanyong, et al.
Published: (2026)

Retro-fallback: retrosynthetic planning in an uncertain world
by: Tripp, Austin, et al.
Published: (2023)

Rubric-based On-policy Distillation
by: Fang, Junfeng, et al.
Published: (2026)

MambaSL: Exploring Single-Layer Mamba for Time Series Classification
by: Jung, Yoo-Min, et al.
Published: (2026)

A Flat Minima Perspective on Understanding Augmentations and Model Robustness
by: Yoo, Weebum, et al.
Published: (2025)

ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning
by: Liang, Kun, et al.
Published: (2026)

Physics-informed self-supervised learning for predictive modeling of coronary artery digital twins
by: Sun, Xiaowu, et al.
Published: (2025)

DeepLogit: A sequentially constrained explainable deep learning modeling approach for transport policy analysis
by: Oon, Jeremy, et al.
Published: (2025)

2DXformer: Dual Transformers for Wind Power Forecasting with Dual Exogenous Variables
by: Zhang, Yajuan, et al.
Published: (2025)

Markov flow policy -- deep MC
by: Soffair, Nitsan, et al.
Published: (2024)

Semantic Geometry for policy-constrained interpretation
by: Phadke, Nikit
Published: (2025)

Dynamic planning in hierarchical active inference
by: Priorelli, Matteo, et al.
Published: (2024)

ReDi: Rectified Discrete Flow
by: Yoo, Jaehoon, et al.
Published: (2025)

Offline Policy Learning via Skill-step Abstraction for Long-horizon Goal-Conditioned Tasks
by: Kim, Donghoon, et al.
Published: (2024)

NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache
by: Son, Donghyun, et al.
Published: (2025)

PIMSM: Physics-Informed Multi-Scale Mamba for Stable Neural Representations under Distribution Shift
by: Bae, Sangyoon, et al.
Published: (2026)

Sophisticated Learning: A novel algorithm for active learning during model-based planning
by: Hodson, Rowan, et al.
Published: (2023)

Multiple-policy Evaluation via Density Estimation
by: Chen, Yilei, et al.
Published: (2024)

That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation
by: Bae, Jaesung, et al.
Published: (2025)

Explainable deep learning improves human mental models of self-driving cars
by: Kenny, Eoin M., et al.
Published: (2024)

What's the plan? Metrics for implicit planning in LLMs and their application to rhyme generation and question answering
by: Maar, Jim, et al.
Published: (2026)

Improving planning and MBRL with temporally-extended actions
by: Chatterjee, Palash, et al.
Published: (2025)

On Predictive planning and counterfactual learning in active inference
by: Paul, Aswin, et al.
Published: (2024)