:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Creer, Kode, Parvez, Imitiaz
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2404.03888
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

HKD-SHO: A hybrid smart home system based on knowledge-based and data-driven services
by: Qiu, Mingming, et al.
Published: (2024)

Rubric-based On-policy Distillation
by: Fang, Junfeng, et al.
Published: (2026)

CST-AFNet: A dual attention-based deep learning framework for intrusion detection in IoT networks
by: Ishtiaq, Waqas, et al.
Published: (2025)

Low Variance Off-policy Evaluation with State-based Importance Sampling
by: Bossens, David M., et al.
Published: (2022)

An advantage based policy transfer algorithm for reinforcement learning with measures of transferability
by: Alam, Md Ferdous, et al.
Published: (2023)

Predictive representations: building blocks of intelligence
by: Carvalho, Wilka, et al.
Published: (2024)

On the consistent reasoning paradox of intelligence and optimal trust in AI: The power of 'I don't know'
by: Bastounis, Alexander, et al.
Published: (2024)

On the benefits of pixel-based hierarchical policies for task generalization
by: Cristea-Platon, Tudor, et al.
Published: (2024)

MobileNetV2: A lightweight classification model for home-based sleep apnea screening
by: Pan, Hui, et al.
Published: (2024)

Real-world validation of safe reinforcement learning, model predictive control and decision tree-based home energy management systems
by: Ruddick, Julian, et al.
Published: (2024)

Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
by: Wang, Yibo, et al.
Published: (2024)

An overview of diffusion models for generative artificial intelligence
by: Gallon, Davide, et al.
Published: (2024)

WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
by: Dihan, Mahir Labib, et al.
Published: (2025)

Development of management systems using artificial intelligence systems and machine learning methods for boards of directors (preprint, unofficial translation)
by: Romanova, Anna
Published: (2025)

Applications of Explainable artificial intelligence in Earth system science
by: Huang, Feini, et al.
Published: (2024)

fg-expo: Frontier-guided exploration-prioritized policy optimization via adaptive kl and gaussian curriculum
by: Lin, Mingxiong, et al.
Published: (2026)

Rethinking industrial artificial intelligence: a unified foundation framework
by: Lee, Jay, et al.
Published: (2025)

Markov flow policy -- deep MC
by: Soffair, Nitsan, et al.
Published: (2024)

Dual policy as self-model for planning
by: Yoo, Jaesung, et al.
Published: (2023)

Semantic Geometry for policy-constrained interpretation
by: Phadke, Nikit
Published: (2025)

Multi-task Domain Adaptation for Computation Offloading in Edge-intelligence Networks
by: Han, Runxin, et al.
Published: (2025)

Separable neural architectures as a primitive for unified predictive and generative intelligence
by: Batley, Reza T., et al.
Published: (2026)

LLM-based policy generation for intent-based management of applications
by: Dzeparoska, Kristina, et al.
Published: (2024)

Multiple-policy Evaluation via Density Estimation
by: Chen, Yilei, et al.
Published: (2024)

TastepepAI, An artificial intelligence platform for taste peptide de novo design
by: Yue, Jianda, et al.
Published: (2025)

Tracking daily paths in home contexts with RSSI fingerprinting based on UWB through deep learning models
by: Polo-Rodríguez, Aurora, et al.
Published: (2025)

RL-GPT: Integrating Reinforcement Learning and Code-as-policy
by: Liu, Shaoteng, et al.
Published: (2024)

Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)

Tourism destination events classifier based on artificial intelligence techniques
by: Camacho-Ruiz, Miguel, et al.
Published: (2024)

Counterfactual experience augmented off-policy reinforcement learning
by: Lee, Sunbowen, et al.
Published: (2025)

Fast and Effective On-policy Distillation from Reasoning Prefixes
by: Zhang, Dongxu, et al.
Published: (2026)

Detecting broken Absorber Tubes in CSP plants using intelligent sampling and dual loss
by: Pérez-Cutiño, Miguel Angel, et al.
Published: (2022)

Self-Consistency from Only Two Samples: CoT-PoT Ensembling for Efficient LLM Reasoning
by: Saparkhan, Raman, et al.
Published: (2026)

Generative method for aerodynamic optimization based on classifier-free guided denoising diffusion probabilistic model
by: Deng, Shisong, et al.
Published: (2025)

Research on an intelligent fault diagnosis method for nuclear power plants based on ETCN-SSA combined algorithm
by: Fang, Jiayan, et al.
Published: (2024)

From paper to benchmark: agentic, framework-based reproduction of under-specified methods in machine health intelligence
by: Theiler, Raffael, et al.
Published: (2026)

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2021)

ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning
by: Liang, Kun, et al.
Published: (2026)

Bootstrap Off-policy with World Model
by: Zhan, Guojian, et al.
Published: (2025)

Artificial intelligence for science: The easy and hard problems
by: Battleday, Ruairidh M., et al.
Published: (2024)