Saved in:
| Main Authors: | Creer, Kode, Parvez, Imitiaz |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.03888 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
HKD-SHO: A hybrid smart home system based on knowledge-based and data-driven services
by: Qiu, Mingming, et al.
Published: (2024)
by: Qiu, Mingming, et al.
Published: (2024)
Rubric-based On-policy Distillation
by: Fang, Junfeng, et al.
Published: (2026)
by: Fang, Junfeng, et al.
Published: (2026)
CST-AFNet: A dual attention-based deep learning framework for intrusion detection in IoT networks
by: Ishtiaq, Waqas, et al.
Published: (2025)
by: Ishtiaq, Waqas, et al.
Published: (2025)
Low Variance Off-policy Evaluation with State-based Importance Sampling
by: Bossens, David M., et al.
Published: (2022)
by: Bossens, David M., et al.
Published: (2022)
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability
by: Alam, Md Ferdous, et al.
Published: (2023)
by: Alam, Md Ferdous, et al.
Published: (2023)
Predictive representations: building blocks of intelligence
by: Carvalho, Wilka, et al.
Published: (2024)
by: Carvalho, Wilka, et al.
Published: (2024)
On the consistent reasoning paradox of intelligence and optimal trust in AI: The power of 'I don't know'
by: Bastounis, Alexander, et al.
Published: (2024)
by: Bastounis, Alexander, et al.
Published: (2024)
On the benefits of pixel-based hierarchical policies for task generalization
by: Cristea-Platon, Tudor, et al.
Published: (2024)
by: Cristea-Platon, Tudor, et al.
Published: (2024)
MobileNetV2: A lightweight classification model for home-based sleep apnea screening
by: Pan, Hui, et al.
Published: (2024)
by: Pan, Hui, et al.
Published: (2024)
Real-world validation of safe reinforcement learning, model predictive control and decision tree-based home energy management systems
by: Ruddick, Julian, et al.
Published: (2024)
by: Ruddick, Julian, et al.
Published: (2024)
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
by: Wang, Yibo, et al.
Published: (2024)
by: Wang, Yibo, et al.
Published: (2024)
An overview of diffusion models for generative artificial intelligence
by: Gallon, Davide, et al.
Published: (2024)
by: Gallon, Davide, et al.
Published: (2024)
WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
by: Dihan, Mahir Labib, et al.
Published: (2025)
by: Dihan, Mahir Labib, et al.
Published: (2025)
Development of management systems using artificial intelligence systems and machine learning methods for boards of directors (preprint, unofficial translation)
by: Romanova, Anna
Published: (2025)
by: Romanova, Anna
Published: (2025)
Applications of Explainable artificial intelligence in Earth system science
by: Huang, Feini, et al.
Published: (2024)
by: Huang, Feini, et al.
Published: (2024)
fg-expo: Frontier-guided exploration-prioritized policy optimization via adaptive kl and gaussian curriculum
by: Lin, Mingxiong, et al.
Published: (2026)
by: Lin, Mingxiong, et al.
Published: (2026)
Rethinking industrial artificial intelligence: a unified foundation framework
by: Lee, Jay, et al.
Published: (2025)
by: Lee, Jay, et al.
Published: (2025)
Markov flow policy -- deep MC
by: Soffair, Nitsan, et al.
Published: (2024)
by: Soffair, Nitsan, et al.
Published: (2024)
Dual policy as self-model for planning
by: Yoo, Jaesung, et al.
Published: (2023)
by: Yoo, Jaesung, et al.
Published: (2023)
Semantic Geometry for policy-constrained interpretation
by: Phadke, Nikit
Published: (2025)
by: Phadke, Nikit
Published: (2025)
Multi-task Domain Adaptation for Computation Offloading in Edge-intelligence Networks
by: Han, Runxin, et al.
Published: (2025)
by: Han, Runxin, et al.
Published: (2025)
Separable neural architectures as a primitive for unified predictive and generative intelligence
by: Batley, Reza T., et al.
Published: (2026)
by: Batley, Reza T., et al.
Published: (2026)
LLM-based policy generation for intent-based management of applications
by: Dzeparoska, Kristina, et al.
Published: (2024)
by: Dzeparoska, Kristina, et al.
Published: (2024)
Multiple-policy Evaluation via Density Estimation
by: Chen, Yilei, et al.
Published: (2024)
by: Chen, Yilei, et al.
Published: (2024)
TastepepAI, An artificial intelligence platform for taste peptide de novo design
by: Yue, Jianda, et al.
Published: (2025)
by: Yue, Jianda, et al.
Published: (2025)
Tracking daily paths in home contexts with RSSI fingerprinting based on UWB through deep learning models
by: Polo-Rodríguez, Aurora, et al.
Published: (2025)
by: Polo-Rodríguez, Aurora, et al.
Published: (2025)
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
by: Liu, Shaoteng, et al.
Published: (2024)
by: Liu, Shaoteng, et al.
Published: (2024)
Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)
by: Tavakoli, Arash, et al.
Published: (2024)
Tourism destination events classifier based on artificial intelligence techniques
by: Camacho-Ruiz, Miguel, et al.
Published: (2024)
by: Camacho-Ruiz, Miguel, et al.
Published: (2024)
Counterfactual experience augmented off-policy reinforcement learning
by: Lee, Sunbowen, et al.
Published: (2025)
by: Lee, Sunbowen, et al.
Published: (2025)
Fast and Effective On-policy Distillation from Reasoning Prefixes
by: Zhang, Dongxu, et al.
Published: (2026)
by: Zhang, Dongxu, et al.
Published: (2026)
Detecting broken Absorber Tubes in CSP plants using intelligent sampling and dual loss
by: Pérez-Cutiño, Miguel Angel, et al.
Published: (2022)
by: Pérez-Cutiño, Miguel Angel, et al.
Published: (2022)
Self-Consistency from Only Two Samples: CoT-PoT Ensembling for Efficient LLM Reasoning
by: Saparkhan, Raman, et al.
Published: (2026)
by: Saparkhan, Raman, et al.
Published: (2026)
Generative method for aerodynamic optimization based on classifier-free guided denoising diffusion probabilistic model
by: Deng, Shisong, et al.
Published: (2025)
by: Deng, Shisong, et al.
Published: (2025)
Research on an intelligent fault diagnosis method for nuclear power plants based on ETCN-SSA combined algorithm
by: Fang, Jiayan, et al.
Published: (2024)
by: Fang, Jiayan, et al.
Published: (2024)
From paper to benchmark: agentic, framework-based reproduction of under-specified methods in machine health intelligence
by: Theiler, Raffael, et al.
Published: (2026)
by: Theiler, Raffael, et al.
Published: (2026)
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2021)
by: Patterson, Andrew, et al.
Published: (2021)
ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning
by: Liang, Kun, et al.
Published: (2026)
by: Liang, Kun, et al.
Published: (2026)
Bootstrap Off-policy with World Model
by: Zhan, Guojian, et al.
Published: (2025)
by: Zhan, Guojian, et al.
Published: (2025)
Artificial intelligence for science: The easy and hard problems
by: Battleday, Ruairidh M., et al.
Published: (2024)
by: Battleday, Ruairidh M., et al.
Published: (2024)
Similar Items
-
HKD-SHO: A hybrid smart home system based on knowledge-based and data-driven services
by: Qiu, Mingming, et al.
Published: (2024) -
Rubric-based On-policy Distillation
by: Fang, Junfeng, et al.
Published: (2026) -
CST-AFNet: A dual attention-based deep learning framework for intrusion detection in IoT networks
by: Ishtiaq, Waqas, et al.
Published: (2025) -
Low Variance Off-policy Evaluation with State-based Importance Sampling
by: Bossens, David M., et al.
Published: (2022) -
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability
by: Alam, Md Ferdous, et al.
Published: (2023)