Library Catalog

Bibliographic Details
Main Authors:	Fujimoto, Takumi; Nishi, Hiroaki
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.01036

Similar Items

Variance reduction of diffusion model's gradients with Taylor approximation-based control variate
by: Jeha, Paul, et al.
Published: (2024)

Data value estimation on private gradients
by: Zhou, Zijian, et al.
Published: (2024)

Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)

Discrete-Choice Model with Generalized Additive Utility Network
by: Nishi, Tomoki, et al.
Published: (2023)

Feature learning as alignment: a structural property of gradient descent in non-linear neural networks
by: Beaglehole, Daniel, et al.
Published: (2024)

On-site estimation of battery electrochemical parameters via transfer learning based physics-informed neural network approach
by: Yeregui, Josu, et al.
Published: (2025)

NeuralMOVES: A lightweight and microscopic vehicle emission estimation model based on reverse engineering and surrogate learning
by: Ramirez-Sanchez, Edgar, et al.
Published: (2025)

Hardware-accelerated graph neural networks: an alternative approach for neuromorphic event-based audio classification and keyword spotting on SoC FPGA
by: Jeziorek, Kamil, et al.
Published: (2026)

Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)

Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
by: Arnob, Samin Yeasar, et al.
Published: (2025)

HyperbolicLR: Epoch insensitive learning rate scheduler
by: Kim, Tae-Geun
Published: (2024)

Multi-omics data integration for early diagnosis of hepatocellular carcinoma (HCC) using machine learning
by: Spooner, Annette, et al.
Published: (2024)

Hardware-Accelerated Event-Graph Neural Networks for Low-Latency Time-Series Classification on SoC FPGA
by: Nakano, Hiroshi, et al.
Published: (2025)

Electrostatics-based particle sampling and approximate inference
by: Huang, Yongchao
Published: (2024)

Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning
by: Wang, Qingqing, et al.
Published: (2024)

Debiased Model-based Representations for Sample-efficient Continuous Control
by: Lyu, Jiafei, et al.
Published: (2026)

Scalable Option Learning in High-Throughput Environments
by: Henaff, Mikael, et al.
Published: (2025)

Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker-Planck Equation
by: Yin, Shuyu, et al.
Published: (2024)

State-space models can learn in-context by gradient descent
by: Sushma, Neeraj Mohan, et al.
Published: (2024)

A second-order-like optimizer with adaptive gradient scaling for deep learning
by: Bolte, Jérôme, et al.
Published: (2024)

A federated learning framework with knowledge graph and temporal transformer for early sepsis prediction in multi-center ICUs
by: Chang, Yue, et al.
Published: (2026)

Trainable Quantum Neural Network for Multiclass Image Classification with the Power of Pre-trained Tree Tensor Networks
by: Murota, Keisuke, et al.
Published: (2025)

PF-GNN: Differentiable particle filtering based approximation of universal graph representations
by: Dupty, Mohammed Haroon, et al.
Published: (2024)

A Polynomial-Time Axiomatic Alternative to SHAP for Feature Attribution
by: Hiraki, Kazuhiro, et al.
Published: (2026)

Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
by: Yang, Yan, et al.
Published: (2024)

No learning rates needed: Introducing SALSA -- Stable Armijo Line Search Adaptation
by: Kenneweg, Philip, et al.
Published: (2024)

Noise-based reward-modulated learning
by: Fernández, Jesús García, et al.
Published: (2025)

Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning
by: Fujisawa, Yoshihiko, et al.
Published: (2026)

Surrogate uncertainty estimation for your time series forecasting black-box: learn when to trust
by: Erlygin, Leonid, et al.
Published: (2023)

LPCD: Unified Framework from Layer-Wise to Submodule Quantization
by: Ichikawa, Yuma, et al.
Published: (2025)

Towards General-Purpose Model-Free Reinforcement Learning
by: Fujimoto, Scott, et al.
Published: (2025)

Optimal rates for density and mode estimation with expand-and-sparsify representations
by: Sinha, Kaushik, et al.
Published: (2026)

Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
by: Patil, Gandharv, et al.
Published: (2022)

Imitation Learning from Observation through Optimal Transport
by: Chang, Wei-Di, et al.
Published: (2023)

Causal prompting model-based offline reinforcement learning
by: Yu, Xuehui, et al.
Published: (2024)

Deep learning-based fault identification in condition monitoring
by: Dhungana, Hariom, et al.
Published: (2024)

An end-to-end attention-based approach for learning on graphs
by: Buterez, David, et al.
Published: (2024)

Learning-based estimation of cattle weight gain and its influencing factors
by: Hossain, Muhammad Riaz Hasib, et al.
Published: (2025)

Bridging the Rural Healthcare Gap: A Cascaded Edge-Cloud Architecture for Automated Retinal Screening
by: Doshi, Nishi, et al.
Published: (2026)

Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)