Saved in:
| Main Authors: | Fujimoto, Takumi, Nishi, Hiroaki |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.01036 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Variance reduction of diffusion model's gradients with Taylor approximation-based control variate
by: Jeha, Paul, et al.
Published: (2024)
by: Jeha, Paul, et al.
Published: (2024)
Data value estimation on private gradients
by: Zhou, Zijian, et al.
Published: (2024)
by: Zhou, Zijian, et al.
Published: (2024)
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)
by: Lyle, Clare, et al.
Published: (2024)
Discrete-Choice Model with Generalized Additive Utility Network
by: Nishi, Tomoki, et al.
Published: (2023)
by: Nishi, Tomoki, et al.
Published: (2023)
Feature learning as alignment: a structural property of gradient descent in non-linear neural networks
by: Beaglehole, Daniel, et al.
Published: (2024)
by: Beaglehole, Daniel, et al.
Published: (2024)
On-site estimation of battery electrochemical parameters via transfer learning based physics-informed neural network approach
by: Yeregui, Josu, et al.
Published: (2025)
by: Yeregui, Josu, et al.
Published: (2025)
NeuralMOVES: A lightweight and microscopic vehicle emission estimation model based on reverse engineering and surrogate learning
by: Ramirez-Sanchez, Edgar, et al.
Published: (2025)
by: Ramirez-Sanchez, Edgar, et al.
Published: (2025)
Hardware-accelerated graph neural networks: an alternative approach for neuromorphic event-based audio classification and keyword spotting on SoC FPGA
by: Jeziorek, Kamil, et al.
Published: (2026)
by: Jeziorek, Kamil, et al.
Published: (2026)
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)
by: Lim, Han-Dong, et al.
Published: (2025)
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
by: Arnob, Samin Yeasar, et al.
Published: (2025)
by: Arnob, Samin Yeasar, et al.
Published: (2025)
HyperbolicLR: Epoch insensitive learning rate scheduler
by: Kim, Tae-Geun
Published: (2024)
by: Kim, Tae-Geun
Published: (2024)
Multi-omics data integration for early diagnosis of hepatocellular carcinoma (HCC) using machine learning
by: Spooner, Annette, et al.
Published: (2024)
by: Spooner, Annette, et al.
Published: (2024)
Hardware-Accelerated Event-Graph Neural Networks for Low-Latency Time-Series Classification on SoC FPGA
by: Nakano, Hiroshi, et al.
Published: (2025)
by: Nakano, Hiroshi, et al.
Published: (2025)
Electrostatics-based particle sampling and approximate inference
by: Huang, Yongchao
Published: (2024)
by: Huang, Yongchao
Published: (2024)
Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning
by: Wang, Qingqing, et al.
Published: (2024)
by: Wang, Qingqing, et al.
Published: (2024)
Debiased Model-based Representations for Sample-efficient Continuous Control
by: Lyu, Jiafei, et al.
Published: (2026)
by: Lyu, Jiafei, et al.
Published: (2026)
Scalable Option Learning in High-Throughput Environments
by: Henaff, Mikael, et al.
Published: (2025)
by: Henaff, Mikael, et al.
Published: (2025)
Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation
by: Yin, Shuyu, et al.
Published: (2024)
by: Yin, Shuyu, et al.
Published: (2024)
State-space models can learn in-context by gradient descent
by: Sushma, Neeraj Mohan, et al.
Published: (2024)
by: Sushma, Neeraj Mohan, et al.
Published: (2024)
A second-order-like optimizer with adaptive gradient scaling for deep learning
by: Bolte, Jérôme, et al.
Published: (2024)
by: Bolte, Jérôme, et al.
Published: (2024)
A federated learning framework with knowledge graph and temporal transformer for early sepsis prediction in multi-center ICUs
by: Chang, Yue, et al.
Published: (2026)
by: Chang, Yue, et al.
Published: (2026)
Trainable Quantum Neural Network for Multiclass Image Classification with the Power of Pre-trained Tree Tensor Networks
by: Murota, Keisuke, et al.
Published: (2025)
by: Murota, Keisuke, et al.
Published: (2025)
PF-GNN: Differentiable particle filtering based approximation of universal graph representations
by: Dupty, Mohammed Haroon, et al.
Published: (2024)
by: Dupty, Mohammed Haroon, et al.
Published: (2024)
A Polynomial-Time Axiomatic Alternative to SHAP for Feature Attribution
by: Hiraki, Kazuhiro, et al.
Published: (2026)
by: Hiraki, Kazuhiro, et al.
Published: (2026)
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
by: Yang, Yan, et al.
Published: (2024)
by: Yang, Yan, et al.
Published: (2024)
No learning rates needed: Introducing SALSA -- Stable Armijo Line Search Adaptation
by: Kenneweg, Philip, et al.
Published: (2024)
by: Kenneweg, Philip, et al.
Published: (2024)
Noise-based reward-modulated learning
by: Fernández, Jesús García, et al.
Published: (2025)
by: Fernández, Jesús García, et al.
Published: (2025)
Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning
by: Fujisawa, Yoshihiko, et al.
Published: (2026)
by: Fujisawa, Yoshihiko, et al.
Published: (2026)
Surrogate uncertainty estimation for your time series forecasting black-box: learn when to trust
by: Erlygin, Leonid, et al.
Published: (2023)
by: Erlygin, Leonid, et al.
Published: (2023)
LPCD: Unified Framework from Layer-Wise to Submodule Quantization
by: Ichikawa, Yuma, et al.
Published: (2025)
by: Ichikawa, Yuma, et al.
Published: (2025)
Towards General-Purpose Model-Free Reinforcement Learning
by: Fujimoto, Scott, et al.
Published: (2025)
by: Fujimoto, Scott, et al.
Published: (2025)
Optimal rates for density and mode estimation with expand-and-sparsify representations
by: Sinha, Kaushik, et al.
Published: (2026)
by: Sinha, Kaushik, et al.
Published: (2026)
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
by: Patil, Gandharv, et al.
Published: (2022)
by: Patil, Gandharv, et al.
Published: (2022)
Imitation Learning from Observation through Optimal Transport
by: Chang, Wei-Di, et al.
Published: (2023)
by: Chang, Wei-Di, et al.
Published: (2023)
Causal prompting model-based offline reinforcement learning
by: Yu, Xuehui, et al.
Published: (2024)
by: Yu, Xuehui, et al.
Published: (2024)
Deep learning-based fault identification in condition monitoring
by: Dhungana, Hariom, et al.
Published: (2024)
by: Dhungana, Hariom, et al.
Published: (2024)
An end-to-end attention-based approach for learning on graphs
by: Buterez, David, et al.
Published: (2024)
by: Buterez, David, et al.
Published: (2024)
Learning-based estimation of cattle weight gain and its influencing factors
by: Hossain, Muhammad Riaz Hasib, et al.
Published: (2025)
by: Hossain, Muhammad Riaz Hasib, et al.
Published: (2025)
Bridging the Rural Healthcare Gap: A Cascaded Edge-Cloud Architecture for Automated Retinal Screening
by: Doshi, Nishi, et al.
Published: (2026)
by: Doshi, Nishi, et al.
Published: (2026)
Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)
by: Tavakoli, Arash, et al.
Published: (2024)
Similar Items
-
Variance reduction of diffusion model's gradients with Taylor approximation-based control variate
by: Jeha, Paul, et al.
Published: (2024) -
Data value estimation on private gradients
by: Zhou, Zijian, et al.
Published: (2024) -
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024) -
Discrete-Choice Model with Generalized Additive Utility Network
by: Nishi, Tomoki, et al.
Published: (2023) -
Feature learning as alignment: a structural property of gradient descent in non-linear neural networks
by: Beaglehole, Daniel, et al.
Published: (2024)