| Main Authors: | Kotamreddy, Harshil; Machado, Marlos C. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.09127 |
Similar Items
An Analysis of Action-Value Temporal-Difference Methods That Learn State Values
by: Daley, Brett, et al.
Published: (2025)
Multimodal Data Curation Through Ranked Retrieval
by: Muthukumar, Pratyush, et al.
Published: (2026)
Reward-Aware Proto-Representations in Reinforcement Learning
by: Tse, Hon Tik, et al.
Published: (2025)
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
by: Daley, Brett, et al.
Published: (2023)
The Laplacian Keyboard: Beyond the Linear Span
by: Chandrasekar, Siddarth, et al.
Published: (2026)
DROGO: Default Representation Objective via Graph Optimization in Reinforcement Learning
by: Tse, Hon Tik, et al.
Published: (2026)
Plastic Learning with Deep Fourier Features
by: Lewandowski, Alex, et al.
Published: (2024)
Averaging $n$-step Returns Reduces Variance in Reinforcement Learning
by: Daley, Brett, et al.
Published: (2024)
Deep Double Q-learning
by: Nagarajan, Prabhat, et al.
Published: (2025)
Demystifying the Recency Heuristic in Temporal-Difference Learning
by: Daley, Brett, et al.
Published: (2024)
Proper Laplacian Representation Learning
by: Gomez, Diego, et al.
Published: (2023)
Harnessing Discrete Representations For Continual Reinforcement Learning
by: Meyer, Edan, et al.
Published: (2023)
Directions of Curvature as an Explanation for Loss of Plasticity
by: Lewandowski, Alex, et al.
Published: (2023)
Laplacian Representations for Decision-Time Planning
by: Shehmar, Dikshant, et al.
Published: (2026)
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning
by: Pramanik, Subhojeet, et al.
Published: (2023)
LATTA: Langevin-Anchored Test-Time Adaptation for Enhanced Robustness and Stability
by: Vejendla, Harshil
Published: (2025)
Learning to Predict Chaos: Curriculum-Driven Training for Robust Forecasting of Chaotic Dynamics
by: Vejendla, Harshil
Published: (2025)
Investigating Sparsity in Recurrent Neural Networks
by: Darji, Harshil
Published: (2024)
Drift-Adapter: A Practical Approach to Near Zero-Downtime Embedding Model Upgrades in Vector Databases
by: Vejendla, Harshil
Published: (2025)
H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
by: Vejendla, Harshil
Published: (2025)
Teaching by Failure: Counter-Example-Driven Curricula for Transformer Self-Improvement
by: Vejendla, Harshil
Published: (2025)
Wave-PDE Nets: Trainable Wave-Equation Layers as an Alternative to Attention
by: Vejendla, Harshil
Published: (2025)
RewriteNets: End-to-End Trainable String-Rewriting for Generative Sequence Modeling
by: Vejendla, Harshil
Published: (2026)
SliceMoE: Routing Embedding Slices Instead of Tokens for Fine-Grained and Balanced Transformer Scaling
by: Vejendla, Harshil
Published: (2025)
Deep Reinforcement Learning with Gradient Eligibility Traces
by: Elelimy, Esraa, et al.
Published: (2025)
Learning Continually by Spectral Regularization
by: Lewandowski, Alex, et al.
Published: (2024)
The Cell Must Go On: Agar.io for Continual Reinforcement Learning
by: Mohamed, Mohamed A., et al.
Published: (2025)
Challenges and Considerations in Annotating Legal Data: A Comprehensive Overview
by: Darji, Harshil, et al.
Published: (2024)
Kolmogorov-Arnold Neural Networks for High-Entropy Alloys Design
by: Bandyopadhyay, Yagnik, et al.
Published: (2024)
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training
by: Cao, Chuxue, et al.
Published: (2026)
Priority-Aware Shapley Value
by: Lee, Kiljae, et al.
Published: (2026)
Segmentation and Processing of German Court Decisions from Open Legal Data
by: Darji, Harshil, et al.
Published: (2026)
MaestroMotif: Skill Design from Artificial Intelligence Feedback
by: Klissarov, Martin, et al.
Published: (2024)
Generalized Priority-Aware Shapley Value
by: Lee, Kiljae, et al.
Published: (2026)
Beyond Shapley Values: Cooperative Games for the Interpretation of Machine Learning Models
by: Idrissi, Marouane Il, et al.
Published: (2025)
Calibrated Value-Aware Model Learning with Probabilistic Environment Models
by: Voelcker, Claas, et al.
Published: (2025)
PhyPlan: Generalizable and Rapid Physical Task Planning with Physics Informed Skill Networks for Robot Manipulators
by: Chopra, Mudit, et al.
Published: (2024)
A Comprehensive Study of Shapley Value in Data Analytics
by: Lin, Hong, et al.
Published: (2024)
Toward Autonomous UI Exploration: The UIExplorer Benchmark
by: Nica, Andrei Cristian, et al.
Published: (2025)
AIM: Intent-Aware Unified world action Modeling with Spatial Value Maps
by: Fan, Liaoyuan, et al.
Published: (2026)