Saved in:
| Main Authors: | Kumar, Abhijit, Kumar, Natalya, Gupta, Shikhar |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.16158 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Stepwise Credit Assignment for GRPO on Flow-Matching Models
by: Savani, Yash, et al.
Published: (2026)
by: Savani, Yash, et al.
Published: (2026)
Knowing When to Ask: Segment-Level Credit Assignment for LLM Tool Use
by: Kumar, Abhijit, et al.
Published: (2026)
by: Kumar, Abhijit, et al.
Published: (2026)
MURPHY: Feedback-Aware GRPO with Retrospective Credit Assignment for Multi-Turn Code Generation
by: Ekbote, Chanakya, et al.
Published: (2025)
by: Ekbote, Chanakya, et al.
Published: (2025)
GRPO-$λ$: Credit Assignment improves LLM Reasoning
by: Parthasarathi, Prasanna, et al.
Published: (2025)
by: Parthasarathi, Prasanna, et al.
Published: (2025)
Beyond Uniform Credit: Causal Credit Assignment for Policy Optimization
by: Khandoga, Mykola, et al.
Published: (2026)
by: Khandoga, Mykola, et al.
Published: (2026)
DACA-GRPO: Denoising-Aware Credit Assignment for Reinforcement Learning in Diffusion Language Models
by: Monsefi, Amin Karimi, et al.
Published: (2026)
by: Monsefi, Amin Karimi, et al.
Published: (2026)
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
by: Low, Siow Meng, et al.
Published: (2025)
by: Low, Siow Meng, et al.
Published: (2025)
Focus Where It Matters: Graph Selective State Focused Attention Networks
by: Vashistha, Shikhar, et al.
Published: (2024)
by: Vashistha, Shikhar, et al.
Published: (2024)
Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning
by: Li, Ziheng, et al.
Published: (2026)
by: Li, Ziheng, et al.
Published: (2026)
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning
by: Yang, Matthew Y. R., et al.
Published: (2026)
by: Yang, Matthew Y. R., et al.
Published: (2026)
Deep Reinforcement Learning for Interference Suppression in RIS-Aided Space-Air-Ground Integrated Networks
by: Mamillapalli, Pujitha, et al.
Published: (2026)
by: Mamillapalli, Pujitha, et al.
Published: (2026)
ICA: Information-Aware Credit Assignment for Visually Grounded Long-Horizon Information-Seeking Agents
by: Pang, Cong, et al.
Published: (2026)
by: Pang, Cong, et al.
Published: (2026)
Intrinsic Credit Assignment for Long Horizon Interaction
by: Auzina, Ilze Amanda, et al.
Published: (2026)
by: Auzina, Ilze Amanda, et al.
Published: (2026)
Semantic Voting: Execution-Grounded Consensus for LLM Code Generation
by: Jiang, Shan, et al.
Published: (2026)
by: Jiang, Shan, et al.
Published: (2026)
Score Broadcast and Decorrelation: A General Framework for Broadcast-Based Credit Assignment
by: Uzun, Mustafa, et al.
Published: (2026)
by: Uzun, Mustafa, et al.
Published: (2026)
In-Context Credit Assignment via the Core
by: Harris, Keegan, et al.
Published: (2026)
by: Harris, Keegan, et al.
Published: (2026)
CAPO: Towards Enhancing LLM Reasoning through Generative Credit Assignment
by: Xie, Guofu, et al.
Published: (2025)
by: Xie, Guofu, et al.
Published: (2025)
Hindsight Credit Assignment for Long-Horizon LLM Agents
by: Tan, Hui-Ze, et al.
Published: (2026)
by: Tan, Hui-Ze, et al.
Published: (2026)
Exact Is Easier: Credit Assignment for Cooperative LLM Agents
by: Chen, Yanjun, et al.
Published: (2026)
by: Chen, Yanjun, et al.
Published: (2026)
Cardinality-Preserving Attention Channels for Graph Transformers in Molecular Property Prediction
by: Gupta, Abhijit
Published: (2026)
by: Gupta, Abhijit
Published: (2026)
Forward Target Propagation: A Forward-Only Approach to Global Error Credit Assignment via Local Losses
by: As-Saquib, Nazmus Saadat, et al.
Published: (2025)
by: As-Saquib, Nazmus Saadat, et al.
Published: (2025)
CardiGraphormer: Unveiling the Power of Self-Supervised Learning in Revolutionizing Drug Discovery
by: Gupta, Abhijit
Published: (2023)
by: Gupta, Abhijit
Published: (2023)
Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models
by: Chen, Wen-Tse, et al.
Published: (2026)
by: Chen, Wen-Tse, et al.
Published: (2026)
RTMC: Step-Level Credit Assignment via Rollout Trees
by: Wang, Tao, et al.
Published: (2026)
by: Wang, Tao, et al.
Published: (2026)
Beyond Uniform Credit Assignment: Selective Eligibility Traces for RLVR
by: Mou, Chaoli, et al.
Published: (2026)
by: Mou, Chaoli, et al.
Published: (2026)
VinePPO: Refining Credit Assignment in RL Training of LLMs
by: Kazemnejad, Amirhossein, et al.
Published: (2024)
by: Kazemnejad, Amirhossein, et al.
Published: (2024)
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
by: Ramesh, Aditya A., et al.
Published: (2024)
by: Ramesh, Aditya A., et al.
Published: (2024)
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning
by: Pignatelli, Eduardo, et al.
Published: (2023)
by: Pignatelli, Eduardo, et al.
Published: (2023)
Credit Assignment via Neural Manifold Noise Correlation
by: Kang, Byungwoo, et al.
Published: (2026)
by: Kang, Byungwoo, et al.
Published: (2026)
COSAC: Counterfactual Credit Assignment in Sequential Cooperative Teams
by: Deshmukh, Shripad, et al.
Published: (2026)
by: Deshmukh, Shripad, et al.
Published: (2026)
Linear Predictability of Attention Heads in Large Language Models
by: Shaikh, Khalid, et al.
Published: (2026)
by: Shaikh, Khalid, et al.
Published: (2026)
Prediction of good reaction coordinates and future evolution of MD trajectories using Regularized Sparse Autoencoders: A novel deep learning approach
by: Gupta, Abhijit
Published: (2022)
by: Gupta, Abhijit
Published: (2022)
Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation
by: Barnes, Jarrod
Published: (2026)
by: Barnes, Jarrod
Published: (2026)
ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?
by: Nangia, Ayush, et al.
Published: (2026)
by: Nangia, Ayush, et al.
Published: (2026)
ARCA: Adapter-Residual Credit Assignment When Token Signals Degenerate
by: Lafuente-Mercado, Rodney
Published: (2026)
by: Lafuente-Mercado, Rodney
Published: (2026)
DGPO: Distribution Guided Policy Optimization for Fine Grained Credit Assignment
by: Jin, Hongbo, et al.
Published: (2026)
by: Jin, Hongbo, et al.
Published: (2026)
MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment
by: Wang, Ziyan, et al.
Published: (2023)
by: Wang, Ziyan, et al.
Published: (2023)
Search-Based Credit Assignment for Offline Preference-Based Reinforcement Learning
by: Gao, Xiancheng, et al.
Published: (2025)
by: Gao, Xiancheng, et al.
Published: (2025)
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
by: Qu, Yun, et al.
Published: (2024)
by: Qu, Yun, et al.
Published: (2024)
Reducing Credit Assignment Variance via Counterfactual Reasoning Paths
by: Ding, Fei, et al.
Published: (2026)
by: Ding, Fei, et al.
Published: (2026)
Similar Items
-
Stepwise Credit Assignment for GRPO on Flow-Matching Models
by: Savani, Yash, et al.
Published: (2026) -
Knowing When to Ask: Segment-Level Credit Assignment for LLM Tool Use
by: Kumar, Abhijit, et al.
Published: (2026) -
MURPHY: Feedback-Aware GRPO with Retrospective Credit Assignment for Multi-Turn Code Generation
by: Ekbote, Chanakya, et al.
Published: (2025) -
GRPO-$λ$: Credit Assignment improves LLM Reasoning
by: Parthasarathi, Prasanna, et al.
Published: (2025) -
Beyond Uniform Credit: Causal Credit Assignment for Policy Optimization
by: Khandoga, Mykola, et al.
Published: (2026)