:: Library Catalog

Bibliographic Details
Main Authors:	Biré, Emilien; Santos, María; Yuan, Kai
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.22701

Similar Items

RankQ: Offline-to-Online Reinforcement Learning via Self-Supervised Action Ranking
by: Choi, Andrew, et al.
Published: (2026)

Q-function Decomposition with Intervention Semantics with Factored Action Spaces
by: Lee, Junkyu, et al.
Published: (2025)

Process Reward Model with Q-Value Rankings
by: Li, Wendi, et al.
Published: (2024)

GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning
by: Yu, Xiaoyang, et al.
Published: (2023)

ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
by: Zhao, Kai, et al.
Published: (2023)

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
by: Lin, Zongyu, et al.
Published: (2025)

Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors
by: Luo, Zhenglong, et al.
Published: (2024)

GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs
by: An, Selim, et al.
Published: (2026)

Stochastic Q-learning for Large Discrete Action Spaces
by: Fourati, Fares, et al.
Published: (2024)

Design and Evaluation of Cost-Aware PoQ for Decentralized LLM Inference
by: Tian, Arther, et al.
Published: (2025)

Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training
by: Ghosh, Ipsita, et al.
Published: (2025)

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning
by: Wang, Chaojie, et al.
Published: (2024)

Adaptive Action Chunking via Multi-Chunk Q Value Estimation
by: Shin, Yongjae, et al.
Published: (2026)

Inference of Deterministic Finite Automata via Q-Learning
by: Hosseinkhani, Elaheh, et al.
Published: (2025)

PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference
by: Wang, Qirui, et al.
Published: (2026)

Local Reinforcement Learning with Action-Conditioned Root Mean Squared Q-Functions
by: Wu, Frank, et al.
Published: (2025)

When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning
by: Dodeja, Lakshita, et al.
Published: (2026)

$β$-DQN: Improving Deep Q-Learning By Evolving the Behavior
by: Zhang, Hongming, et al.
Published: (2025)

Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
by: Yu, Xiaoyang, et al.
Published: (2024)

Coarse-to-fine Q-Network with Action Sequence for Data-Efficient Reinforcement Learning
by: Seo, Younggyo, et al.
Published: (2024)

Causal Deep Q Network
by: Khelifi, Elouanes, et al.
Published: (2025)

MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs
by: Liao, Junwei, et al.
Published: (2026)

Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
by: Jain, Ayush, et al.
Published: (2024)

Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning
by: Khan, Muhammad Junaid, et al.
Published: (2024)

Drift Q-Learning
by: Houssaini, Anas, et al.
Published: (2026)

Frictional Q-Learning
by: Kim, Hyunwoo, et al.
Published: (2025)

Flow Q-Learning
by: Park, Seohong, et al.
Published: (2025)

Decoupled Q-Chunking
by: Li, Qiyang, et al.
Published: (2025)

Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
by: Zhang, Ziqi, et al.
Published: (2023)

Chunk-Guided Q-Learning
by: Song, Gwanwoo, et al.
Published: (2026)

Periodic Regularized Q-Learning
by: Yang, Hyukjun, et al.
Published: (2026)

Deep Double Q-learning
by: Nagarajan, Prabhat, et al.
Published: (2025)

SQT -- std $Q$-target
by: Soffair, Nitsan, et al.
Published: (2024)

Scalable In-Context Q-Learning
by: Liu, Jinmei, et al.
Published: (2025)

LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ
by: Allard, Marc-Antoine, et al.
Published: (2024)

DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
by: Singh, Aditya Kumar, et al.
Published: (2026)

Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024)

Techniques to Improve Q&A Accuracy with Transformer-based models on Large Complex Documents
by: Liao, Chejui, et al.
Published: (2020)

Regularized Q-Learning with Linear Function Approximation
by: Xi, Jiachen, et al.
Published: (2024)

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game
by: Hu, Guangzheng, et al.
Published: (2024)