| Main Authors: | Wang, Yuhui, Li, Weida, Faccio, Francesco, Wu, Qingyuan, Schmidhuber, Jürgen |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.03485 |
Similar Items
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
by: Wang, Yuhui, et al.
Published: (2024)
Highway Reinforcement Learning
by: Wang, Yuhui, et al.
Published: (2024)
Curious Causality-Seeking Agents Learn Meta Causal World
by: Zhao, Zhiyu, et al.
Published: (2025)
Language Agents as Optimizable Graphs
by: Zhuge, Mingchen, et al.
Published: (2024)
Upside Down Reinforcement Learning with Policy Generators
by: Di Ventura, Jacopo, et al.
Published: (2025)
Learning Useful Representations of Recurrent Neural Network Weight Matrices
by: Herrmann, Vincent, et al.
Published: (2024)
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays
by: Wu, Qingyuan, et al.
Published: (2024)
Towards a Robust Soft Baby Robot With Rich Interaction Ability for Advanced Machine Learning Algorithms
by: Alhakami, Mohannad, et al.
Published: (2024)
Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization
by: Dai, Yanning, et al.
Published: (2026)
Interestingness as an Inductive Heuristic for Future Compression Progress
by: Herrmann, Vincent, et al.
Published: (2026)
On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers
by: Štrupl, Miroslav, et al.
Published: (2025)
FACTS: A Factored State-Space Framework For World Modelling
by: Nanbo, Li, et al.
Published: (2024)
How to Correctly do Semantic Backpropagation on Language-based Agentic Systems
by: Wang, Wenyi, et al.
Published: (2024)
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
by: Ramesh, Aditya A., et al.
Published: (2024)
MeSH: Memory-as-State-Highways for Recursive Transformers
by: Yu, Chengting, et al.
Published: (2025)
Variational Delayed Policy Optimization
by: Wu, Qingyuan, et al.
Published: (2024)
Fairness Overfitting in Machine Learning: An Information-Theoretic Perspective
by: Laakom, Firas, et al.
Published: (2025)
Fast and scalable retrosynthetic planning with a transformer neural network and speculative beam search
by: Andronov, Mikhail, et al.
Published: (2025)
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
by: Bai, Chenjia, et al.
Published: (2024)
A Unified Framework for Rethinking Policy Divergence Measures in GRPO
by: Wu, Qingyuan, et al.
Published: (2026)
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
by: Gopalakrishnan, Anand, et al.
Published: (2025)
Cross-Modal Reconstruction Pretraining for Ramp Flow Prediction at Highway Interchanges
by: Li, Yongchao, et al.
Published: (2025)
PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors
by: Chen, Yimeng, et al.
Published: (2025)
Hybrid LSTM-Transformer Models for Profiling Highway-Railway Grade Crossings
by: Chatterjee, Kaustav, et al.
Published: (2025)
Accelerating the inference of string generation-based chemical reaction models for industrial applications
by: Andronov, Mikhail, et al.
Published: (2024)
Stop-RAG: Value-Based Retrieval Control for Iterative RAG
by: Park, Jaewan, et al.
Published: (2025)
Evolutionary Guided Decoding: Iterative Value Refinement for LLMs
by: Liu, Zhenhua, et al.
Published: (2025)
Multi-View Subgraph Neural Networks: Self-Supervised Learning with Scarce Labeled Data
by: Wang, Zhenzhong, et al.
Published: (2024)
Counterfactual explainability and analysis of variance
by: Gao, Zijun, et al.
Published: (2024)
Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning
by: Wu, Qingyuan, et al.
Published: (2025)
MoEUT: Mixture-of-Experts Universal Transformers
by: Csordás, Róbert, et al.
Published: (2024)
Concurrent Learning with Aggregated States via Randomized Least Squares Value Iteration
by: Chen, Yan, et al.
Published: (2025)
Heterogeneous Self-Play for Realistic Highway Traffic Simulation
by: Qiu, Jinkai, et al.
Published: (2026)
Resolve Highway Conflict in Multi-Autonomous Vehicle Controls with Local State Attention
by: Ta, Xuan Duy, et al.
Published: (2025)
Highway Networks for Improved Surface Reconstruction: The Role of Residuals and Weight Updates
by: Noorizadegan, A., et al.
Published: (2024)
Significativity Indices for Agreement Values
by: Casagrande, Alberto, et al.
Published: (2025)
Set-Valued Sensitivity Analysis of Deep Neural Networks
by: Wang, Xin, et al.
Published: (2024)
VC-Soup: Value-Consistency Guided Multi-Value Alignment for Large Language Models
by: Xu, Hefei, et al.
Published: (2026)
Shift-Invariant Attribute Scoring for Kolmogorov-Arnold Networks via Shapley Value
by: Fan, Wangxuan, et al.
Published: (2025)
Iterative Inference in a Chess-Playing Neural Network
by: Sandmann, Elias, et al.
Published: (2025)