:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Yuhui, Li, Weida, Faccio, Francesco, Wu, Qingyuan, Schmidhuber, Jürgen
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.03485
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
by: Wang, Yuhui, et al.
Published: (2024)

Highway Reinforcement Learning
by: Wang, Yuhui, et al.
Published: (2024)

Curious Causality-Seeking Agents Learn Meta Causal World
by: Zhao, Zhiyu, et al.
Published: (2025)

Language Agents as Optimizable Graphs
by: Zhuge, Mingchen, et al.
Published: (2024)

Upside Down Reinforcement Learning with Policy Generators
by: Di Ventura, Jacopo, et al.
Published: (2025)

Learning Useful Representations of Recurrent Neural Network Weight Matrices
by: Herrmann, Vincent, et al.
Published: (2024)

Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays
by: Wu, Qingyuan, et al.
Published: (2024)

Towards a Robust Soft Baby Robot With Rich Interaction Ability for Advanced Machine Learning Algorithms
by: Alhakami, Mohannad, et al.
Published: (2024)

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization
by: Dai, Yanning, et al.
Published: (2026)

Interestingness as an Inductive Heuristic for Future Compression Progress
by: Herrmann, Vincent, et al.
Published: (2026)

On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers
by: Štrupl, Miroslav, et al.
Published: (2025)

FACTS: A Factored State-Space Framework For World Modelling
by: Nanbo, Li, et al.
Published: (2024)

How to Correctly do Semantic Backpropagation on Language-based Agentic Systems
by: Wang, Wenyi, et al.
Published: (2024)

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
by: Ramesh, Aditya A., et al.
Published: (2024)

MeSH: Memory-as-State-Highways for Recursive Transformers
by: Yu, Chengting, et al.
Published: (2025)

Variational Delayed Policy Optimization
by: Wu, Qingyuan, et al.
Published: (2024)

Fairness Overfitting in Machine Learning: An Information-Theoretic Perspective
by: Laakom, Firas, et al.
Published: (2025)

Fast and scalable retrosynthetic planning with a transformer neural network and speculative beam search
by: Andronov, Mikhail, et al.
Published: (2025)

Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
by: Bai, Chenjia, et al.
Published: (2024)

A Unified Framework for Rethinking Policy Divergence Measures in GRPO
by: Wu, Qingyuan, et al.
Published: (2026)

Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
by: Gopalakrishnan, Anand, et al.
Published: (2025)

Cross-Modal Reconstruction Pretraining for Ramp Flow Prediction at Highway Interchanges
by: Li, Yongchao, et al.
Published: (2025)

PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors
by: Chen, Yimeng, et al.
Published: (2025)

Hybrid LSTM-Transformer Models for Profiling Highway-Railway Grade Crossings
by: Chatterjee, Kaustav, et al.
Published: (2025)

Accelerating the inference of string generation-based chemical reaction models for industrial applications
by: Andronov, Mikhail, et al.
Published: (2024)

Stop-RAG: Value-Based Retrieval Control for Iterative RAG
by: Park, Jaewan, et al.
Published: (2025)

Evolutionary Guided Decoding: Iterative Value Refinement for LLMs
by: Liu, Zhenhua, et al.
Published: (2025)

Multi-View Subgraph Neural Networks: Self-Supervised Learning with Scarce Labeled Data
by: Wang, Zhenzhong, et al.
Published: (2024)

Counterfactual explainability and analysis of variance
by: Gao, Zijun, et al.
Published: (2024)

Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning
by: Wu, Qingyuan, et al.
Published: (2025)

MoEUT: Mixture-of-Experts Universal Transformers
by: Csordás, Róbert, et al.
Published: (2024)

Concurrent Learning with Aggregated States via Randomized Least Squares Value Iteration
by: Chen, Yan, et al.
Published: (2025)

Heterogeneous Self-Play for Realistic Highway Traffic Simulation
by: Qiu, Jinkai, et al.
Published: (2026)

Resolve Highway Conflict in Multi-Autonomous Vehicle Controls with Local State Attention
by: Ta, Xuan Duy, et al.
Published: (2025)

Highway Networks for Improved Surface Reconstruction: The Role of Residuals and Weight Updates
by: Noorizadegan, A., et al.
Published: (2024)

Significativity Indices for Agreement Values
by: Casagrande, Alberto, et al.
Published: (2025)

Set-Valued Sensitivity Analysis of Deep Neural Networks
by: Wang, Xin, et al.
Published: (2024)

VC-Soup: Value-Consistency Guided Multi-Value Alignment for Large Language Models
by: Xu, Hefei, et al.
Published: (2026)

Shift-Invariant Attribute Scoring for Kolmogorov-Arnold Networks via Shapley Value
by: Fan, Wangxuan, et al.
Published: (2025)

Iterative Inference in a Chess-Playing Neural Network
by: Sandmann, Elias, et al.
Published: (2025)