:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Wang, Yuhui, Wu, Qingyuan, Ashley, Dylan R., Faccio, Francesco, Li, Weida, Huang, Chao, Schmidhuber, Jürgen
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Machine Learning Artificial Intelligence I.2.6
Accesso online:	https://arxiv.org/abs/2406.08404
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Learning Useful Representations of Recurrent Neural Network Weight Matrices
di: Herrmann, Vincent, et al.
Pubblicazione: (2024)

Upside Down Reinforcement Learning with Policy Generators
di: Di Ventura, Jacopo, et al.
Pubblicazione: (2025)

Towards a Robust Soft Baby Robot With Rich Interaction Ability for Advanced Machine Learning Algorithms
di: Alhakami, Mohannad, et al.
Pubblicazione: (2024)

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization
di: Dai, Yanning, et al.
Pubblicazione: (2026)

On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers
di: Štrupl, Miroslav, et al.
Pubblicazione: (2025)

Highway Value Iteration Networks
di: Wang, Yuhui, et al.
Pubblicazione: (2024)

How to Correctly do Semantic Backpropagation on Language-based Agentic Systems
di: Wang, Wenyi, et al.
Pubblicazione: (2024)

Automatic Album Sequencing
di: Herrmann, Vincent, et al.
Pubblicazione: (2024)

Measuring In-Context Computation Complexity via Hidden State Prediction
di: Herrmann, Vincent, et al.
Pubblicazione: (2025)

Interestingness as an Inductive Heuristic for Future Compression Progress
di: Herrmann, Vincent, et al.
Pubblicazione: (2026)

Multiple Token Divergence: Measuring and Steering In-Context Computation Density
di: Herrmann, Vincent, et al.
Pubblicazione: (2025)

Rewarding Beliefs, Not Actions: Consistency-Guided Credit Assignment for Long-Horizon Agents
di: Tang, Wenjie, et al.
Pubblicazione: (2026)

Fractional Policy Gradients: Reinforcement Learning with Long-Term Memory
di: Pawar, Urvi, et al.
Pubblicazione: (2025)

$μ$PC: Scaling Predictive Coding to 100+ Layer Networks
di: Innocenti, Francesco, et al.
Pubblicazione: (2025)

RPRA: Predicting an LLM-Judge for Efficient but Performant Inference
di: Ashley, Dylan R., et al.
Pubblicazione: (2026)

Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory
di: Jiang, Rongjie, et al.
Pubblicazione: (2026)

Towards Scaling Deep Neural Networks with Predictive Coding: Theory and Practice
di: Innocenti, Francesco
Pubblicazione: (2025)

Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers
di: Gray, Gavia, et al.
Pubblicazione: (2024)

Graph Neural Network Based Action Ranking for Planning
di: Mangannavar, Rajesh, et al.
Pubblicazione: (2024)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
di: Fadli, Samih
Pubblicazione: (2025)

Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning
di: Castellini, Jacopo, et al.
Pubblicazione: (2019)

Backpropagation Through Time For Networks With Long-Term Dependencies
di: Bird, George, et al.
Pubblicazione: (2021)

Synergizing Deep Learning and Biological Heuristics for Extreme Long-Tail White Blood Cell Classification
di: Nguyen, Duc T., et al.
Pubblicazione: (2026)

Fusing Rewards and Preferences in Reinforcement Learning
di: Khorasani, Sadegh, et al.
Pubblicazione: (2025)

Kolmogorov Arnold Networks and Multi-Layer Perceptrons: A Paradigm Shift in Neural Modelling
di: Gaonkar, Aradhya, et al.
Pubblicazione: (2026)

RACAS: Controlling Diverse Robots With a Single Agentic System
di: Ashley, Dylan R., et al.
Pubblicazione: (2026)

Versatile Ordering Network: An Attention-based Neural Network for Ordering Across Scales and Quality Metrics
di: Yu, Zehua, et al.
Pubblicazione: (2024)

AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
di: Yousaf, Iqra
Pubblicazione: (2024)

Are We Winning the Wrong Game? Revisiting Evaluation Practices for Long-Term Time Series Forecasting
di: Phungtua-eng, Thanapol, et al.
Pubblicazione: (2026)

Expressivity of Graph Neural Networks Through the Lens of Adversarial Robustness
di: Campi, Francesco, et al.
Pubblicazione: (2023)

TelePlanNet: An AI-Driven Framework for Efficient Telecom Network Planning
di: Deng, Zongyuan, et al.
Pubblicazione: (2025)

FedWCM: Unleashing the Potential of Momentum-based Federated Learning in Long-Tailed Scenarios
di: Li, Tianle, et al.
Pubblicazione: (2025)

Mindstorms in Natural Language-Based Societies of Mind
di: Zhuge, Mingchen, et al.
Pubblicazione: (2023)

CaMeRL: Collision-Aware and Memory-Enhanced Reinforcement Learning for UAV Navigation in Multi-Scale Obstacle Environments
di: Hong, Hong, et al.
Pubblicazione: (2026)

Distributed Value Decomposition Networks with Networked Agents
di: Varela, Guilherme S., et al.
Pubblicazione: (2025)

Composing Linear Layers from Irreducibles
di: Pence, Travis, et al.
Pubblicazione: (2025)

Explaining Temporal Graph Predictions With Shapley Values
di: Sussek, Lea-Marie, et al.
Pubblicazione: (2026)

Reinforcement Learning-Based Energy-Aware Coverage Path Planning for Precision Agriculture
di: Wu, Beining, et al.
Pubblicazione: (2026)

Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
di: Hedar, Abdel-Rahman, et al.
Pubblicazione: (2024)

TRIM: Achieving Extreme Sparsity with Targeted Row-wise Iterative Metric-driven Pruning
di: Beck, Florentin, et al.
Pubblicazione: (2025)