Saved in:
| Main Authors: | Liao, Junyi, Zhu, Zihan, Fang, Ethan, Yang, Zhuoran, Tarokh, Vahid |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.12707 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SUN Team's Contribution to ABAW 2024 Competition: Audio-visual Valence-Arousal Estimation and Expression Recognition
by: Dresvyanskiy, Denis, et al.
Published: (2024)
by: Dresvyanskiy, Denis, et al.
Published: (2024)
Shrinkage Initialization for Smooth Learning of Neural Networks
by: Cheng, Miao, et al.
Published: (2025)
by: Cheng, Miao, et al.
Published: (2025)
4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles
by: Zeytuncu, Yunus E.
Published: (2026)
by: Zeytuncu, Yunus E.
Published: (2026)
Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models
by: Tan, Chee Wei, et al.
Published: (2026)
by: Tan, Chee Wei, et al.
Published: (2026)
Hybrid-AIRL: Enhancing Inverse Reinforcement Learning with Supervised Expert Guidance
by: Silue, Bram, et al.
Published: (2025)
by: Silue, Bram, et al.
Published: (2025)
One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents
by: Hong, Yoosung
Published: (2026)
by: Hong, Yoosung
Published: (2026)
BandiK: Efficient Multi-Task Decomposition Using a Multi-Bandit Framework
by: Millinghoffer, András, et al.
Published: (2025)
by: Millinghoffer, András, et al.
Published: (2025)
Semi-overlapping Multi-bandit Best Arm Identification for Sequential Support Network Learning
by: Antos, András, et al.
Published: (2025)
by: Antos, András, et al.
Published: (2025)
Amortized Molecular Optimization via Group Relative Policy Optimization
by: Javaid, Muhammad bin, et al.
Published: (2026)
by: Javaid, Muhammad bin, et al.
Published: (2026)
Kolmogorov Arnold Networks and Multi-Layer Perceptrons: A Paradigm Shift in Neural Modelling
by: Gaonkar, Aradhya, et al.
Published: (2026)
by: Gaonkar, Aradhya, et al.
Published: (2026)
Distributional Reinforcement Learning for Condition-Based Maintenance of Multi-Pump Equipment
by: Yasuno, Takato
Published: (2026)
by: Yasuno, Takato
Published: (2026)
SQARL: A Size-Agnostic Reinforcement Learning approach for Circuit Allocation in Distributed Quantum Architectures
by: Carballo, Víctor, et al.
Published: (2026)
by: Carballo, Víctor, et al.
Published: (2026)
Learning Controllable and Diverse Player Behaviors in Multi-Agent Environments
by: Cilan, Atahan, et al.
Published: (2025)
by: Cilan, Atahan, et al.
Published: (2025)
Perfecting Aircraft Maneuvers with Reinforcement Learning
by: Cilan, Atahan, et al.
Published: (2026)
by: Cilan, Atahan, et al.
Published: (2026)
A survey of air combat behavior modeling using machine learning
by: Gorton, Patrick Ribu, et al.
Published: (2024)
by: Gorton, Patrick Ribu, et al.
Published: (2024)
Fusing Rewards and Preferences in Reinforcement Learning
by: Khorasani, Sadegh, et al.
Published: (2025)
by: Khorasani, Sadegh, et al.
Published: (2025)
Application of Sensitivity Analysis Methods for Studying Neural Network Models
by: Miao, Jiaxuan, et al.
Published: (2025)
by: Miao, Jiaxuan, et al.
Published: (2025)
AI Agents for the Dhumbal Card Game: A Comparative Study
by: Malla, Sahaj Raj
Published: (2025)
by: Malla, Sahaj Raj
Published: (2025)
LLMs for Game Theory: Entropy-Guided In-Context Learning and Adaptive CoT Reasoning
by: Banfi, Tommaso Felice, et al.
Published: (2026)
by: Banfi, Tommaso Felice, et al.
Published: (2026)
Model Fusion via Retrofitting
by: Luenam, Phoomraphee, et al.
Published: (2025)
by: Luenam, Phoomraphee, et al.
Published: (2025)
From Theory to Practice with RAVEN-UCB: Addressing Non-Stationarity in Multi-Armed Bandits through Variance Adaptation
by: Fang, Junyi, et al.
Published: (2025)
by: Fang, Junyi, et al.
Published: (2025)
Compositional Concept-Based Neuron-Level Interpretability for Deep Reinforcement Learning
by: Jiang, Zeyu, et al.
Published: (2025)
by: Jiang, Zeyu, et al.
Published: (2025)
Hierarchical Pooling and Explainability in Graph Neural Networks for Tumor and Tissue-of-Origin Classification Using RNA-seq Data
by: Fontanari, Thomas Vaitses, et al.
Published: (2026)
by: Fontanari, Thomas Vaitses, et al.
Published: (2026)
Learning Rate Engineering: From Coarse Single Parameter to Layered Evolution
by: Yao, Ming-Hong, et al.
Published: (2026)
by: Yao, Ming-Hong, et al.
Published: (2026)
Aletheia: Quantifying Cognitive Conviction in Reasoning Models via Regularized Inverse Confusion Matrix
by: Fu, Fanzhe
Published: (2026)
by: Fu, Fanzhe
Published: (2026)
Fixing the Double Penalty in Data-Driven Weather Forecasting Through a Modified Spherical Harmonic Loss Function
by: Subich, Christopher, et al.
Published: (2025)
by: Subich, Christopher, et al.
Published: (2025)
Unsupervised Discovery of Clinical Disease Signatures Using Probabilistic Independence
by: Lasko, Thomas A., et al.
Published: (2024)
by: Lasko, Thomas A., et al.
Published: (2024)
Atom dimension adaptation for infinite set dictionary learning
by: Băltoiu, Andra, et al.
Published: (2024)
by: Băltoiu, Andra, et al.
Published: (2024)
Assessing the Performance-Efficiency Trade-off of Foundation Models in Probabilistic Electricity Price Forecasting
by: Lettner, Jan Niklas, et al.
Published: (2026)
by: Lettner, Jan Niklas, et al.
Published: (2026)
Predictable Gradient Manifolds in Deep Learning: Temporal Path-Length and Intrinsic Rank as a Complexity Regime
by: Calvo, Anherutowa
Published: (2026)
by: Calvo, Anherutowa
Published: (2026)
Recurrent Memory-Augmented Transformers with Chunked Attention for Long-Context Language Modeling
by: Kashyap, Ankit
Published: (2025)
by: Kashyap, Ankit
Published: (2025)
StepScorer: Accelerating Reinforcement Learning with Step-wise Scoring and Psychological Regret Modeling
by: Xu, Zhe
Published: (2026)
by: Xu, Zhe
Published: (2026)
Multi-Agent Pathfinding with Non-Unit Integer Edge Costs via Enhanced Conflict-Based Search and Graph Discretization
by: Fan, Hongkai, et al.
Published: (2026)
by: Fan, Hongkai, et al.
Published: (2026)
CART-ELC: Oblique Decision Tree Induction via Exhaustive Search
by: Laack, Andrew D.
Published: (2025)
by: Laack, Andrew D.
Published: (2025)
VGC-Bench: Towards Mastering Diverse Team Strategies in Competitive Pokémon
by: Angliss, Cameron, et al.
Published: (2025)
by: Angliss, Cameron, et al.
Published: (2025)
Augmenting deep neural networks with symbolic knowledge: Towards trustworthy and interpretable AI for education
by: Hooshyar, Danial, et al.
Published: (2023)
by: Hooshyar, Danial, et al.
Published: (2023)
CoGraM: Context-sensitive granular optimization method with rollback for robust model fusion
by: Lenz, Julius
Published: (2025)
by: Lenz, Julius
Published: (2025)
Improved Performances and Motivation in Intelligent Tutoring Systems: Combining Machine Learning and Learner Choice
by: Clément, Benjamin, et al.
Published: (2024)
by: Clément, Benjamin, et al.
Published: (2024)
Dense Neural Network Based Arrhythmia Classification on Low-cost and Low-compute Micro-controller
by: Zishan, Md Abu Obaida, et al.
Published: (2025)
by: Zishan, Md Abu Obaida, et al.
Published: (2025)
A Novel Loss Function for Deep Learning Based Daily Stock Trading System
by: Guo, Ruoyu, et al.
Published: (2025)
by: Guo, Ruoyu, et al.
Published: (2025)
Similar Items
-
SUN Team's Contribution to ABAW 2024 Competition: Audio-visual Valence-Arousal Estimation and Expression Recognition
by: Dresvyanskiy, Denis, et al.
Published: (2024) -
Shrinkage Initialization for Smooth Learning of Neural Networks
by: Cheng, Miao, et al.
Published: (2025) -
4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles
by: Zeytuncu, Yunus E.
Published: (2026) -
Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models
by: Tan, Chee Wei, et al.
Published: (2026) -
Hybrid-AIRL: Enhancing Inverse Reinforcement Learning with Supervised Expert Guidance
by: Silue, Bram, et al.
Published: (2025)