Saved in:
| Main Authors: | Mahajan, Mihir, Nguyen, Alfred, Srambical, Franz, Bauer, Stefan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.27002 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases
by: Wong, Sherman, et al.
Published: (2025)
by: Wong, Sherman, et al.
Published: (2025)
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX
by: Bonnet, Clément, et al.
Published: (2023)
by: Bonnet, Clément, et al.
Published: (2023)
JaxARC: A High-Performance JAX-based Environment for Abstraction and Reasoning Research
by: Aadam, et al.
Published: (2026)
by: Aadam, et al.
Published: (2026)
Simple, Good, Fast: Self-Supervised World Models Free of Baggage
by: Robine, Jan, et al.
Published: (2025)
by: Robine, Jan, et al.
Published: (2025)
laplax -- Laplace Approximations with JAX
by: Weber, Tobias, et al.
Published: (2025)
by: Weber, Tobias, et al.
Published: (2025)
PuzzleJAX: A Benchmark for Reasoning and Learning
by: Earle, Sam, et al.
Published: (2025)
by: Earle, Sam, et al.
Published: (2025)
CAX: Cellular Automata Accelerated in JAX
by: Faldor, Maxence, et al.
Published: (2024)
by: Faldor, Maxence, et al.
Published: (2024)
minimax: Efficient Baselines for Autocurricula in JAX
by: Jiang, Minqi, et al.
Published: (2023)
by: Jiang, Minqi, et al.
Published: (2023)
NAVIX: Scaling MiniGrid Environments with JAX
by: Pignatelli, Eduardo, et al.
Published: (2024)
by: Pignatelli, Eduardo, et al.
Published: (2024)
A Simple and Scalable Representation for Graph Generation
by: Jang, Yunhui, et al.
Published: (2023)
by: Jang, Yunhui, et al.
Published: (2023)
Uncertainty Quantification for Gradient-based Explanations in Neural Networks
by: Mulye, Mihir, et al.
Published: (2024)
by: Mulye, Mihir, et al.
Published: (2024)
Marginals Before Conditionals
by: Sahasrabudhe, Mihir
Published: (2026)
by: Sahasrabudhe, Mihir
Published: (2026)
Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX
by: Nishimori, Soichiro, et al.
Published: (2026)
by: Nishimori, Soichiro, et al.
Published: (2026)
Learning Bug Context for PyTorch-to-JAX Translation with LLMs
by: Phan, Hung, et al.
Published: (2025)
by: Phan, Hung, et al.
Published: (2025)
G-Core: A Simple, Scalable and Balanced RLHF Trainer
by: Wu, Junyu, et al.
Published: (2025)
by: Wu, Junyu, et al.
Published: (2025)
Chargax: A JAX Accelerated EV Charging Simulator
by: Ponse, Koen, et al.
Published: (2025)
by: Ponse, Koen, et al.
Published: (2025)
Training Agents Inside of Scalable World Models
by: Hafner, Danijar, et al.
Published: (2025)
by: Hafner, Danijar, et al.
Published: (2025)
Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces
by: Mahajan, Pranav, et al.
Published: (2026)
by: Mahajan, Pranav, et al.
Published: (2026)
Simple and Scalable Strategies to Continually Pre-train Large Language Models
by: Ibrahim, Adam, et al.
Published: (2024)
by: Ibrahim, Adam, et al.
Published: (2024)
WeChat-YATT: A Scalable, Simple, Efficient, and Production Ready Training Library
by: Wu, Junyu, et al.
Published: (2025)
by: Wu, Junyu, et al.
Published: (2025)
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
by: Rutherford, Alexander, et al.
Published: (2023)
by: Rutherford, Alexander, et al.
Published: (2023)
Teleporter Theory: A General and Simple Approach for Modeling Cross-World Counterfactual Causality
by: Li, Jiangmeng, et al.
Published: (2024)
by: Li, Jiangmeng, et al.
Published: (2024)
CaRL: Learning Scalable Planning Policies with Simple Rewards
by: Jaeger, Bernhard, et al.
Published: (2025)
by: Jaeger, Bernhard, et al.
Published: (2025)
Scaling Is All You Need: Autonomous Driving with JAX-Accelerated Reinforcement Learning
by: Harmel, Moritz, et al.
Published: (2023)
by: Harmel, Moritz, et al.
Published: (2023)
Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization
by: Berasi, Davide, et al.
Published: (2026)
by: Berasi, Davide, et al.
Published: (2026)
Sanity Checks for Explanation Uncertainty
by: Valdenegro-Toro, Matias, et al.
Published: (2024)
by: Valdenegro-Toro, Matias, et al.
Published: (2024)
JPC: Flexible Inference for Predictive Coding Networks in JAX
by: Innocenti, Francesco, et al.
Published: (2024)
by: Innocenti, Francesco, et al.
Published: (2024)
Higher Embedding Dimension Creates a Stronger World Model for a Simple Sorting Task
by: Bhalla, Brady, et al.
Published: (2025)
by: Bhalla, Brady, et al.
Published: (2025)
Generalized Orders of Magnitude for Scalable, Parallel, High-Dynamic-Range Computation
by: Heinsen, Franz A., et al.
Published: (2025)
by: Heinsen, Franz A., et al.
Published: (2025)
Is Implicit Knowledge Enough for LLMs? A RAG Approach for Tree-based Structures
by: Gupte, Mihir, et al.
Published: (2025)
by: Gupte, Mihir, et al.
Published: (2025)
M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models
by: Maheshwary, Rishabh, et al.
Published: (2024)
by: Maheshwary, Rishabh, et al.
Published: (2024)
Self-Questioning Language Models
by: Chen, Lili, et al.
Published: (2025)
by: Chen, Lili, et al.
Published: (2025)
Consistency Models for Scalable and Fast Simulation-Based Inference
by: Schmitt, Marvin, et al.
Published: (2023)
by: Schmitt, Marvin, et al.
Published: (2023)
Scalable AI Inference: Performance Analysis and Optimization of AI Model Serving
by: Pham, Hung Cuong, et al.
Published: (2026)
by: Pham, Hung Cuong, et al.
Published: (2026)
Learning Hidden Markov Models Using Conditional Samples
by: Kakade, Sham M., et al.
Published: (2023)
by: Kakade, Sham M., et al.
Published: (2023)
Amortized Active Causal Induction with Deep Reinforcement Learning
by: Annadani, Yashas, et al.
Published: (2024)
by: Annadani, Yashas, et al.
Published: (2024)
Future-as-Label: Scalable Supervision from Real-World Outcomes
by: Turtel, Benjamin, et al.
Published: (2026)
by: Turtel, Benjamin, et al.
Published: (2026)
SIMU: Selective Influence Machine Unlearning
by: Agarwal, Anu, et al.
Published: (2025)
by: Agarwal, Anu, et al.
Published: (2025)
PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization
by: Hottung, André, et al.
Published: (2024)
by: Hottung, André, et al.
Published: (2024)
Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation
by: Mahajan, Divyat, et al.
Published: (2022)
by: Mahajan, Divyat, et al.
Published: (2022)
Similar Items
-
Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases
by: Wong, Sherman, et al.
Published: (2025) -
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX
by: Bonnet, Clément, et al.
Published: (2023) -
JaxARC: A High-Performance JAX-based Environment for Abstraction and Reasoning Research
by: Aadam, et al.
Published: (2026) -
Simple, Good, Fast: Self-Supervised World Models Free of Baggage
by: Robine, Jan, et al.
Published: (2025) -
laplax -- Laplace Approximations with JAX
by: Weber, Tobias, et al.
Published: (2025)