:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mahajan, Mihir, Nguyen, Alfred, Srambical, Franz, Bauer, Stefan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.27002
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases
by: Wong, Sherman, et al.
Published: (2025)

Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX
by: Bonnet, Clément, et al.
Published: (2023)

JaxARC: A High-Performance JAX-based Environment for Abstraction and Reasoning Research
by: Aadam, et al.
Published: (2026)

Simple, Good, Fast: Self-Supervised World Models Free of Baggage
by: Robine, Jan, et al.
Published: (2025)

laplax -- Laplace Approximations with JAX
by: Weber, Tobias, et al.
Published: (2025)

PuzzleJAX: A Benchmark for Reasoning and Learning
by: Earle, Sam, et al.
Published: (2025)

CAX: Cellular Automata Accelerated in JAX
by: Faldor, Maxence, et al.
Published: (2024)

minimax: Efficient Baselines for Autocurricula in JAX
by: Jiang, Minqi, et al.
Published: (2023)

NAVIX: Scaling MiniGrid Environments with JAX
by: Pignatelli, Eduardo, et al.
Published: (2024)

A Simple and Scalable Representation for Graph Generation
by: Jang, Yunhui, et al.
Published: (2023)

Uncertainty Quantification for Gradient-based Explanations in Neural Networks
by: Mulye, Mihir, et al.
Published: (2024)

Marginals Before Conditionals
by: Sahasrabudhe, Mihir
Published: (2026)

Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX
by: Nishimori, Soichiro, et al.
Published: (2026)

Learning Bug Context for PyTorch-to-JAX Translation with LLMs
by: Phan, Hung, et al.
Published: (2025)

G-Core: A Simple, Scalable and Balanced RLHF Trainer
by: Wu, Junyu, et al.
Published: (2025)

Chargax: A JAX Accelerated EV Charging Simulator
by: Ponse, Koen, et al.
Published: (2025)

Training Agents Inside of Scalable World Models
by: Hafner, Danijar, et al.
Published: (2025)

Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces
by: Mahajan, Pranav, et al.
Published: (2026)

Simple and Scalable Strategies to Continually Pre-train Large Language Models
by: Ibrahim, Adam, et al.
Published: (2024)

WeChat-YATT: A Scalable, Simple, Efficient, and Production Ready Training Library
by: Wu, Junyu, et al.
Published: (2025)

JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
by: Rutherford, Alexander, et al.
Published: (2023)

Teleporter Theory: A General and Simple Approach for Modeling Cross-World Counterfactual Causality
by: Li, Jiangmeng, et al.
Published: (2024)

CaRL: Learning Scalable Planning Policies with Simple Rewards
by: Jaeger, Bernhard, et al.
Published: (2025)

Scaling Is All You Need: Autonomous Driving with JAX-Accelerated Reinforcement Learning
by: Harmel, Moritz, et al.
Published: (2023)

Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization
by: Berasi, Davide, et al.
Published: (2026)

Sanity Checks for Explanation Uncertainty
by: Valdenegro-Toro, Matias, et al.
Published: (2024)

JPC: Flexible Inference for Predictive Coding Networks in JAX
by: Innocenti, Francesco, et al.
Published: (2024)

Higher Embedding Dimension Creates a Stronger World Model for a Simple Sorting Task
by: Bhalla, Brady, et al.
Published: (2025)

Generalized Orders of Magnitude for Scalable, Parallel, High-Dynamic-Range Computation
by: Heinsen, Franz A., et al.
Published: (2025)

Is Implicit Knowledge Enough for LLMs? A RAG Approach for Tree-based Structures
by: Gupte, Mihir, et al.
Published: (2025)

M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models
by: Maheshwary, Rishabh, et al.
Published: (2024)

Self-Questioning Language Models
by: Chen, Lili, et al.
Published: (2025)

Consistency Models for Scalable and Fast Simulation-Based Inference
by: Schmitt, Marvin, et al.
Published: (2023)

Scalable AI Inference: Performance Analysis and Optimization of AI Model Serving
by: Pham, Hung Cuong, et al.
Published: (2026)

Learning Hidden Markov Models Using Conditional Samples
by: Kakade, Sham M., et al.
Published: (2023)

Amortized Active Causal Induction with Deep Reinforcement Learning
by: Annadani, Yashas, et al.
Published: (2024)

Future-as-Label: Scalable Supervision from Real-World Outcomes
by: Turtel, Benjamin, et al.
Published: (2026)

SIMU: Selective Influence Machine Unlearning
by: Agarwal, Anu, et al.
Published: (2025)

PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization
by: Hottung, André, et al.
Published: (2024)

Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation
by: Mahajan, Divyat, et al.
Published: (2022)