| Main Authors: | Wu, Yinting, Peng, Pai, Cai, Bo, Li, Le |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.04070 |
Similar Items
Batch Bayesian Active Learning with Partial Batch Label Sampling
by: Hu, Kangping, et al.
Published: (2025)
How Does Critical Batch Size Scale in Pre-training?
by: Zhang, Hanlin, et al.
Published: (2024)
Riemannian Batch Normalization: A Gyro Approach
by: Chen, Ziheng, et al.
Published: (2025)
BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching
by: Zheng, Zhen, et al.
Published: (2024)
BatchTopK Sparse Autoencoders
by: Bussmann, Bart, et al.
Published: (2024)
Batch Normalization Amplifies Memorization and Privacy Risks
by: Doan, Ngoc Phu, et al.
Published: (2026)
Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization
by: Huang, Zixuan, et al.
Published: (2025)
Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
by: Bergsma, Shane, et al.
Published: (2025)
Batch and match: black-box variational inference with a score-based divergence
by: Cai, Diana, et al.
Published: (2024)
Making Batch Normalization Great in Federated Deep Learning
by: Zhong, Jike, et al.
Published: (2023)
Mini-Batch Class Composition Bias in Link Prediction
by: Maguire, Kieran, et al.
Published: (2026)
SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models
by: Wu, Juntong, et al.
Published: (2026)
Robust NAS under adversarial training: benchmark, theory, and beyond
by: Wu, Yongtao, et al.
Published: (2024)
A Simple and Efficient Approach to Batch Bayesian Optimization
by: Zhan, Dawei, et al.
Published: (2024)
Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization
by: Ahmadianshalchi, Alaleh, et al.
Published: (2024)
Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous Control
by: De Monte, Riccardo, et al.
Published: (2026)
StableGrad: Backward Scale Control without Batch Normalization
by: Mestre, Jose I., et al.
Published: (2026)
Beware of the Batch Size: Hyperparameter Bias in Evaluating LoRA
by: Lee, Sangyoon, et al.
Published: (2026)
Scaling Off-Policy Reinforcement Learning with Batch and Weight Normalization
by: Palenicek, Daniel, et al.
Published: (2025)
Revisiting LARS for Large Batch Training Generalization of Neural Networks
by: Do, Khoi, et al.
Published: (2023)
Scalable On-Policy Reinforcement Learning via Adaptive Batch Scaling
by: Park, Jongchan
Published: (2026)
Mini-Batch Kernel $k$-means
by: Jourdan, Ben, et al.
Published: (2024)
Batched Low-Rank Adaptation of Foundation Models
by: Wen, Yeming, et al.
Published: (2023)
BlockBatch: Multi-Scale Consensus Decoding for Efficient Diffusion Language Model Inference
by: Wu, Xiaoyou, et al.
Published: (2026)
Deep MMD Gradient Flow without adversarial training
by: Galashov, Alexandre, et al.
Published: (2024)
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn
by: Yu, Chao, et al.
Published: (2025)
Tula: Optimizing Time, Cost, and Generalization in Distributed Large-Batch Training
by: Tyagi, Sahil, et al.
Published: (2026)
Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration
by: Mlodozeniec, Bruno, et al.
Published: (2025)
Efficient GNN Training Through Structure-Aware Randomized Mini-Batching
by: Balaji, Vignesh, et al.
Published: (2025)
SamBaTen: Sampling-based Batch Incremental Tensor Decomposition
by: Gujral, Ekta, et al.
Published: (2017)
On Using Large-Batches in Federated Learning
by: Tyagi, Sahil
Published: (2025)
Seesaw: Accelerating Training by Balancing Learning Rate and Batch Size Scheduling
by: Meterez, Alexandru, et al.
Published: (2025)
Batch Active Learning of Reward Functions from Human Preferences
by: Bıyık, Erdem, et al.
Published: (2024)
A Lie Group Approach to Riemannian Batch Normalization
by: Chen, Ziheng, et al.
Published: (2024)
Semi-supervised Batch Learning From Logged Data
by: Aminian, Gholamali, et al.
Published: (2022)
Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning
by: Agrawal, Rishabh, et al.
Published: (2024)
Optimizing Data Curation through Spectral Analysis and Joint Batch Selection (SALN)
by: Sharifi, Mohammadreza
Published: (2024)
Time Transfer: On Optimal Learning Rate and Batch Size In The Infinite Data Limit
by: Filatov, Oleg, et al.
Published: (2024)
Trust the Batch, On- or Off-Policy: Adaptive Policy Optimization for RL Post-Training
by: Fakoor, Rasool, et al.
Published: (2026)
Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity
by: Johnson, Emmeran, et al.
Published: (2023)