Saved in:
| Main Authors: | Ullah, Nasib, Zhang, Jinbin, Randrianantenaina, Jean Lucien, Schultheis, Erik, Babbar, Rohit |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.01117 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DynaSpec: Context-aware Dynamic Speculative Sampling for Large-Vocabulary Language Models
by: Zhang, Jinbin, et al.
Published: (2025)
by: Zhang, Jinbin, et al.
Published: (2025)
Navigating Extremes: Dynamic Sparsity in Large Output Spaces
by: Ullah, Nasib, et al.
Published: (2024)
by: Ullah, Nasib, et al.
Published: (2024)
ELMO: Efficiency via Low-precision and Peak Memory Optimization in Large Output Spaces
by: Zhang, Jinbin, et al.
Published: (2025)
by: Zhang, Jinbin, et al.
Published: (2025)
Labels in Extremes: How Well Calibrated are Extreme Multi-label Classifiers?
by: Ullah, Nasib, et al.
Published: (2024)
by: Ullah, Nasib, et al.
Published: (2024)
Large Language Model as a Teacher for Zero-shot Tagging at Extreme Scales
by: Zhang, Jinbin, et al.
Published: (2024)
by: Zhang, Jinbin, et al.
Published: (2024)
UniDEC : Unified Dual Encoder and Classifier Training for Extreme Multi-Label Classification
by: Kharbanda, Siddhant, et al.
Published: (2024)
by: Kharbanda, Siddhant, et al.
Published: (2024)
FFT-based Dynamic Subspace Selection for Low-Rank Adaptive Optimization of Large Language Models
by: Modoranu, Ionut-Vlad, et al.
Published: (2025)
by: Modoranu, Ionut-Vlad, et al.
Published: (2025)
Generalized test utilities for long-tail performance in extreme multi-label classification
by: Schultheis, Erik, et al.
Published: (2023)
by: Schultheis, Erik, et al.
Published: (2023)
A General Online Algorithm for Optimizing Complex Performance Metrics
by: Kotłowski, Wojciech, et al.
Published: (2024)
by: Kotłowski, Wojciech, et al.
Published: (2024)
InceptionXML: A Lightweight Framework with Synchronized Negative Sampling for Short Text Extreme Classification
by: Kharbanda, Siddhant, et al.
Published: (2021)
by: Kharbanda, Siddhant, et al.
Published: (2021)
"What is Different Between These Datasets?" A Framework for Explaining Data Distribution Shifts
by: Babbar, Varun, et al.
Published: (2024)
by: Babbar, Varun, et al.
Published: (2024)
Consistent algorithms for multi-label classification with macro-at-$k$ metrics
by: Schultheis, Erik, et al.
Published: (2024)
by: Schultheis, Erik, et al.
Published: (2024)
Topology-Aware Revival for Efficient Sparse Training
by: Jin, Meiling, et al.
Published: (2026)
by: Jin, Meiling, et al.
Published: (2026)
SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention
by: Xu, Hongtao, et al.
Published: (2026)
by: Xu, Hongtao, et al.
Published: (2026)
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
by: Yuan, Jingyang, et al.
Published: (2025)
by: Yuan, Jingyang, et al.
Published: (2025)
Steering Large Language Model Activations in Sparse Spaces
by: Bayat, Reza, et al.
Published: (2025)
by: Bayat, Reza, et al.
Published: (2025)
NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling
by: Grooten, Bram, et al.
Published: (2025)
by: Grooten, Bram, et al.
Published: (2025)
Learning label-label correlations in Extreme Multi-label Classification via Label Features
by: Kharbanda, Siddhant, et al.
Published: (2024)
by: Kharbanda, Siddhant, et al.
Published: (2024)
Hardware-Aware DNN Compression for Homogeneous Edge Devices
by: Zhang, Kunlong, et al.
Published: (2025)
by: Zhang, Kunlong, et al.
Published: (2025)
HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference
by: Gong, Ping, et al.
Published: (2025)
by: Gong, Ping, et al.
Published: (2025)
Multi-Objective Hardware Aware Neural Architecture Search using Hardware Cost Diversity
by: Sinha, Nilotpal, et al.
Published: (2024)
by: Sinha, Nilotpal, et al.
Published: (2024)
packetLSTM: Dynamic LSTM Framework for Streaming Data with Varying Feature Space
by: Agarwal, Rohit, et al.
Published: (2024)
by: Agarwal, Rohit, et al.
Published: (2024)
Group-Aware Reinforcement Learning for Output Diversity in Large Language Models
by: Anschel, Oron, et al.
Published: (2025)
by: Anschel, Oron, et al.
Published: (2025)
BudgetDraft: Acceptance-Aware Multi-View Training for Sparse-KV Speculative Decoding
by: He, Liang, et al.
Published: (2026)
by: He, Liang, et al.
Published: (2026)
From Rashomon Theory to PRAXIS: Efficient Decision Tree Rashomon Sets
by: Heile, Zakk, et al.
Published: (2026)
by: Heile, Zakk, et al.
Published: (2026)
Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces
by: Yu, Shixing, et al.
Published: (2026)
by: Yu, Shixing, et al.
Published: (2026)
Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training
by: Tan, Qitao, et al.
Published: (2025)
by: Tan, Qitao, et al.
Published: (2025)
DAFOS: Dynamic Adaptive Fanout Optimization Sampler
by: Ullah, Irfan, et al.
Published: (2025)
by: Ullah, Irfan, et al.
Published: (2025)
Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design
by: Choi, Jaemoo, et al.
Published: (2026)
by: Choi, Jaemoo, et al.
Published: (2026)
Bitwidth-Specific Logarithmic Arithmetic for Future Hardware-Accelerated Training
by: Hamad, Hassan, et al.
Published: (2025)
by: Hamad, Hassan, et al.
Published: (2025)
HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space
by: Li, Ke, et al.
Published: (2025)
by: Li, Ke, et al.
Published: (2025)
Hardware Aware Ensemble Selection for Balancing Predictive Accuracy and Cost
by: Maier, Jannis, et al.
Published: (2024)
by: Maier, Jannis, et al.
Published: (2024)
Improving Sparse Autoencoder with Dynamic Attention
by: Wang, Dongsheng, et al.
Published: (2026)
by: Wang, Dongsheng, et al.
Published: (2026)
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
by: Qu, Xingwei, et al.
Published: (2025)
by: Qu, Xingwei, et al.
Published: (2025)
Advancing On-Device Neural Network Training with TinyPropv2: Dynamic, Sparse, and Efficient Backpropagation
by: Rüb, Marcus, et al.
Published: (2024)
by: Rüb, Marcus, et al.
Published: (2024)
OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition
by: Zhang, Stephen, et al.
Published: (2024)
by: Zhang, Stephen, et al.
Published: (2024)
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
by: Zhang, Xinyu, et al.
Published: (2025)
by: Zhang, Xinyu, et al.
Published: (2025)
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
by: Sukthanker, Rhea Sanjay, et al.
Published: (2024)
by: Sukthanker, Rhea Sanjay, et al.
Published: (2024)
Optimistic Verifiable Training by Controlling Hardware Nondeterminism
by: Srivastava, Megha, et al.
Published: (2024)
by: Srivastava, Megha, et al.
Published: (2024)
Accumulator-Aware Post-Training Quantization for Large Language Models
by: Colbert, Ian, et al.
Published: (2024)
by: Colbert, Ian, et al.
Published: (2024)
Similar Items
-
DynaSpec: Context-aware Dynamic Speculative Sampling for Large-Vocabulary Language Models
by: Zhang, Jinbin, et al.
Published: (2025) -
Navigating Extremes: Dynamic Sparsity in Large Output Spaces
by: Ullah, Nasib, et al.
Published: (2024) -
ELMO: Efficiency via Low-precision and Peak Memory Optimization in Large Output Spaces
by: Zhang, Jinbin, et al.
Published: (2025) -
Labels in Extremes: How Well Calibrated are Extreme Multi-label Classifiers?
by: Ullah, Nasib, et al.
Published: (2024) -
Large Language Model as a Teacher for Zero-shot Tagging at Extreme Scales
by: Zhang, Jinbin, et al.
Published: (2024)