Library Catalog

Bibliographic Details
Main Authors:	Ullah, Nasib; Zhang, Jinbin; Randrianantenaina, Jean Lucien; Schultheis, Erik; Babbar, Rohit
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2606.01117

Similar Items

DynaSpec: Context-aware Dynamic Speculative Sampling for Large-Vocabulary Language Models
by: Zhang, Jinbin, et al.
Published: (2025)

Navigating Extremes: Dynamic Sparsity in Large Output Spaces
by: Ullah, Nasib, et al.
Published: (2024)

ELMO: Efficiency via Low-precision and Peak Memory Optimization in Large Output Spaces
by: Zhang, Jinbin, et al.
Published: (2025)

Labels in Extremes: How Well Calibrated are Extreme Multi-label Classifiers?
by: Ullah, Nasib, et al.
Published: (2024)

Large Language Model as a Teacher for Zero-shot Tagging at Extreme Scales
by: Zhang, Jinbin, et al.
Published: (2024)

UniDEC : Unified Dual Encoder and Classifier Training for Extreme Multi-Label Classification
by: Kharbanda, Siddhant, et al.
Published: (2024)

FFT-based Dynamic Subspace Selection for Low-Rank Adaptive Optimization of Large Language Models
by: Modoranu, Ionut-Vlad, et al.
Published: (2025)

Generalized test utilities for long-tail performance in extreme multi-label classification
by: Schultheis, Erik, et al.
Published: (2023)

A General Online Algorithm for Optimizing Complex Performance Metrics
by: Kotłowski, Wojciech, et al.
Published: (2024)

InceptionXML: A Lightweight Framework with Synchronized Negative Sampling for Short Text Extreme Classification
by: Kharbanda, Siddhant, et al.
Published: (2021)

"What is Different Between These Datasets?" A Framework for Explaining Data Distribution Shifts
by: Babbar, Varun, et al.
Published: (2024)

Consistent algorithms for multi-label classification with macro-at-$k$ metrics
by: Schultheis, Erik, et al.
Published: (2024)

Topology-Aware Revival for Efficient Sparse Training
by: Jin, Meiling, et al.
Published: (2026)

SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention
by: Xu, Hongtao, et al.
Published: (2026)

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
by: Yuan, Jingyang, et al.
Published: (2025)

Steering Large Language Model Activations in Sparse Spaces
by: Bayat, Reza, et al.
Published: (2025)

NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling
by: Grooten, Bram, et al.
Published: (2025)

Learning label-label correlations in Extreme Multi-label Classification via Label Features
by: Kharbanda, Siddhant, et al.
Published: (2024)

Hardware-Aware DNN Compression for Homogeneous Edge Devices
by: Zhang, Kunlong, et al.
Published: (2025)

HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference
by: Gong, Ping, et al.
Published: (2025)

Multi-Objective Hardware Aware Neural Architecture Search using Hardware Cost Diversity
by: Sinha, Nilotpal, et al.
Published: (2024)

packetLSTM: Dynamic LSTM Framework for Streaming Data with Varying Feature Space
by: Agarwal, Rohit, et al.
Published: (2024)

Group-Aware Reinforcement Learning for Output Diversity in Large Language Models
by: Anschel, Oron, et al.
Published: (2025)

BudgetDraft: Acceptance-Aware Multi-View Training for Sparse-KV Speculative Decoding
by: He, Liang, et al.
Published: (2026)

From Rashomon Theory to PRAXIS: Efficient Decision Tree Rashomon Sets
by: Heile, Zakk, et al.
Published: (2026)

Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces
by: Yu, Shixing, et al.
Published: (2026)

Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training
by: Tan, Qitao, et al.
Published: (2025)

DAFOS: Dynamic Adaptive Fanout Optimization Sampler
by: Ullah, Irfan, et al.
Published: (2025)

Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design
by: Choi, Jaemoo, et al.
Published: (2026)

Bitwidth-Specific Logarithmic Arithmetic for Future Hardware-Accelerated Training
by: Hamad, Hassan, et al.
Published: (2025)

HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space
by: Li, Ke, et al.
Published: (2025)

Hardware Aware Ensemble Selection for Balancing Predictive Accuracy and Cost
by: Maier, Jannis, et al.
Published: (2024)

Improving Sparse Autoencoder with Dynamic Attention
by: Wang, Dongsheng, et al.
Published: (2026)

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
by: Qu, Xingwei, et al.
Published: (2025)

Advancing On-Device Neural Network Training with TinyPropv2: Dynamic, Sparse, and Efficient Backpropagation
by: Rüb, Marcus, et al.
Published: (2024)

OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition
by: Zhang, Stephen, et al.
Published: (2024)

Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
by: Zhang, Xinyu, et al.
Published: (2025)

HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
by: Sukthanker, Rhea Sanjay, et al.
Published: (2024)

Optimistic Verifiable Training by Controlling Hardware Nondeterminism
by: Srivastava, Megha, et al.
Published: (2024)

Accumulator-Aware Post-Training Quantization for Large Language Models
by: Colbert, Ian, et al.
Published: (2024)