Saved in:
| Main Authors: | Wendler, Chris, Alistarh, Dan, Püschel, Markus |
|---|---|
| Format: | Preprint |
| Published: |
2019
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/1909.02253 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning DAGs from Data with Few Root Causes
by: Misiakos, Panagiotis, et al.
Published: (2023)
by: Misiakos, Panagiotis, et al.
Published: (2023)
Causal Fourier Analysis on Directed Acyclic Graphs and Posets
by: Seifert, Bastian, et al.
Published: (2022)
by: Seifert, Bastian, et al.
Published: (2022)
SpinSVAR: Estimating Structural Vector Autoregression Assuming Sparse Input
by: Misiakos, Panagiotis, et al.
Published: (2025)
by: Misiakos, Panagiotis, et al.
Published: (2025)
SPADE: Sparsity-Guided Debugging for Deep Neural Networks
by: Moakhar, Arshia Soltani, et al.
Published: (2023)
by: Moakhar, Arshia Soltani, et al.
Published: (2023)
Behemoth: Benchmarking Unlearning in LLMs Using Fully Synthetic Data
by: Iofinova, Eugenia, et al.
Published: (2026)
by: Iofinova, Eugenia, et al.
Published: (2026)
Model Compression with Exact Budget Constraints via Riemannian Manifolds
by: Helcig, Michael, et al.
Published: (2026)
by: Helcig, Michael, et al.
Published: (2026)
Scalable Mechanistic Neural Networks for Differential Equations and Machine Learning
by: Chen, Jiale, et al.
Published: (2024)
by: Chen, Jiale, et al.
Published: (2024)
Towards Combinatorial Interpretability of Neural Computation
by: Adler, Micah, et al.
Published: (2025)
by: Adler, Micah, et al.
Published: (2025)
LLMQ: Efficient Lower-Precision Pretraining for Consumer GPUs
by: Schultheis, Erik, et al.
Published: (2025)
by: Schultheis, Erik, et al.
Published: (2025)
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
by: Kleinegger, Maximilian, et al.
Published: (2026)
by: Kleinegger, Maximilian, et al.
Published: (2026)
Statistically-Lossless Quantization of Large Language Models
by: Helcig, Michael, et al.
Published: (2026)
by: Helcig, Michael, et al.
Published: (2026)
Simple Opinion Dynamics for No-Regret Learning
by: Lazarsfeld, John, et al.
Published: (2023)
by: Lazarsfeld, John, et al.
Published: (2023)
Apertus LLM Family Expansion via Distillation and Quantization
by: Panferov, Andrei, et al.
Published: (2026)
by: Panferov, Andrei, et al.
Published: (2026)
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
by: Sieberling, Oliver, et al.
Published: (2024)
by: Sieberling, Oliver, et al.
Published: (2024)
Communication-Efficient Federated Learning With Data and Client Heterogeneity
by: Zakerinia, Hossein, et al.
Published: (2022)
by: Zakerinia, Hossein, et al.
Published: (2022)
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation
by: Panferov, Andrei, et al.
Published: (2026)
by: Panferov, Andrei, et al.
Published: (2026)
Towards Robust Scaling Laws for Optimizers
by: Volkova, Alexandra, et al.
Published: (2026)
by: Volkova, Alexandra, et al.
Published: (2026)
MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning
by: Modoranu, Ionut-Vlad, et al.
Published: (2026)
by: Modoranu, Ionut-Vlad, et al.
Published: (2026)
Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models
by: Kurtic, Eldar, et al.
Published: (2024)
by: Kurtic, Eldar, et al.
Published: (2024)
Review of blockchain application with Graph Neural Networks, Graph Convolutional Networks and Convolutional Neural Networks
by: Ancelotti, Amy, et al.
Published: (2024)
by: Ancelotti, Amy, et al.
Published: (2024)
Benchmarking Convolutional Neural Network and Graph Neural Network based Surrogate Models on a Real-World Car External Aerodynamics Dataset
by: Jacob, Sam Jacob, et al.
Published: (2025)
by: Jacob, Sam Jacob, et al.
Published: (2025)
Beyond Outliers: A Study of Optimizers Under Quantization
by: Vlassis, Georgios, et al.
Published: (2025)
by: Vlassis, Georgios, et al.
Published: (2025)
CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training
by: Tabesh, Soroush, et al.
Published: (2025)
by: Tabesh, Soroush, et al.
Published: (2025)
Efficient Data Selection at Scale via Influence Distillation
by: Nikdan, Mahdi, et al.
Published: (2025)
by: Nikdan, Mahdi, et al.
Published: (2025)
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
by: Robert, Thomas, et al.
Published: (2024)
by: Robert, Thomas, et al.
Published: (2024)
Hybrid Decentralized Optimization: Leveraging Both First- and Zeroth-Order Optimizers for Faster Convergence
by: Ansaripour, Matin, et al.
Published: (2022)
by: Ansaripour, Matin, et al.
Published: (2022)
ECO: Quantized Training without Full-Precision Master Weights
by: Nikdan, Mahdi, et al.
Published: (2026)
by: Nikdan, Mahdi, et al.
Published: (2026)
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation
by: Nikdan, Mahdi, et al.
Published: (2024)
by: Nikdan, Mahdi, et al.
Published: (2024)
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
by: Frantar, Elias, et al.
Published: (2024)
by: Frantar, Elias, et al.
Published: (2024)
The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information
by: Wu, Diyuan, et al.
Published: (2024)
by: Wu, Diyuan, et al.
Published: (2024)
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers
by: Modoranu, Ionut-Vlad, et al.
Published: (2026)
by: Modoranu, Ionut-Vlad, et al.
Published: (2026)
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
by: Chen, Jiale, et al.
Published: (2025)
by: Chen, Jiale, et al.
Published: (2025)
Multiscale Training of Convolutional Neural Networks
by: Ahamed, Shadab, et al.
Published: (2025)
by: Ahamed, Shadab, et al.
Published: (2025)
Advection Augmented Convolutional Neural Networks
by: Zakariaei, Niloufar, et al.
Published: (2024)
by: Zakariaei, Niloufar, et al.
Published: (2024)
Variational Graph Convolutional Neural Networks
by: Oleksiienko, Illia, et al.
Published: (2025)
by: Oleksiienko, Illia, et al.
Published: (2025)
Dual Convexified Convolutional Neural Networks
by: Bai, Site, et al.
Published: (2022)
by: Bai, Site, et al.
Published: (2022)
DarwinLM: Evolutionary Structured Pruning of Large Language Models
by: Tang, Shengkun, et al.
Published: (2025)
by: Tang, Shengkun, et al.
Published: (2025)
Wasserstein Distances, Neuronal Entanglement, and Sparsity
by: Sawmya, Shashata, et al.
Published: (2024)
by: Sawmya, Shashata, et al.
Published: (2024)
The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM
by: Lee, Kwanhee, et al.
Published: (2025)
by: Lee, Kwanhee, et al.
Published: (2025)
Compression Scaling Laws:Unifying Sparsity and Quantization
by: Frantar, Elias, et al.
Published: (2025)
by: Frantar, Elias, et al.
Published: (2025)
Similar Items
-
Learning DAGs from Data with Few Root Causes
by: Misiakos, Panagiotis, et al.
Published: (2023) -
Causal Fourier Analysis on Directed Acyclic Graphs and Posets
by: Seifert, Bastian, et al.
Published: (2022) -
SpinSVAR: Estimating Structural Vector Autoregression Assuming Sparse Input
by: Misiakos, Panagiotis, et al.
Published: (2025) -
SPADE: Sparsity-Guided Debugging for Deep Neural Networks
by: Moakhar, Arshia Soltani, et al.
Published: (2023) -
Behemoth: Benchmarking Unlearning in LLMs Using Fully Synthetic Data
by: Iofinova, Eugenia, et al.
Published: (2026)