| Main Authors: | Dozier, Haley; Henslee, Althea; Abraham, Ashley; Strelzoff, Andrew; Chappell, Mark |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.03374 |
Similar Items
Large-Scale Data Parallelization of Product Quantization and Inverted Indexing Using Dask
by: Abraham, Ashley N., et al.
Published: (2026)
Characteristic Energy Behavior Profiling of Non-Residential Buildings
by: Dozier, Haley, et al.
Published: (2025)
AI Progress Should Be Measured by Capability-Per-Resource, Not Scale Alone: A Framework for Gradient-Guided Resource Allocation in LLMs
by: McCoy, David, et al.
Published: (2025)
Similarity-Quantized Relative Difference Learning for Improved Molecular Activity Prediction
by: Zadorozhny, Karina, et al.
Published: (2025)
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels
by: Tong, Yujia, et al.
Published: (2025)
The Hidden Pitfalls of the Cosine Similarity Loss
by: Draganov, Andrew, et al.
Published: (2024)
The Muon Space GNSS-R Surface Soil Moisture Product
by: Roberts, Max, et al.
Published: (2024)
How not to Stitch Representations to Measure Similarity: Task Loss Matching versus Direct Matching
by: Balogh, András, et al.
Published: (2024)
Physics-Informed Operator Learning for Hemodynamic Modeling
by: Chappell, Ryan, et al.
Published: (2025)
On the importance of structural identifiability for machine learning with partially observed dynamical systems
by: Norden, Janis, et al.
Published: (2025)
Exploiting LLM Quantization
by: Egashira, Kazuki, et al.
Published: (2024)
PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration
by: AbouElhamayed, Ahmed F., et al.
Published: (2023)
Locally-Adaptive Quantization for Streaming Vector Search
by: Aguerrebere, Cecilia, et al.
Published: (2024)
Predicting Probabilities of Error to Combine Quantization and Early Exiting: QuEE
by: Regol, Florence, et al.
Published: (2024)
Beyond Surface-Level Similarity: Hierarchical Contamination Detection for Synthetic Training Data in Foundation Models
by: Mehta, Sushant
Published: (2025)
Is Cosine-Similarity of Embeddings Really About Similarity?
by: Steck, Harald, et al.
Published: (2024)
LORE: Jointly Learning the Intrinsic Dimensionality and Relative Similarity Structure From Ordinal Data
by: Anand, Vivek, et al.
Published: (2026)
Causal Inference as Distribution Adaptation: Optimizing ATE Risk under Propensity Uncertainty
by: Zhang, Ashley
Published: (2025)
GPT2MEG: Quantizing MEG for Autoregressive Generation
by: Csaky, Richard, et al.
Published: (2024)
Widening the Gap: Exploiting LLM Quantization via Outlier Injection
by: Zhan, Xiaohua, et al.
Published: (2026)
Weight-Parameterization in Continuous Time Deep Neural Networks for Surrogate Modeling
by: Rosso, Haley, et al.
Published: (2025)
On-Device Training of Fully Quantized Deep Neural Networks on Cortex-M Microcontrollers
by: Deutel, Mark, et al.
Published: (2024)
On the Similarities of Embeddings in Contrastive Learning
by: Lee, Chungpa, et al.
Published: (2025)
Random Similarity Isolation Forests
by: Chwilczyński, Sebastian, et al.
Published: (2025)
Clustering, Coding, and the Concept of Similarity
by: McCarty, L. Thorne
Published: (2014)
NestQuant: Nested Lattice Quantization for Matrix Products and LLMs
by: Savkin, Semyon, et al.
Published: (2025)
Inner Product Aware Quantization: Provably Fast, Accurate, and Adaptive Algorithms
by: White, Nathan, et al.
Published: (2026)
SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization
by: Bai, Runsheng, et al.
Published: (2024)
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
by: Choi, Kanghyun, et al.
Published: (2024)
Operationalizing Quantized Disentanglement
by: Barin-Pacela, Vitoria, et al.
Published: (2025)
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
by: Egiazarian, Vage, et al.
Published: (2025)
Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence
by: Fein-Ashley, Jacob
Published: (2025)
Linear Diffusion Networks
by: Fein-Ashley, Jacob
Published: (2025)
Mind the Gap: A Practical Attack on GGUF Quantization
by: Egashira, Kazuki, et al.
Published: (2025)
Do Similar Entities have Similar Embeddings?
by: Hubert, Nicolas, et al.
Published: (2023)
Graph Feedback Bandits with Similar Arms
by: Qi, Han, et al.
Published: (2024)
PQCache: Product Quantization-based KVCache for Long Context LLM Inference
by: Zhang, Hailin, et al.
Published: (2024)
Layer-wise Quantization for Quantized Optimistic Dual Averaging
by: Nguyen, Anh Duc, et al.
Published: (2025)
Value-Aware Product Recommendation by Customer Segmentation using a suitable High-Dimensional Similarity Measure
by: Acosta, María Florencia, et al.
Published: (2026)
Online Vector Quantized Attention
by: Alonso, Nick, et al.
Published: (2026)