Saved in:
| Main Author: | Newgas, Adam |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.09816 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Greedy Selection under Independent Increments: A Toy Model Analysis
by: Yang, Huitao
Published: (2025)
by: Yang, Huitao
Published: (2025)
SafeSeek: Universal Attribution of Safety Circuits in Language Models
by: Yu, Miao, et al.
Published: (2026)
by: Yu, Miao, et al.
Published: (2026)
Semantic Optimal Transport for Sparse Autoencoder Feature Matching and Circuit Compression
by: Cao, Tue M., et al.
Published: (2026)
by: Cao, Tue M., et al.
Published: (2026)
Statistical Analysis of Policy Space Compression Problem
by: Molaei, Majid, et al.
Published: (2024)
by: Molaei, Majid, et al.
Published: (2024)
Latent Generative Models with Tunable Complexity for Compressed Sensing and other Inverse Problems
by: Gunn, Sean, et al.
Published: (2026)
by: Gunn, Sean, et al.
Published: (2026)
Quantifying LLM Attention-Head Stability: Implications for Circuit Universality
by: Bali, Karan, et al.
Published: (2026)
by: Bali, Karan, et al.
Published: (2026)
Task-Circuit Quantization: Leveraging Knowledge Localization and Interpretability for Compression
by: Xiao, Hanqi, et al.
Published: (2025)
by: Xiao, Hanqi, et al.
Published: (2025)
Quantum Circuit-Based Learning Models: Bridging Quantum Computing and Machine Learning
by: Fan, Fan, et al.
Published: (2026)
by: Fan, Fan, et al.
Published: (2026)
JustDense: Just using Dense instead of Sequence Mixer for Time Series analysis
by: Park, TaekHyun, et al.
Published: (2025)
by: Park, TaekHyun, et al.
Published: (2025)
Compute Aligned Training: Optimizing for Test Time Inference
by: Ousherovitch, Adam, et al.
Published: (2026)
by: Ousherovitch, Adam, et al.
Published: (2026)
Training-free Ultra Small Model for Universal Sparse Reconstruction in Compressed Sensing
by: Tang, Chaoqing, et al.
Published: (2025)
by: Tang, Chaoqing, et al.
Published: (2025)
SynCircuit: Automated Generation of New Synthetic RTL Circuits Can Enable Big Data in Circuits
by: Liu, Shang, et al.
Published: (2025)
by: Liu, Shang, et al.
Published: (2025)
Finding Interpretable Prompt-Specific Circuits in Language Models
by: Franco, Gabriel, et al.
Published: (2026)
by: Franco, Gabriel, et al.
Published: (2026)
Bayesian Federated Model Compression for Communication and Computation Efficiency
by: Xia, Chengyu, et al.
Published: (2024)
by: Xia, Chengyu, et al.
Published: (2024)
To Compress or Not? Pushing the Frontier of Lossless GenAI Model Weights Compression with Exponent Concentration
by: Yang, Zeyu, et al.
Published: (2025)
by: Yang, Zeyu, et al.
Published: (2025)
When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models
by: Zhang, Nan, et al.
Published: (2025)
by: Zhang, Nan, et al.
Published: (2025)
Quasimetric Value Functions with Dense Rewards
by: Valieva, Khadichabonu, et al.
Published: (2024)
by: Valieva, Khadichabonu, et al.
Published: (2024)
Structure as Computation: Developmental Generation of Minimal Neural Circuits
by: Zhou, Duan
Published: (2026)
by: Zhou, Duan
Published: (2026)
Scaling Continuous Latent Variable Models as Probabilistic Integral Circuits
by: Gala, Gennaro, et al.
Published: (2024)
by: Gala, Gennaro, et al.
Published: (2024)
Generalizing Scaling Laws for Dense and Sparse Large Language Models
by: Hossain, Md Arafat, et al.
Published: (2025)
by: Hossain, Md Arafat, et al.
Published: (2025)
Hyper-Compression: Model Compression via Hyperfunction
by: Fan, Fenglei, et al.
Published: (2024)
by: Fan, Fenglei, et al.
Published: (2024)
Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models
by: Wang, Siqi, et al.
Published: (2024)
by: Wang, Siqi, et al.
Published: (2024)
Densely Multiplied Physics Informed Neural Networks
by: Jiang, Feilong, et al.
Published: (2024)
by: Jiang, Feilong, et al.
Published: (2024)
Optimizing Dense Feed-Forward Neural Networks
by: Balderas, Luis, et al.
Published: (2023)
by: Balderas, Luis, et al.
Published: (2023)
EUGens: Efficient, Unified, and General Dense Layers
by: Kim, Sang Min, et al.
Published: (2026)
by: Kim, Sang Min, et al.
Published: (2026)
The Expressivity Boundary of Probabilistic Circuits: A Comparison with Large Language Models
by: Zhao, Zhiyu, et al.
Published: (2026)
by: Zhao, Zhiyu, et al.
Published: (2026)
The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity
by: Chen, Yifang, et al.
Published: (2024)
by: Chen, Yifang, et al.
Published: (2024)
A Curriculum-Based Deep Reinforcement Learning Framework for the Electric Vehicle Routing Problem
by: Daysalilar, Mertcan, et al.
Published: (2026)
by: Daysalilar, Mertcan, et al.
Published: (2026)
On the Computational Capability of Graph Neural Networks: A Circuit Complexity Bound Perspective
by: Li, Xiaoyu, et al.
Published: (2025)
by: Li, Xiaoyu, et al.
Published: (2025)
Efficiently Editing Mixture-of-Experts Models with Compressed Experts
by: He, Yifei, et al.
Published: (2025)
by: He, Yifei, et al.
Published: (2025)
CURing Large Models: Compression via CUR Decomposition
by: Park, Sanghyeon, et al.
Published: (2025)
by: Park, Sanghyeon, et al.
Published: (2025)
Dependable Distributed Training of Compressed Machine Learning Models
by: Malandrino, Francesco, et al.
Published: (2024)
by: Malandrino, Francesco, et al.
Published: (2024)
Robust Basis Spline Decoupling for the Compression of Transformer Models
by: De Jonghe, Joppe, et al.
Published: (2026)
by: De Jonghe, Joppe, et al.
Published: (2026)
Sparse Probabilistic Graph Circuits
by: Rektoris, Martin, et al.
Published: (2025)
by: Rektoris, Martin, et al.
Published: (2025)
Restructuring Tractable Probabilistic Circuits
by: Zhang, Honghua, et al.
Published: (2024)
by: Zhang, Honghua, et al.
Published: (2024)
Soft Learning Probabilistic Circuits
by: Ghandi, Soroush, et al.
Published: (2024)
by: Ghandi, Soroush, et al.
Published: (2024)
Causal Neural Probabilistic Circuits
by: Chen, Weixin, et al.
Published: (2026)
by: Chen, Weixin, et al.
Published: (2026)
Pruning and Distilling Mixture-of-Experts into Dense Language Models
by: Kim, Junhyuck, et al.
Published: (2026)
by: Kim, Junhyuck, et al.
Published: (2026)
Universal Reinforcement Learning in Coalgebras: Asynchronous Stochastic Computation via Conduction
by: Mahadevan, Sridhar
Published: (2025)
by: Mahadevan, Sridhar
Published: (2025)
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
by: Panda, Ashwinee, et al.
Published: (2025)
by: Panda, Ashwinee, et al.
Published: (2025)
Similar Items
-
Greedy Selection under Independent Increments: A Toy Model Analysis
by: Yang, Huitao
Published: (2025) -
SafeSeek: Universal Attribution of Safety Circuits in Language Models
by: Yu, Miao, et al.
Published: (2026) -
Semantic Optimal Transport for Sparse Autoencoder Feature Matching and Circuit Compression
by: Cao, Tue M., et al.
Published: (2026) -
Statistical Analysis of Policy Space Compression Problem
by: Molaei, Majid, et al.
Published: (2024) -
Latent Generative Models with Tunable Complexity for Compressed Sensing and other Inverse Problems
by: Gunn, Sean, et al.
Published: (2026)