Saved in:
| Main Authors: | Yang, Zi, Liu, Ziyue, Choudhary, Samridhi, Xie, Xinfeng, Gao, Cao, Kunzmann, Siegfried, Zhang, Zheng |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.14377 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding
by: Yang, Zi, et al.
Published: (2023)
by: Yang, Zi, et al.
Published: (2023)
CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
by: Liu, Ziyue, et al.
Published: (2025)
by: Liu, Ziyue, et al.
Published: (2025)
Saten: Sparse Augmented Tensor Networks for Post-Training Compression of Large Language Models
by: Solgi, Ryan, et al.
Published: (2025)
by: Solgi, Ryan, et al.
Published: (2025)
Ultra Memory-Efficient On-FPGA Training of Transformers via Tensor-Compressed Optimization
by: Tian, Jiayi, et al.
Published: (2025)
by: Tian, Jiayi, et al.
Published: (2025)
LaX: Boosting Low-Rank Training of Foundation Models via Latent Crossing
by: Zhang, Ruijie, et al.
Published: (2025)
by: Zhang, Ruijie, et al.
Published: (2025)
Identification and Evaluation of Novel SGLT-2 Inhibitors having Anti-diabetic Potential
by: Saini, Kunika, et al.
Published: (2026)
by: Saini, Kunika, et al.
Published: (2026)
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning
by: Yang, Yifan, et al.
Published: (2024)
by: Yang, Yifan, et al.
Published: (2024)
Quantum-Inspired Tensor Network Autoencoders for Anomaly Detection: A MERA-Based Approach
by: Gurkanli, Emre, et al.
Published: (2026)
by: Gurkanli, Emre, et al.
Published: (2026)
Joint Tensor-Train Parameterization for Efficient and Expressive Low-Rank Adaptation
by: Qi, Jun, et al.
Published: (2025)
by: Qi, Jun, et al.
Published: (2025)
Adaptive Patching for Tensor Train Computations
by: Grosso, Gianluca, et al.
Published: (2026)
by: Grosso, Gianluca, et al.
Published: (2026)
MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models
by: Zhang, Zhen, et al.
Published: (2025)
by: Zhang, Zhen, et al.
Published: (2025)
Muon$^2$: Boosting Muon via Adaptive Second-Moment Preconditioning
by: Liu, Ziyue, et al.
Published: (2026)
by: Liu, Ziyue, et al.
Published: (2026)
OASIS: Online Activation Subspace Learning for Memory-Efficient Training
by: Choudhary, Sakshi, et al.
Published: (2026)
by: Choudhary, Sakshi, et al.
Published: (2026)
Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
by: Zhu, Zhanda, et al.
Published: (2025)
by: Zhu, Zhanda, et al.
Published: (2025)
Robust Low‐Rank Tensor Recovery Using a Self‐Adaptive Learnable Weighted Tensor Total Variation Method
by: Fanyin Yang, et al.
Published: (2025)
by: Fanyin Yang, et al.
Published: (2025)
BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models
by: Wang, Zhengyang, et al.
Published: (2025)
by: Wang, Zhengyang, et al.
Published: (2025)
AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
by: Refael, Yehonathan, et al.
Published: (2024)
by: Refael, Yehonathan, et al.
Published: (2024)
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
by: Wang, Yezhen, et al.
Published: (2025)
by: Wang, Yezhen, et al.
Published: (2025)
A Riemannian Rank‐Adaptive Method for Higher‐Order Tensor Completion in the Tensor‐Train Format
by: Charlotte Vermeylen, et al.
Published: (2024)
by: Charlotte Vermeylen, et al.
Published: (2024)
COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs
by: Liu, Liming, et al.
Published: (2025)
by: Liu, Liming, et al.
Published: (2025)
Local Interpolation via Low-Rank Tensor Trains
by: Guzman, Siddhartha E., et al.
Published: (2026)
by: Guzman, Siddhartha E., et al.
Published: (2026)
Low Rank Tensor Completion via Adaptive ADMM
by: Führling, Niclas, et al.
Published: (2026)
by: Führling, Niclas, et al.
Published: (2026)
Dynamics‐Memory‐Event‐Based Adaptive Attitude Consensus Control for Multi‐UAVs With Self‐Adjusting Performance
by: Xinfeng Shao, et al.
Published: (2025)
by: Xinfeng Shao, et al.
Published: (2025)
RankGuide: Tensor-Rank-Guided Routing and Steering for Efficient Reasoning
by: Tian, Jiayi, et al.
Published: (2026)
by: Tian, Jiayi, et al.
Published: (2026)
PYTHALAB-MERA: Validation-Grounded Memory, Retrieval, and Acceptance Control for Frozen-LLM Coding Agents
by: Iscan, Mehmet
Published: (2026)
by: Iscan, Mehmet
Published: (2026)
Area-Efficient In-Memory Computing for Mixture-of-Experts via Multiplexing and Caching
by: Gao, Hanyuan, et al.
Published: (2026)
by: Gao, Hanyuan, et al.
Published: (2026)
MERA: A Comprehensive LLM Evaluation in Russian
by: Fenogenova, Alena, et al.
Published: (2024)
by: Fenogenova, Alena, et al.
Published: (2024)
Efficient Generalized Low-Rank Tensor Contextual Bandits
by: Yi, Qianxin, et al.
Published: (2023)
by: Yi, Qianxin, et al.
Published: (2023)
Low-Rank Robust Subspace Tensor Clustering for Metro Passenger Flow Modeling
by: Hu, Jiuyun, et al.
Published: (2024)
by: Hu, Jiuyun, et al.
Published: (2024)
Real-Time FJ/MAC PDE Solvers via Tensorized, Back-Propagation-Free Optical PINN Training
by: Zhao, Yequan, et al.
Published: (2023)
by: Zhao, Yequan, et al.
Published: (2023)
CoScale-RL: Efficient Post-Training by Co-Scaling Data and Computation
by: Chen, Yutong, et al.
Published: (2026)
by: Chen, Yutong, et al.
Published: (2026)
Efficient Implementation of Third-Order Tensor Methods with Adaptive Regularization for Unconstrained Optimization
by: Cartis, Coralia, et al.
Published: (2024)
by: Cartis, Coralia, et al.
Published: (2024)
TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training
by: Zhang, Ruijie, et al.
Published: (2026)
by: Zhang, Ruijie, et al.
Published: (2026)
Exact Renormalization of Wave Functionals yields Continuous MERA
by: Goldman, Samuel, et al.
Published: (2023)
by: Goldman, Samuel, et al.
Published: (2023)
TensorGRaD: Tensor Gradient Robust Decomposition for Memory-Efficient Neural Operator Training
by: Loeschcke, Sebastian, et al.
Published: (2025)
by: Loeschcke, Sebastian, et al.
Published: (2025)
Training Tensor Attention Efficiently: From Cubic to Almost Linear Time
by: Cao, Yang, et al.
Published: (2024)
by: Cao, Yang, et al.
Published: (2024)
Antiferromagnetic Tunnel Junctions (AFMTJs) for In-Memory Computing: Modeling and Case Study
by: Choudhary, Yousuf, et al.
Published: (2026)
by: Choudhary, Yousuf, et al.
Published: (2026)
Dynamical Simulations of Schrödinger's Equation via Rank-Adaptive Tensor Decompositions
by: Petersson, N. Anders, et al.
Published: (2026)
by: Petersson, N. Anders, et al.
Published: (2026)
Folding Tensor and Sequence Parallelism for Memory-Efficient Transformer Training & Inference
by: Shyam, Vasu, et al.
Published: (2026)
by: Shyam, Vasu, et al.
Published: (2026)
Adaptive Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization
by: Ji, Yixin, et al.
Published: (2024)
by: Ji, Yixin, et al.
Published: (2024)
Similar Items
-
Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding
by: Yang, Zi, et al.
Published: (2023) -
CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
by: Liu, Ziyue, et al.
Published: (2025) -
Saten: Sparse Augmented Tensor Networks for Post-Training Compression of Large Language Models
by: Solgi, Ryan, et al.
Published: (2025) -
Ultra Memory-Efficient On-FPGA Training of Transformers via Tensor-Compressed Optimization
by: Tian, Jiayi, et al.
Published: (2025) -
LaX: Boosting Low-Rank Training of Foundation Models via Latent Crossing
by: Zhang, Ruijie, et al.
Published: (2025)