:: Library Catalog

Bibliographic Details
Main Authors:	Yang, Zi; Liu, Ziyue; Choudhary, Samridhi; Xie, Xinfeng; Gao, Cao; Kunzmann, Siegfried; Zhang, Zheng
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2405.14377

Similar Items

Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding
by: Yang, Zi, et al.
Published: (2023)

CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
by: Liu, Ziyue, et al.
Published: (2025)

Saten: Sparse Augmented Tensor Networks for Post-Training Compression of Large Language Models
by: Solgi, Ryan, et al.
Published: (2025)

Ultra Memory-Efficient On-FPGA Training of Transformers via Tensor-Compressed Optimization
by: Tian, Jiayi, et al.
Published: (2025)

LaX: Boosting Low-Rank Training of Foundation Models via Latent Crossing
by: Zhang, Ruijie, et al.
Published: (2025)

Identification and Evaluation of Novel SGLT-2 Inhibitors having Anti-diabetic Potential
by: Saini, Kunika, et al.
Published: (2026)

AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning
by: Yang, Yifan, et al.
Published: (2024)

Quantum-Inspired Tensor Network Autoencoders for Anomaly Detection: A MERA-Based Approach
by: Gurkanli, Emre, et al.
Published: (2026)

Joint Tensor-Train Parameterization for Efficient and Expressive Low-Rank Adaptation
by: Qi, Jun, et al.
Published: (2025)

Adaptive Patching for Tensor Train Computations
by: Grosso, Gianluca, et al.
Published: (2026)

MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models
by: Zhang, Zhen, et al.
Published: (2025)

Muon$^2$: Boosting Muon via Adaptive Second-Moment Preconditioning
by: Liu, Ziyue, et al.
Published: (2026)

OASIS: Online Activation Subspace Learning for Memory-Efficient Training
by: Choudhary, Sakshi, et al.
Published: (2026)

Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
by: Zhu, Zhanda, et al.
Published: (2025)

Robust Low‐Rank Tensor Recovery Using a Self‐Adaptive Learnable Weighted Tensor Total Variation Method
by: Fanyin Yang, et al.
Published: (2025)

BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models
by: Wang, Zhengyang, et al.
Published: (2025)

AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
by: Refael, Yehonathan, et al.
Published: (2024)

Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
by: Wang, Yezhen, et al.
Published: (2025)

A Riemannian Rank‐Adaptive Method for Higher‐Order Tensor Completion in the Tensor‐Train Format
by: Charlotte Vermeylen, et al.
Published: (2024)

COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs
by: Liu, Liming, et al.
Published: (2025)

Local Interpolation via Low-Rank Tensor Trains
by: Guzman, Siddhartha E., et al.
Published: (2026)

Low Rank Tensor Completion via Adaptive ADMM
by: Führling, Niclas, et al.
Published: (2026)

Dynamics‐Memory‐Event‐Based Adaptive Attitude Consensus Control for Multi‐UAVs With Self‐Adjusting Performance
by: Xinfeng Shao, et al.
Published: (2025)

RankGuide: Tensor-Rank-Guided Routing and Steering for Efficient Reasoning
by: Tian, Jiayi, et al.
Published: (2026)

PYTHALAB-MERA: Validation-Grounded Memory, Retrieval, and Acceptance Control for Frozen-LLM Coding Agents
by: Iscan, Mehmet
Published: (2026)

Area-Efficient In-Memory Computing for Mixture-of-Experts via Multiplexing and Caching
by: Gao, Hanyuan, et al.
Published: (2026)

MERA: A Comprehensive LLM Evaluation in Russian
by: Fenogenova, Alena, et al.
Published: (2024)

Efficient Generalized Low-Rank Tensor Contextual Bandits
by: Yi, Qianxin, et al.
Published: (2023)

Low-Rank Robust Subspace Tensor Clustering for Metro Passenger Flow Modeling
by: Hu, Jiuyun, et al.
Published: (2024)

Real-Time FJ/MAC PDE Solvers via Tensorized, Back-Propagation-Free Optical PINN Training
by: Zhao, Yequan, et al.
Published: (2023)

CoScale-RL: Efficient Post-Training by Co-Scaling Data and Computation
by: Chen, Yutong, et al.
Published: (2026)

Efficient Implementation of Third-Order Tensor Methods with Adaptive Regularization for Unconstrained Optimization
by: Cartis, Coralia, et al.
Published: (2024)

TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training
by: Zhang, Ruijie, et al.
Published: (2026)

Exact Renormalization of Wave Functionals yields Continuous MERA
by: Goldman, Samuel, et al.
Published: (2023)

TensorGRaD: Tensor Gradient Robust Decomposition for Memory-Efficient Neural Operator Training
by: Loeschcke, Sebastian, et al.
Published: (2025)

Training Tensor Attention Efficiently: From Cubic to Almost Linear Time
by: Cao, Yang, et al.
Published: (2024)

Antiferromagnetic Tunnel Junctions (AFMTJs) for In-Memory Computing: Modeling and Case Study
by: Choudhary, Yousuf, et al.
Published: (2026)

Dynamical Simulations of Schrödinger's Equation via Rank-Adaptive Tensor Decompositions
by: Petersson, N. Anders, et al.
Published: (2026)

Folding Tensor and Sequence Parallelism for Memory-Efficient Transformer Training & Inference
by: Shyam, Vasu, et al.
Published: (2026)

Adaptive Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization
by: Ji, Yixin, et al.
Published: (2024)