:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Muhamed, Aashiq, Li, Oscar, Woodruff, David, Diab, Mona, Smith, Virginia
Formato:	Preprint
Publicado:	2024
Materias:	Machine Learning
Acceso en línea:	https://arxiv.org/abs/2406.17660
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
por: Muhamed, Aashiq, et al.
Publicado: (2024)

CoRAG: Collaborative Retrieval-Augmented Generation
por: Muhamed, Aashiq, et al.
Publicado: (2025)

SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
por: Muhamed, Aashiq, et al.
Publicado: (2025)

DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment
por: Wedgwood, James, et al.
Publicado: (2026)

Pando: Do Interpretability Methods Work When Models Won't Explain Themselves?
por: Zhong, Ziqian, et al.
Publicado: (2026)

RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models
por: Muhamed, Aashiq, et al.
Publicado: (2025)

CCRS: A Zero-Shot LLM-as-a-Judge Framework for Comprehensive RAG Evaluation
por: Muhamed, Aashiq
Publicado: (2025)

Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs
por: Song, Xiangchen, et al.
Publicado: (2025)

Beyond Understanding: Evaluating the Pragmatic Gap in LLMs' Cultural Processing of Figurative Language
por: Attia, Mena, et al.
Publicado: (2025)

Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
por: Wang, Yezhen, et al.
Publicado: (2025)

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
por: Zhao, Jiawei, et al.
Publicado: (2024)

Emotion Classification in Low and Moderate Resource Languages
por: Tafreshi, Shabnam, et al.
Publicado: (2024)

Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification
por: ElNokrashy, Muhammad, et al.
Publicado: (2022)

DWTSumm: Discrete Wavelet Transform for Document Summarization
por: Salama, Rana, et al.
Publicado: (2026)

Gradient Weight-normalized Low-rank Projection for Efficient LLM Training
por: Huang, Jia-Hong, et al.
Publicado: (2024)

Efficient Sparse Training with Structured Dropout
por: Lo, Andy
Publicado: (2024)

SimBA: Simplifying Benchmark Analysis Using Performance Matrices Alone
por: Subramani, Nishant, et al.
Publicado: (2025)

Beyond Worst-Case Dimensionality Reduction for Sparse Vectors
por: Silwal, Sandeep, et al.
Publicado: (2025)

Reweighted Solutions for Weighted Low Rank Approximation
por: Woodruff, David P., et al.
Publicado: (2024)

Personal Information Parroting in Language Models
por: Subramani, Nishant, et al.
Publicado: (2026)

Low-rank Momentum Factorization for Memory Efficient Training
por: Mahdavinia, Pouria, et al.
Publicado: (2025)

Efficient Attention via Pre-Scoring: Prioritizing Informative Keys in Transformers
por: Li, Zhexiang, et al.
Publicado: (2025)

Efficient Dynamic Structured Sparse Training with Learned Shuffles
por: Tyagi, Abhishek, et al.
Publicado: (2025)

Generative Value Conflicts Reveal LLM Priorities
por: Liu, Andy, et al.
Publicado: (2025)

Lotus: Efficient LLM Training by Randomized Low-Rank Gradient Projection with Adaptive Subspace Switching
por: Miao, Tianhao, et al.
Publicado: (2026)

Gradient-Congruity Guided Federated Sparse Training
por: Tian, Chris Xing, et al.
Publicado: (2024)

TensorGRaD: Tensor Gradient Robust Decomposition for Memory-Efficient Neural Operator Training
por: Loeschcke, Sebastian, et al.
Publicado: (2025)

SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training
por: Adnan, Mohammed, et al.
Publicado: (2026)

FOAM: Blocked State Folding for Memory-Efficient LLM Training
por: Wen, Ziqing, et al.
Publicado: (2025)

PRAC: Principal-Random Subspace for LLM Activation Compression and Memory-Efficient Training
por: Li, Yanyi, et al.
Publicado: (2026)

BigMac: Breaking the Pareto Frontier of Compute and Memory in Multimodal LLM Training
por: Zhang, Zili, et al.
Publicado: (2026)

Sensor Response-Time Reduction using Long-Short Term Memory Network Forecasting
por: Ward, Simon J., et al.
Publicado: (2024)

Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC
por: Lin, Wu, et al.
Publicado: (2023)

LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM
por: Refael, Yehonathan, et al.
Publicado: (2025)

CompAct: Compressed Activations for Memory-Efficient LLM Training
por: Shamshoum, Yara, et al.
Publicado: (2024)

Memory-Efficient Differentially Private Training with Gradient Random Projection
por: Mulrooney, Alex, et al.
Publicado: (2025)

Memory-Efficient LLM Training with Online Subspace Descent
por: Liang, Kaizhao, et al.
Publicado: (2024)

Low Rank and Sparse Fourier Structure in Recurrent Networks Trained on Modular Addition
por: Rangamani, Akshay
Publicado: (2025)

AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
por: Refael, Yehonathan, et al.
Publicado: (2024)

On Socially Fair Low-Rank Approximation and Column Subset Selection
por: Song, Zhao, et al.
Publicado: (2024)