Guardado en:
| Autores principales: | Muhamed, Aashiq, Li, Oscar, Woodruff, David, Diab, Mona, Smith, Virginia |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2406.17660 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
por: Muhamed, Aashiq, et al.
Publicado: (2024)
por: Muhamed, Aashiq, et al.
Publicado: (2024)
CoRAG: Collaborative Retrieval-Augmented Generation
por: Muhamed, Aashiq, et al.
Publicado: (2025)
por: Muhamed, Aashiq, et al.
Publicado: (2025)
SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
por: Muhamed, Aashiq, et al.
Publicado: (2025)
por: Muhamed, Aashiq, et al.
Publicado: (2025)
DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment
por: Wedgwood, James, et al.
Publicado: (2026)
por: Wedgwood, James, et al.
Publicado: (2026)
Pando: Do Interpretability Methods Work When Models Won't Explain Themselves?
por: Zhong, Ziqian, et al.
Publicado: (2026)
por: Zhong, Ziqian, et al.
Publicado: (2026)
RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models
por: Muhamed, Aashiq, et al.
Publicado: (2025)
por: Muhamed, Aashiq, et al.
Publicado: (2025)
CCRS: A Zero-Shot LLM-as-a-Judge Framework for Comprehensive RAG Evaluation
por: Muhamed, Aashiq
Publicado: (2025)
por: Muhamed, Aashiq
Publicado: (2025)
Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs
por: Song, Xiangchen, et al.
Publicado: (2025)
por: Song, Xiangchen, et al.
Publicado: (2025)
Beyond Understanding: Evaluating the Pragmatic Gap in LLMs' Cultural Processing of Figurative Language
por: Attia, Mena, et al.
Publicado: (2025)
por: Attia, Mena, et al.
Publicado: (2025)
Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients
por: Wang, Yezhen, et al.
Publicado: (2025)
por: Wang, Yezhen, et al.
Publicado: (2025)
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
por: Zhao, Jiawei, et al.
Publicado: (2024)
por: Zhao, Jiawei, et al.
Publicado: (2024)
Emotion Classification in Low and Moderate Resource Languages
por: Tafreshi, Shabnam, et al.
Publicado: (2024)
por: Tafreshi, Shabnam, et al.
Publicado: (2024)
Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification
por: ElNokrashy, Muhammad, et al.
Publicado: (2022)
por: ElNokrashy, Muhammad, et al.
Publicado: (2022)
DWTSumm: Discrete Wavelet Transform for Document Summarization
por: Salama, Rana, et al.
Publicado: (2026)
por: Salama, Rana, et al.
Publicado: (2026)
Gradient Weight-normalized Low-rank Projection for Efficient LLM Training
por: Huang, Jia-Hong, et al.
Publicado: (2024)
por: Huang, Jia-Hong, et al.
Publicado: (2024)
Efficient Sparse Training with Structured Dropout
por: Lo, Andy
Publicado: (2024)
por: Lo, Andy
Publicado: (2024)
SimBA: Simplifying Benchmark Analysis Using Performance Matrices Alone
por: Subramani, Nishant, et al.
Publicado: (2025)
por: Subramani, Nishant, et al.
Publicado: (2025)
Beyond Worst-Case Dimensionality Reduction for Sparse Vectors
por: Silwal, Sandeep, et al.
Publicado: (2025)
por: Silwal, Sandeep, et al.
Publicado: (2025)
Reweighted Solutions for Weighted Low Rank Approximation
por: Woodruff, David P., et al.
Publicado: (2024)
por: Woodruff, David P., et al.
Publicado: (2024)
Personal Information Parroting in Language Models
por: Subramani, Nishant, et al.
Publicado: (2026)
por: Subramani, Nishant, et al.
Publicado: (2026)
Low-rank Momentum Factorization for Memory Efficient Training
por: Mahdavinia, Pouria, et al.
Publicado: (2025)
por: Mahdavinia, Pouria, et al.
Publicado: (2025)
Efficient Attention via Pre-Scoring: Prioritizing Informative Keys in Transformers
por: Li, Zhexiang, et al.
Publicado: (2025)
por: Li, Zhexiang, et al.
Publicado: (2025)
Efficient Dynamic Structured Sparse Training with Learned Shuffles
por: Tyagi, Abhishek, et al.
Publicado: (2025)
por: Tyagi, Abhishek, et al.
Publicado: (2025)
Generative Value Conflicts Reveal LLM Priorities
por: Liu, Andy, et al.
Publicado: (2025)
por: Liu, Andy, et al.
Publicado: (2025)
Lotus: Efficient LLM Training by Randomized Low-Rank Gradient Projection with Adaptive Subspace Switching
por: Miao, Tianhao, et al.
Publicado: (2026)
por: Miao, Tianhao, et al.
Publicado: (2026)
Gradient-Congruity Guided Federated Sparse Training
por: Tian, Chris Xing, et al.
Publicado: (2024)
por: Tian, Chris Xing, et al.
Publicado: (2024)
TensorGRaD: Tensor Gradient Robust Decomposition for Memory-Efficient Neural Operator Training
por: Loeschcke, Sebastian, et al.
Publicado: (2025)
por: Loeschcke, Sebastian, et al.
Publicado: (2025)
SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training
por: Adnan, Mohammed, et al.
Publicado: (2026)
por: Adnan, Mohammed, et al.
Publicado: (2026)
FOAM: Blocked State Folding for Memory-Efficient LLM Training
por: Wen, Ziqing, et al.
Publicado: (2025)
por: Wen, Ziqing, et al.
Publicado: (2025)
PRAC: Principal-Random Subspace for LLM Activation Compression and Memory-Efficient Training
por: Li, Yanyi, et al.
Publicado: (2026)
por: Li, Yanyi, et al.
Publicado: (2026)
BigMac: Breaking the Pareto Frontier of Compute and Memory in Multimodal LLM Training
por: Zhang, Zili, et al.
Publicado: (2026)
por: Zhang, Zili, et al.
Publicado: (2026)
Sensor Response-Time Reduction using Long-Short Term Memory Network Forecasting
por: Ward, Simon J., et al.
Publicado: (2024)
por: Ward, Simon J., et al.
Publicado: (2024)
Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC
por: Lin, Wu, et al.
Publicado: (2023)
por: Lin, Wu, et al.
Publicado: (2023)
LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM
por: Refael, Yehonathan, et al.
Publicado: (2025)
por: Refael, Yehonathan, et al.
Publicado: (2025)
CompAct: Compressed Activations for Memory-Efficient LLM Training
por: Shamshoum, Yara, et al.
Publicado: (2024)
por: Shamshoum, Yara, et al.
Publicado: (2024)
Memory-Efficient Differentially Private Training with Gradient Random Projection
por: Mulrooney, Alex, et al.
Publicado: (2025)
por: Mulrooney, Alex, et al.
Publicado: (2025)
Memory-Efficient LLM Training with Online Subspace Descent
por: Liang, Kaizhao, et al.
Publicado: (2024)
por: Liang, Kaizhao, et al.
Publicado: (2024)
Low Rank and Sparse Fourier Structure in Recurrent Networks Trained on Modular Addition
por: Rangamani, Akshay
Publicado: (2025)
por: Rangamani, Akshay
Publicado: (2025)
AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
por: Refael, Yehonathan, et al.
Publicado: (2024)
por: Refael, Yehonathan, et al.
Publicado: (2024)
On Socially Fair Low-Rank Approximation and Column Subset Selection
por: Song, Zhao, et al.
Publicado: (2024)
por: Song, Zhao, et al.
Publicado: (2024)
Ejemplares similares
-
Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
por: Muhamed, Aashiq, et al.
Publicado: (2024) -
CoRAG: Collaborative Retrieval-Augmented Generation
por: Muhamed, Aashiq, et al.
Publicado: (2025) -
SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
por: Muhamed, Aashiq, et al.
Publicado: (2025) -
DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment
por: Wedgwood, James, et al.
Publicado: (2026) -
Pando: Do Interpretability Methods Work When Models Won't Explain Themselves?
por: Zhong, Ziqian, et al.
Publicado: (2026)