Library Catalog

Bibliographic Details
Main Authors:	Fleshman, William; Khan, Aleem; Marone, Marc; Van Durme, Benjamin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence; Computation and Language
Online Access:	https://arxiv.org/abs/2404.08417

Similar Items

SEQR: Secure and Efficient QR-based LoRA Routing
by: Fleshman, William, et al.
Published: (2025)

LoRA-Augmented Generation (LAG) for Knowledge-Intensive Language Tasks
by: Fleshman, William, et al.
Published: (2025)

RE-Adapt: Reverse Engineered Adaptation of Large Language Models
by: Fleshman, William, et al.
Published: (2024)

SpectR: Dynamically Composing LM Experts with Spectral Routing
by: Fleshman, William, et al.
Published: (2025)

RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation
by: Fleshman, William, et al.
Published: (2024)

mmBERT: A Modern Multilingual Encoder with Annealed Language Learning
by: Marone, Marc, et al.
Published: (2025)

"According to ...": Prompting Language Models Improves Quoting from Pre-Training Data
by: Weller, Orion, et al.
Published: (2023)

Generative Adapter: Contextualizing Language Models in Parameters with A Single Forward Pass
by: Chen, Tong, et al.
Published: (2024)

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
by: Wang, Boshi, et al.
Published: (2024)

SELF-[IN]CORRECT: LLMs Struggle with Discriminating Self-Generated Responses
by: Jiang, Dongwei, et al.
Published: (2024)

Always Tell Me The Odds: Fine-grained Conditional Probability Estimation
by: Wang, Liaoyaqi, et al.
Published: (2025)

Do Androids Know They're Only Dreaming of Electric Sheep?
by: CH-Wang, Sky, et al.
Published: (2023)

Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
by: Hu, Michael Y., et al.
Published: (2025)

MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools
by: Subramani, Nishant, et al.
Published: (2025)

How to Train Data-Efficient LLMs
by: Sachdeva, Noveen, et al.
Published: (2024)

Continuous Approximations for Improving Quantization Aware Training of LLMs
by: Li, He, et al.
Published: (2024)

Seq vs Seq: An Open Suite of Paired Encoders and Decoders
by: Weller, Orion, et al.
Published: (2025)

Training-Free Bayesianization for Low-Rank Adapters of Large Language Models
by: Shi, Haizhou, et al.
Published: (2024)

Learning to Route for Dynamic Adapter Composition in Continual Learning with Language Models
by: Araujo, Vladimir, et al.
Published: (2024)

Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
by: Bandarkar, Lucas, et al.
Published: (2024)

JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models
by: Dragomir, Alexandra, et al.
Published: (2026)

K-Merge: Online Continual Merging of Adapters for On-device Large Language Models
by: Shenaj, Donald, et al.
Published: (2025)

CuMA: Aligning LLMs with Sparse Cultural Values via Demographic-Aware Mixture of Adapters
by: Sun, Ao, et al.
Published: (2026)

Data-driven Clustering and Merging of Adapters for On-device Large Language Models
by: Bohdal, Ondrej, et al.
Published: (2026)

Learning Self-Interpretation from Interpretability Artifacts: Training Lightweight Adapters on Vector-Label Pairs
by: Pepper, Keenan, et al.
Published: (2026)

KV-Distill: Nearly Lossless Learnable Context Compression for LLMs
by: Chari, Vivek, et al.
Published: (2025)

Efficient Exploration for LLMs
by: Dwaracherla, Vikranth, et al.
Published: (2024)

AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs
by: Kang, Feiyang, et al.
Published: (2024)

Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models
by: Son, Hyegang, et al.
Published: (2024)

Predicting Training Re-evaluation Curves Enables Effective Data Curriculums for LLMs
by: Bergsma, Shane, et al.
Published: (2025)

Finer Parameter Steps for Low-Rank PEFT: A Controlled Study with CP Tensor Adapters
by: Wang, Xinjue, et al.
Published: (2026)

Ensembles of Low-Rank Expert Adapters
by: Li, Yinghao, et al.
Published: (2025)

MoKA: Mixture of Kronecker Adapters
by: Sadeghi, Mohammadreza, et al.
Published: (2025)

Random Initialization of Gated Sparse Adapters
by: Retault, Vi, et al.
Published: (2025)

Dodo: Dynamic Contextual Compression for Decoder-only LMs
by: Qin, Guanghui, et al.
Published: (2023)

PII-Scope: A Comprehensive Study on Training Data PII Extraction Attacks in LLMs
by: Nakka, Krishna Kanth, et al.
Published: (2024)

Dual-Personalizing Adapter for Federated Foundation Models
by: Yang, Yiyuan, et al.
Published: (2024)

Learning Adapter Rank via Symmetry Breaking
by: Doyle, Cooper, et al.
Published: (2025)

Neutral Residues: Revisiting Adapters for Model Extension
by: Talla, Franck Signe, et al.
Published: (2024)

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data
by: Treutlein, Johannes, et al.
Published: (2024)