Saved in:
| Main Authors: | Gade, Pranav, Lermen, Simon, Rogers-Smith, Charlie, Ladish, Jeffrey |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.00117 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
by: Lermen, Simon, et al.
Published: (2023)
by: Lermen, Simon, et al.
Published: (2023)
Applying Refusal-Vector Ablation to Llama 3.1 70B Agents
by: Lermen, Simon, et al.
Published: (2024)
by: Lermen, Simon, et al.
Published: (2024)
Badllama 3: removing safety finetuning from Llama 3 in minutes
by: Volkov, Dmitrii
Published: (2024)
by: Volkov, Dmitrii
Published: (2024)
ChocoLlama: Lessons Learned From Teaching Llamas Dutch
by: Meeus, Matthieu, et al.
Published: (2024)
by: Meeus, Matthieu, et al.
Published: (2024)
MGH Radiology Llama: A Llama 3 70B Model for Radiology
by: Shi, Yucheng, et al.
Published: (2024)
by: Shi, Yucheng, et al.
Published: (2024)
Investigating Bias Representations in Llama 2 Chat via Activation Steering
by: Lu, Dawn, et al.
Published: (2024)
by: Lu, Dawn, et al.
Published: (2024)
Llama-Mob: Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction
by: Tang, Peizhi, et al.
Published: (2024)
by: Tang, Peizhi, et al.
Published: (2024)
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
by: He, Zhengfu, et al.
Published: (2024)
by: He, Zhengfu, et al.
Published: (2024)
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs
by: Niu, Jingcheng, et al.
Published: (2025)
by: Niu, Jingcheng, et al.
Published: (2025)
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
by: Fathullah, Yassir, et al.
Published: (2023)
by: Fathullah, Yassir, et al.
Published: (2023)
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi
by: Choudhury, Monojit, et al.
Published: (2025)
by: Choudhury, Monojit, et al.
Published: (2025)
The Llama 3 Herd of Models
by: Grattafiori, Aaron, et al.
Published: (2024)
by: Grattafiori, Aaron, et al.
Published: (2024)
Code Llama: Open Foundation Models for Code
by: Rozière, Baptiste, et al.
Published: (2023)
by: Rozière, Baptiste, et al.
Published: (2023)
To Err Is Human, but Llamas Can Learn It Too
by: Luhtaru, Agnes, et al.
Published: (2024)
by: Luhtaru, Agnes, et al.
Published: (2024)
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
by: Nadeau, David, et al.
Published: (2024)
by: Nadeau, David, et al.
Published: (2024)
Open Llama2 Model for the Lithuanian Language
by: Nakvosas, Artūras, et al.
Published: (2024)
by: Nakvosas, Artūras, et al.
Published: (2024)
Llama-Nemotron: Efficient Reasoning Models
by: Bercovich, Akhiad, et al.
Published: (2025)
by: Bercovich, Akhiad, et al.
Published: (2025)
SinLlama -- A Large Language Model for Sinhala
by: Aravinda, H. W. K., et al.
Published: (2025)
by: Aravinda, H. W. K., et al.
Published: (2025)
Extending Llama-3's Context Ten-Fold Overnight
by: Zhang, Peitian, et al.
Published: (2024)
by: Zhang, Peitian, et al.
Published: (2024)
Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions
by: Tang, Bangsheng, et al.
Published: (2025)
by: Tang, Bangsheng, et al.
Published: (2025)
Steering Llama 2 via Contrastive Activation Addition
by: Panickssery, Nina, et al.
Published: (2023)
by: Panickssery, Nina, et al.
Published: (2023)
BanglaLlama: LLaMA for Bangla Language
by: Zehady, Abdullah Khan, et al.
Published: (2024)
by: Zehady, Abdullah Khan, et al.
Published: (2024)
Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection
by: Chen, Tianxiang, et al.
Published: (2024)
by: Chen, Tianxiang, et al.
Published: (2024)
Do Llamas Work in English? On the Latent Language of Multilingual Transformers
by: Wendler, Chris, et al.
Published: (2024)
by: Wendler, Chris, et al.
Published: (2024)
TinyLlama: An Open-Source Small Language Model
by: Zhang, Peiyuan, et al.
Published: (2024)
by: Zhang, Peiyuan, et al.
Published: (2024)
Zebra-Llama: Towards Extremely Efficient Hybrid Models
by: Yang, Mingyu, et al.
Published: (2025)
by: Yang, Mingyu, et al.
Published: (2025)
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
by: Thawakar, Omkar, et al.
Published: (2024)
by: Thawakar, Omkar, et al.
Published: (2024)
Llama-Mimi: Exploring the Limits of Flattened Speech Language Modeling
by: Sugiura, Issa, et al.
Published: (2025)
by: Sugiura, Issa, et al.
Published: (2025)
Forbidden Facts: An Investigation of Competing Objectives in Llama-2
by: Wang, Tony T., et al.
Published: (2023)
by: Wang, Tony T., et al.
Published: (2023)
Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations
by: Chi, Jianfeng, et al.
Published: (2024)
by: Chi, Jianfeng, et al.
Published: (2024)
Llama2Vec: Unsupervised Adaptation of Large Language Models for Dense Retrieval
by: Liu, Zheng, et al.
Published: (2023)
by: Liu, Zheng, et al.
Published: (2023)
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness
by: Feng, Xincan, et al.
Published: (2024)
by: Feng, Xincan, et al.
Published: (2024)
Incomplete Tasks Induce Shutdown Resistance in Some Frontier LLMs
by: Schlatter, Jeremy, et al.
Published: (2025)
by: Schlatter, Jeremy, et al.
Published: (2025)
Feedback Indicators: The Alignment between Llama and a Teacher in Language Learning
by: Rüdian, Sylvio, et al.
Published: (2025)
by: Rüdian, Sylvio, et al.
Published: (2025)
Chinese-Vicuna: A Chinese Instruction-following Llama-based Model
by: Fan, Chenghao, et al.
Published: (2025)
by: Fan, Chenghao, et al.
Published: (2025)
The Mysterious Case of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics of Meta's Llama 2 Model
by: Smith, Brenden, et al.
Published: (2024)
by: Smith, Brenden, et al.
Published: (2024)
TableLlama: Towards Open Large Generalist Models for Tables
by: Zhang, Tianshu, et al.
Published: (2023)
by: Zhang, Tianshu, et al.
Published: (2023)
You can remove GPT2's LayerNorm by fine-tuning
by: Heimersheim, Stefan
Published: (2024)
by: Heimersheim, Stefan
Published: (2024)
Code Generation and Algorithmic Problem Solving Using Llama 3.1 405B
by: Deroy, Aniket, et al.
Published: (2024)
by: Deroy, Aniket, et al.
Published: (2024)
Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
by: Greschner, Lynn, et al.
Published: (2024)
by: Greschner, Lynn, et al.
Published: (2024)
Similar Items
-
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
by: Lermen, Simon, et al.
Published: (2023) -
Applying Refusal-Vector Ablation to Llama 3.1 70B Agents
by: Lermen, Simon, et al.
Published: (2024) -
Badllama 3: removing safety finetuning from Llama 3 in minutes
by: Volkov, Dmitrii
Published: (2024) -
ChocoLlama: Lessons Learned From Teaching Llamas Dutch
by: Meeus, Matthieu, et al.
Published: (2024) -
MGH Radiology Llama: A Llama 3 70B Model for Radiology
by: Shi, Yucheng, et al.
Published: (2024)