:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Niu, Jingcheng, Yuan, Xingdi, Wang, Tong, Saghir, Hamidreza, Abdi, Amir H.
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Computation and Language
Accesso online:	https://arxiv.org/abs/2505.09338
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

ChocoLlama: Lessons Learned From Teaching Llamas Dutch
di: Meeus, Matthieu, et al.
Pubblicazione: (2024)

Llama-Mob: Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction
di: Tang, Peizhi, et al.
Pubblicazione: (2024)

How Language Models Process Out-of-Distribution Inputs: A Two-Pathway Framework
di: Saghir, Hamidreza
Pubblicazione: (2026)

MGH Radiology Llama: A Llama 3 70B Model for Radiology
di: Shi, Yucheng, et al.
Pubblicazione: (2024)

Do Llamas Work in English? On the Latent Language of Multilingual Transformers
di: Wendler, Chris, et al.
Pubblicazione: (2024)

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
di: Fathullah, Yassir, et al.
Pubblicazione: (2023)

BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
di: Gade, Pranav, et al.
Pubblicazione: (2023)

The Llama 3 Herd of Models
di: Grattafiori, Aaron, et al.
Pubblicazione: (2024)

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
di: He, Zhengfu, et al.
Pubblicazione: (2024)

SinLlama -- A Large Language Model for Sinhala
di: Aravinda, H. W. K., et al.
Pubblicazione: (2025)

Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation
di: Sani, Samin Mahdizadeh, et al.
Pubblicazione: (2024)

Steering Llama 2 via Contrastive Activation Addition
di: Panickssery, Nina, et al.
Pubblicazione: (2023)

Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
di: Greschner, Lynn, et al.
Pubblicazione: (2024)

Llama meets EU: Investigating the European Political Spectrum through the Lens of LLMs
di: Chalkidis, Ilias, et al.
Pubblicazione: (2024)

To Err Is Human, but Llamas Can Learn It Too
di: Luhtaru, Agnes, et al.
Pubblicazione: (2024)

Code Llama: Open Foundation Models for Code
di: Rozière, Baptiste, et al.
Pubblicazione: (2023)

Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition
di: Deng, Keqi, et al.
Pubblicazione: (2024)

Llama-Nemotron: Efficient Reasoning Models
di: Bercovich, Akhiad, et al.
Pubblicazione: (2025)

Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions
di: Tang, Bangsheng, et al.
Pubblicazione: (2025)

TinyLlama: An Open-Source Small Language Model
di: Zhang, Peiyuan, et al.
Pubblicazione: (2024)

Extending Llama-3's Context Ten-Fold Overnight
di: Zhang, Peitian, et al.
Pubblicazione: (2024)

TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
di: Wang, Ruida, et al.
Pubblicazione: (2024)

BanglaLlama: LLaMA for Bangla Language
di: Zehady, Abdullah Khan, et al.
Pubblicazione: (2024)

Open Llama2 Model for the Lithuanian Language
di: Nakvosas, Artūras, et al.
Pubblicazione: (2024)

Zebra-Llama: Towards Extremely Efficient Hybrid Models
di: Yang, Mingyu, et al.
Pubblicazione: (2025)

Forbidden Facts: An Investigation of Competing Objectives in Llama-2
di: Wang, Tony T., et al.
Pubblicazione: (2023)

Llama-Mimi: Exploring the Limits of Flattened Speech Language Modeling
di: Sugiura, Issa, et al.
Pubblicazione: (2025)

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
di: Thawakar, Omkar, et al.
Pubblicazione: (2024)

Chinese-Vicuna: A Chinese Instruction-following Llama-based Model
di: Fan, Chenghao, et al.
Pubblicazione: (2025)

Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness
di: Feng, Xincan, et al.
Pubblicazione: (2024)

Feedback Indicators: The Alignment between Llama and a Teacher in Language Learning
di: Rüdian, Sylvio, et al.
Pubblicazione: (2025)

The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities
di: Research, MediaTek, et al.
Pubblicazione: (2025)

TableLlama: Towards Open Large Generalist Models for Tables
di: Zhang, Tianshu, et al.
Pubblicazione: (2023)

MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation
di: Kumar, Gurucharan Marthi Krishna, et al.
Pubblicazione: (2024)

Llama2Vec: Unsupervised Adaptation of Large Language Models for Dense Retrieval
di: Liu, Zheng, et al.
Pubblicazione: (2023)

Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer
di: Kuulmets, Hele-Andra, et al.
Pubblicazione: (2024)

Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection
di: Chen, Tianxiang, et al.
Pubblicazione: (2024)

Lugha-Llama: Adapting Large Language Models for African Languages
di: Buzaaba, Happy, et al.
Pubblicazione: (2025)

Investigating Bias Representations in Llama 2 Chat via Activation Steering
di: Lu, Dawn, et al.
Pubblicazione: (2024)

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
di: Zheng, Yaowei, et al.
Pubblicazione: (2024)