:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Dréano, Sören, Molloy, Derek, Murphy, Noel
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Machine Learning Computation and Language
Online-Zugang:	https://arxiv.org/abs/2511.17589
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training
von: Dialameh, Maryam, et al.
Veröffentlicht: (2025)

LogLLaMA: Transformer-based log anomaly detection with LLaMA
von: Yang, Zhuoyi, et al.
Veröffentlicht: (2025)

Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca
von: Cui, Yiming, et al.
Veröffentlicht: (2023)

BanglaLlama: LLaMA for Bangla Language
von: Zehady, Abdullah Khan, et al.
Veröffentlicht: (2024)

360-LLaMA-Factory: Plug & Play Sequence Parallelism for Long Post-Training
von: Zou, Haosheng, et al.
Veröffentlicht: (2025)

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain
von: Gema, Aryo Pradipta, et al.
Veröffentlicht: (2023)

Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods
von: Kim, Bo-Kyeong, et al.
Veröffentlicht: (2024)

LLaMA Pro: Progressive LLaMA with Block Expansion
von: Wu, Chengyue, et al.
Veröffentlicht: (2024)

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference
von: Kavehzadeh, Parsa, et al.
Veröffentlicht: (2023)

LoMA: Lossless Compressed Memory Attention
von: Wang, Yumeng, et al.
Veröffentlicht: (2024)

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
von: Xia, Mengzhou, et al.
Veröffentlicht: (2023)

Enhancing Document-Level Question Answering via Multi-Hop Retrieval-Augmented Generation with LLaMA 3
von: Huang, Xinyue, et al.
Veröffentlicht: (2025)

I Have No Mouth, and I Must Rhyme: Uncovering Internal Phonetic Representations in LLaMA 3.2
von: McLaughlin, Oliver, et al.
Veröffentlicht: (2025)

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
von: Zhang, Renrui, et al.
Veröffentlicht: (2023)

LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
von: Qu, Xiaoye, et al.
Veröffentlicht: (2024)

Zero-Shot End-to-End Relation Extraction in Chinese: A Comparative Study of Gemini, LLaMA and ChatGPT
von: Du, Shaoshuai, et al.
Veröffentlicht: (2025)

LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
von: Zhu, Tong, et al.
Veröffentlicht: (2024)

The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA
von: Youngmin, Lee, et al.
Veröffentlicht: (2024)

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
von: Wang, Zhengyi, et al.
Veröffentlicht: (2024)

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets
von: Perkowski, Ernest, et al.
Veröffentlicht: (2024)

Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages
von: Andersland, Michael
Veröffentlicht: (2024)

Lossless Compression of Large Language Model-Generated Text via Next-Token Prediction
von: Mao, Yu, et al.
Veröffentlicht: (2025)

LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing
von: Di Palma, Dario, et al.
Veröffentlicht: (2025)

Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets
von: Azime, Israel Abebe, et al.
Veröffentlicht: (2024)

How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
von: Yuan, Fei, et al.
Veröffentlicht: (2023)

LLaMA-Based Models for Aspect-Based Sentiment Analysis
von: Šmíd, Jakub, et al.
Veröffentlicht: (2025)

Evaluating LLaMA 3.2 for Software Vulnerability Detection
von: Gonçalves, José, et al.
Veröffentlicht: (2025)

Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study
von: Ma, Chi, et al.
Veröffentlicht: (2024)

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
von: Lee, Jewon, et al.
Veröffentlicht: (2025)

FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression
von: Mittu, Fazal, et al.
Veröffentlicht: (2024)

LLaMA based Punctuation Restoration With Forward Pass Only Decoding
von: Pang, Yutong, et al.
Veröffentlicht: (2024)

MusiScene: Leveraging MU-LLaMA for Scene Imagination and Enhanced Video Background Music Generation
von: Izzati, Fathinah, et al.
Veröffentlicht: (2025)

An empirical study of LLaMA3 quantization: from LLMs to MLLMs
von: Huang, Wei, et al.
Veröffentlicht: (2024)

What If We Recaption Billions of Web Images with LLaMA-3?
von: Li, Xianhang, et al.
Veröffentlicht: (2024)

LLaMA Beyond English: An Empirical Study on Language Capability Transfer
von: Zhao, Jun, et al.
Veröffentlicht: (2024)

Me LLaMA: Foundation Large Language Models for Medical Applications
von: Xie, Qianqian, et al.
Veröffentlicht: (2024)

Lossless Token Sequence Compression via Meta-Tokens
von: Harvill, John, et al.
Veröffentlicht: (2025)

Training LLMs over Neurally Compressed Text
von: Lester, Brian, et al.
Veröffentlicht: (2024)

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
von: Chen, Nuo, et al.
Veröffentlicht: (2023)

Instruction Finetuning LLaMA-3-8B Model Using LoRA for Financial Named Entity Recognition
von: Lian, Zhiming
Veröffentlicht: (2026)