:: Library Catalog

Image de couverture de livre

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Zou, Haosheng, Lv, Xiaowei, Jia, Shousheng, Li, Lin, Gong, Xiaochun, Zhang, Xiangzheng
Format:	Preprint
Publié:	2025
Sujets:	Computation and Language Machine Learning
Accès en ligne:	https://arxiv.org/abs/2505.22296
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Documents similaires

ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training
par: Dialameh, Maryam, et autres
Publié: (2025)

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond
par: Wen, Liang, et autres
Publié: (2025)

LogLLaMA: Transformer-based log anomaly detection with LLaMA
par: Yang, Zhuoyi, et autres
Publié: (2025)

LLaMA Pro: Progressive LLaMA with Block Expansion
par: Wu, Chengyue, et autres
Publié: (2024)

LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
par: Qu, Xiaoye, et autres
Publié: (2024)

BanglaLlama: LLaMA for Bangla Language
par: Zehady, Abdullah Khan, et autres
Publié: (2024)

Llamazip: Leveraging LLaMA for Lossless Text Compression and Training Dataset Detection
par: Dréano, Sören, et autres
Publié: (2025)

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain
par: Gema, Aryo Pradipta, et autres
Publié: (2023)

Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following
par: Wang, Chenyang, et autres
Publié: (2025)

Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca
par: Cui, Yiming, et autres
Publié: (2023)

LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
par: Zhu, Tong, et autres
Publié: (2024)

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
par: Chen, Nuo, et autres
Publié: (2023)

Enhancing Document-Level Question Answering via Multi-Hop Retrieval-Augmented Generation with LLaMA 3
par: Huang, Xinyue, et autres
Publié: (2025)

Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods
par: Kim, Bo-Kyeong, et autres
Publié: (2024)

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
par: Xia, Mengzhou, et autres
Publié: (2023)

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference
par: Kavehzadeh, Parsa, et autres
Publié: (2023)

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
par: Zhang, Renrui, et autres
Publié: (2023)

Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages
par: Andersland, Michael
Publié: (2024)

LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing
par: Di Palma, Dario, et autres
Publié: (2025)

How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
par: Yuan, Fei, et autres
Publié: (2023)

I Have No Mouth, and I Must Rhyme: Uncovering Internal Phonetic Representations in LLaMA 3.2
par: McLaughlin, Oliver, et autres
Publié: (2025)

LLaMA-Based Models for Aspect-Based Sentiment Analysis
par: Šmíd, Jakub, et autres
Publié: (2025)

LLaMA based Punctuation Restoration With Forward Pass Only Decoding
par: Pang, Yutong, et autres
Publié: (2024)

Zero-Shot End-to-End Relation Extraction in Chinese: A Comparative Study of Gemini, LLaMA and ChatGPT
par: Du, Shaoshuai, et autres
Publié: (2025)

LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction
par: Zou, Bo, et autres
Publié: (2024)

LLaMA Beyond English: An Empirical Study on Language Capability Transfer
par: Zhao, Jun, et autres
Publié: (2024)

Me LLaMA: Foundation Large Language Models for Medical Applications
par: Xie, Qianqian, et autres
Publié: (2024)

An empirical study of LLaMA3 quantization: from LLMs to MLLMs
par: Huang, Wei, et autres
Publié: (2024)

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
par: Wang, Zhengyi, et autres
Publié: (2024)

VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models
par: Liu, Chonghan, et autres
Publié: (2026)

LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
par: Kang, Boyi, et autres
Publié: (2025)

The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA
par: Youngmin, Lee, et autres
Publié: (2024)

What If We Recaption Billions of Web Images with LLaMA-3?
par: Li, Xianhang, et autres
Publié: (2024)

LLaMA-Omni: Seamless Speech Interaction with Large Language Models
par: Fang, Qingkai, et autres
Publié: (2024)

Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study
par: Ma, Chi, et autres
Publié: (2024)

ChatGPT vs Gemini vs LLaMA on Multilingual Sentiment Analysis
par: Buscemi, Alessio, et autres
Publié: (2024)

Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets
par: Azime, Israel Abebe, et autres
Publié: (2024)

LLaMA-Reg: Using LLaMA 2 for Unsupervised Medical Image Registration
par: Ma, Mingrui, et autres
Publié: (2024)

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
par: Zhang, Di, et autres
Publié: (2024)

Evaluating LLaMA 3.2 for Software Vulnerability Detection
par: Gonçalves, José, et autres
Publié: (2025)