:: Library Catalog

Bibliographic Details
Main Authors:	Xia, Yuchen; Kim, Jiho; Chen, Yuhan; Ye, Haojie; Kundu, Souvik; Hao, Cong; Talati, Nishil
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2408.04693

Similar Items

LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation
by: Azizi, Seyedarmin, et al.
Published: (2024)

AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
by: Liu, Zeyu, et al.
Published: (2024)

Sphinx: Efficiently Serving Novel View Synthesis using Regression-Guided Selective Refinement
by: Xia, Yuchen, et al.
Published: (2025)

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
by: Kang, Hao, et al.
Published: (2024)

MoE-Lens: Towards the Hardware Limit of High-Throughput MoE LLM Serving Under Resource Constraints
by: Yuan, Yichao, et al.
Published: (2025)

Rethinking Parameter Sharing for LLM Fine-Tuning with Multiple LoRAs
by: Ban, Hao, et al.
Published: (2025)

EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning
by: Kong, Lingxiao, et al.
Published: (2025)

Alignment Dynamics in LLM Fine-Tuning
by: Huang, Yuhan, et al.
Published: (2026)

LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs
by: Kim, Taeho, et al.
Published: (2024)

MoDM: Efficient Serving for Image Generation via Mixture-of-Diffusion Models
by: Xia, Yuchen, et al.
Published: (2025)

Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems
by: Agrawal, Amey, et al.
Published: (2024)

Supervised Fine-Tuning Needs to Unlock the Potential of Token Priority
by: Shen, Zhanming, et al.
Published: (2026)

Understanding Forgetting in LLM Supervised Fine-Tuning and Preference Learning -- A Convex Optimization Perspective
by: Fernando, Heshan, et al.
Published: (2024)

Supervised Fine-Tuning as Inverse Reinforcement Learning
by: Sun, Hao
Published: (2024)

Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation
by: Potraghloo, Erfan Baghaei, et al.
Published: (2025)

Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
by: Liu, Yong, et al.
Published: (2024)

MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning
by: Lu, Yiyang, et al.
Published: (2026)

Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows
by: Ayala, Orlando Marquez, et al.
Published: (2025)

Tuning LLM Judge Design Decisions for 1/1000 of the Cost
by: Salinas, David, et al.
Published: (2025)

One Token Away from Collapse: The Fragility of Instruction-Tuned Helpfulness
by: Potraghloo, Erfan Baghaei, et al.
Published: (2026)

KAIROS: Stateful, Context-Aware Power-Efficient Agentic Inference Serving
by: Yuan, Yichao, et al.
Published: (2026)

Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis
by: Wang, Xu, et al.
Published: (2025)

Selection of LLM Fine-Tuning Data based on Orthogonal Rules
by: Li, Xiaomin, et al.
Published: (2024)

Fine-Tuning Language Models with Reward Learning on Policy
by: Lang, Hao, et al.
Published: (2024)

Bridging the Gap: Enhancing LLM Performance for Low-Resource African Languages with New Benchmarks, Fine-Tuning, and Cultural Adjustments
by: Alhanai, Tuka, et al.
Published: (2024)

ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine
by: Qasem, Rabee, et al.
Published: (2024)

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
by: You, Haoran, et al.
Published: (2024)

Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning
by: Wang, Fangxin, et al.
Published: (2026)

Stabilizing LLM Supervised Fine-Tuning via Explicit Distributional Control
by: Wang, Xinyu, et al.
Published: (2026)

Neural Networks for Learnable and Scalable Influence Estimation of Instruction Fine-Tuning Data
by: Agarwal, Ishika, et al.
Published: (2025)

CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation
by: Fawi, Muhammad
Published: (2024)

Aligning Backchannel and Dialogue Context Representations via Contrastive LLM Fine-Tuning
by: Qian, Livia, et al.
Published: (2026)

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
by: Ju, Yiming, et al.
Published: (2024)

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
by: Jin, Hongye, et al.
Published: (2024)

AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air
by: Yang, Shiyi, et al.
Published: (2025)

LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard
by: Rao, Varun, et al.
Published: (2025)

Understanding the planning of LLM agents: A survey
by: Huang, Xu, et al.
Published: (2024)

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks
by: Poppi, Samuele, et al.
Published: (2024)

Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
by: Gao, Ziqi, et al.
Published: (2024)

Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models
by: Son, Hyegang, et al.
Published: (2024)