Saved in:
| Main Authors: | Pentyala, Shiva Kumar, Wang, Zhichao, Bi, Bin, Ramnath, Kiran, Mao, Xiang-Bo, Radhakrishnan, Regunathan, Asur, Sitaram, Na, Cheng |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.17923 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reinforcement Learning for LLM Post-Training: A Survey
by: Wang, Zhichao, et al.
Published: (2024)
by: Wang, Zhichao, et al.
Published: (2024)
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
by: Hsu, Aliyah R., et al.
Published: (2024)
by: Hsu, Aliyah R., et al.
Published: (2024)
UNA: A Unified Supervised Framework for Efficient LLM Alignment Across Feedback Types
by: Wang, Zhichao, et al.
Published: (2024)
by: Wang, Zhichao, et al.
Published: (2024)
Diversity Enhances an LLM's Performance in RAG and Long-context Task
by: Wang, Zhichao, et al.
Published: (2025)
by: Wang, Zhichao, et al.
Published: (2025)
PAFT: Prompt-Agnostic Fine-Tuning
by: Wei, Chenxing, et al.
Published: (2025)
by: Wei, Chenxing, et al.
Published: (2025)
PAFT: Preservation Aware Fine-Tuning for Minimal-Edit Program Repair
by: Yang, Boyang, et al.
Published: (2026)
by: Yang, Boyang, et al.
Published: (2026)
BayesFlow: A Probability Inference Framework for Meta-Agent Assisted Workflow Generation
by: Yuan, Bo, et al.
Published: (2026)
by: Yuan, Bo, et al.
Published: (2026)
UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function
by: Wang, Zhichao, et al.
Published: (2024)
by: Wang, Zhichao, et al.
Published: (2024)
Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents
by: Choubey, Prafulla Kumar, et al.
Published: (2025)
by: Choubey, Prafulla Kumar, et al.
Published: (2025)
Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning
by: Pang, Jinlong, et al.
Published: (2025)
by: Pang, Jinlong, et al.
Published: (2025)
TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation
by: Monsefi, Amin Karimi, et al.
Published: (2025)
by: Monsefi, Amin Karimi, et al.
Published: (2025)
Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model
by: Aggarwal, Divyanshu, et al.
Published: (2024)
by: Aggarwal, Divyanshu, et al.
Published: (2024)
GIFT: Group-Relative Implicit Fine-Tuning Integrates GRPO with DPO and UNA
by: Wang, Zhichao
Published: (2025)
by: Wang, Zhichao
Published: (2025)
Training Large Language Models To Reason In Parallel With Global Forking Tokens
by: Jia, Sheng, et al.
Published: (2025)
by: Jia, Sheng, et al.
Published: (2025)
TuningIQA: Fine-Grained Blind Image Quality Assessment for Livestreaming Camera Tuning
by: Sheng, Xiangfei, et al.
Published: (2025)
by: Sheng, Xiangfei, et al.
Published: (2025)
Learning to Ideate for Machine Learning Engineering Agents
by: Zhang, Yunxiang, et al.
Published: (2026)
by: Zhang, Yunxiang, et al.
Published: (2026)
A multi-phase-field model for fiber-reinforced composite laminates based on puck failure theory
by: Kumar, Pavan Kumar Asur Vijaya, et al.
Published: (2026)
by: Kumar, Pavan Kumar Asur Vijaya, et al.
Published: (2026)
Amulet: Putting Complex Multi-Turn Conversations on the Stand with LLM Juries
by: Ramnath, Sahana, et al.
Published: (2025)
by: Ramnath, Sahana, et al.
Published: (2025)
Absence of a Putative Mannose-Specific Phosphotransferase System Enzyme IIAB Component in a Leucocin A-Resistant Strain ofListeria monocytogenes, as Shown by Two-Dimensional Sodium Dodecyl Sulfate-Polyacrylamide Gel Electrophoresis. / M. Ramnath
by: Ramnath, M
Published: (2000)
by: Ramnath, M
Published: (2000)
Goal-Conditioned Supervised Learning for LLM Fine-Tuning
by: Li, Shijun, et al.
Published: (2026)
by: Li, Shijun, et al.
Published: (2026)
Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training
by: Shirkavand, Reza, et al.
Published: (2025)
by: Shirkavand, Reza, et al.
Published: (2025)
On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text
by: Gromadzki, Michał, et al.
Published: (2026)
by: Gromadzki, Michał, et al.
Published: (2026)
JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning
by: Tahir, Anique, et al.
Published: (2024)
by: Tahir, Anique, et al.
Published: (2024)
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
by: Wang, Yubo, et al.
Published: (2025)
by: Wang, Yubo, et al.
Published: (2025)
Pixels Under Pressure: Exploring Fine-Tuning Paradigms for Foundation Models in High-Resolution Medical Imaging
by: TehraniNasab, Zahra, et al.
Published: (2025)
by: TehraniNasab, Zahra, et al.
Published: (2025)
Selective Mixup Fine-Tuning for Optimizing Non-Decomposable Objectives
by: Ramasubramanian, Shrinivas, et al.
Published: (2024)
by: Ramasubramanian, Shrinivas, et al.
Published: (2024)
Decoupled Training with Local Reinforcement Fine-Tuning in Federated Learning
by: Ma, Yuting, et al.
Published: (2026)
by: Ma, Yuting, et al.
Published: (2026)
Communication-Aware Knowledge Distillation for Federated LLM Fine-Tuning over Wireless Networks
by: Zhang, Xinlu, et al.
Published: (2025)
by: Zhang, Xinlu, et al.
Published: (2025)
FineMedLM-o1: Enhancing Medical Knowledge Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training
by: Yu, Hongzhou, et al.
Published: (2025)
by: Yu, Hongzhou, et al.
Published: (2025)
CultureLLM: Incorporating Cultural Differences into Large Language Models
by: Li, Cheng, et al.
Published: (2024)
by: Li, Cheng, et al.
Published: (2024)
Brian Moore: An Ambassador of Feminism
by: Ramnath Singh Rathore
Published: (2019)
by: Ramnath Singh Rathore
Published: (2019)
Bayesimax Theory: Selecting Priors by Minimizing Total Information
by: Vangala, Sitaram
Published: (2025)
by: Vangala, Sitaram
Published: (2025)
Teach for America, Relay Graduate School, and the Charter School Networks: The Making of a Parallel Education Structure
by: Angus Shiva Mungal
Published: (2016)
by: Angus Shiva Mungal
Published: (2016)
"Personal Portfolio Builder Using MERN Stack With AI Integration"
by: Kalva, Ajay Kumar, et al.
Published: (2025)
by: Kalva, Ajay Kumar, et al.
Published: (2025)
FeedbackLLM: Metadata driven Multi-Agentic Language Agnostic Test Case Generator with Evolving prompt and Coverage Feedback
by: Jasti, Kushal, et al.
Published: (2026)
by: Jasti, Kushal, et al.
Published: (2026)
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
by: Ju, Yiming, et al.
Published: (2024)
by: Ju, Yiming, et al.
Published: (2024)
Quantum-Enhanced LLM Efficient Fine Tuning
by: Kong, Xiaofei, et al.
Published: (2025)
by: Kong, Xiaofei, et al.
Published: (2025)
Secure Cross-Silo Synthetic Genomic Data Generation
by: Filienko, Daniil, et al.
Published: (2026)
by: Filienko, Daniil, et al.
Published: (2026)
Numerical study of unsteady bioconvective transport of oxytactic microorganisms over a stretching cone
by: Pentyala Srinivasa Rao, et al.
Published: (2025)
by: Pentyala Srinivasa Rao, et al.
Published: (2025)
CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources
by: Pentyala, Sikha, et al.
Published: (2024)
by: Pentyala, Sikha, et al.
Published: (2024)
Similar Items
-
Reinforcement Learning for LLM Post-Training: A Survey
by: Wang, Zhichao, et al.
Published: (2024) -
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
by: Hsu, Aliyah R., et al.
Published: (2024) -
UNA: A Unified Supervised Framework for Efficient LLM Alignment Across Feedback Types
by: Wang, Zhichao, et al.
Published: (2024) -
Diversity Enhances an LLM's Performance in RAG and Long-context Task
by: Wang, Zhichao, et al.
Published: (2025) -
PAFT: Prompt-Agnostic Fine-Tuning
by: Wei, Chenxing, et al.
Published: (2025)