:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pentyala, Shiva Kumar, Wang, Zhichao, Bi, Bin, Ramnath, Kiran, Mao, Xiang-Bo, Radhakrishnan, Regunathan, Asur, Sitaram, Na, Cheng
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2406.17923
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Reinforcement Learning for LLM Post-Training: A Survey
by: Wang, Zhichao, et al.
Published: (2024)

Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
by: Hsu, Aliyah R., et al.
Published: (2024)

UNA: A Unified Supervised Framework for Efficient LLM Alignment Across Feedback Types
by: Wang, Zhichao, et al.
Published: (2024)

Diversity Enhances an LLM's Performance in RAG and Long-context Task
by: Wang, Zhichao, et al.
Published: (2025)

PAFT: Prompt-Agnostic Fine-Tuning
by: Wei, Chenxing, et al.
Published: (2025)

PAFT: Preservation Aware Fine-Tuning for Minimal-Edit Program Repair
by: Yang, Boyang, et al.
Published: (2026)

BayesFlow: A Probability Inference Framework for Meta-Agent Assisted Workflow Generation
by: Yuan, Bo, et al.
Published: (2026)

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function
by: Wang, Zhichao, et al.
Published: (2024)

Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents
by: Choubey, Prafulla Kumar, et al.
Published: (2025)

Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning
by: Pang, Jinlong, et al.
Published: (2025)

TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation
by: Monsefi, Amin Karimi, et al.
Published: (2025)

Exploring Continual Fine-Tuning for Enhancing Language Ability in Large Language Model
by: Aggarwal, Divyanshu, et al.
Published: (2024)

GIFT: Group-Relative Implicit Fine-Tuning Integrates GRPO with DPO and UNA
by: Wang, Zhichao
Published: (2025)

Training Large Language Models To Reason In Parallel With Global Forking Tokens
by: Jia, Sheng, et al.
Published: (2025)

TuningIQA: Fine-Grained Blind Image Quality Assessment for Livestreaming Camera Tuning
by: Sheng, Xiangfei, et al.
Published: (2025)

Learning to Ideate for Machine Learning Engineering Agents
by: Zhang, Yunxiang, et al.
Published: (2026)

A multi-phase-field model for fiber-reinforced composite laminates based on puck failure theory
by: Kumar, Pavan Kumar Asur Vijaya, et al.
Published: (2026)

Amulet: Putting Complex Multi-Turn Conversations on the Stand with LLM Juries
by: Ramnath, Sahana, et al.
Published: (2025)

Absence of a Putative Mannose-Specific Phosphotransferase System Enzyme IIAB Component in a Leucocin A-Resistant Strain ofListeria monocytogenes, as Shown by Two-Dimensional Sodium Dodecyl Sulfate-Polyacrylamide Gel Electrophoresis. / M. Ramnath
by: Ramnath, M
Published: (2000)

Goal-Conditioned Supervised Learning for LLM Fine-Tuning
by: Li, Shijun, et al.
Published: (2026)

Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training
by: Shirkavand, Reza, et al.
Published: (2025)

On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text
by: Gromadzki, Michał, et al.
Published: (2026)

JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning
by: Tahir, Anique, et al.
Published: (2024)

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
by: Wang, Yubo, et al.
Published: (2025)

Pixels Under Pressure: Exploring Fine-Tuning Paradigms for Foundation Models in High-Resolution Medical Imaging
by: TehraniNasab, Zahra, et al.
Published: (2025)

Selective Mixup Fine-Tuning for Optimizing Non-Decomposable Objectives
by: Ramasubramanian, Shrinivas, et al.
Published: (2024)

Decoupled Training with Local Reinforcement Fine-Tuning in Federated Learning
by: Ma, Yuting, et al.
Published: (2026)

Communication-Aware Knowledge Distillation for Federated LLM Fine-Tuning over Wireless Networks
by: Zhang, Xinlu, et al.
Published: (2025)

FineMedLM-o1: Enhancing Medical Knowledge Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training
by: Yu, Hongzhou, et al.
Published: (2025)

CultureLLM: Incorporating Cultural Differences into Large Language Models
by: Li, Cheng, et al.
Published: (2024)

Brian Moore: An Ambassador of Feminism
by: Ramnath Singh Rathore
Published: (2019)

Bayesimax Theory: Selecting Priors by Minimizing Total Information
by: Vangala, Sitaram
Published: (2025)

Teach for America, Relay Graduate School, and the Charter School Networks: The Making of a Parallel Education Structure
by: Angus Shiva Mungal
Published: (2016)

"Personal Portfolio Builder Using MERN Stack With AI Integration"
by: Kalva, Ajay Kumar, et al.
Published: (2025)

FeedbackLLM: Metadata driven Multi-Agentic Language Agnostic Test Case Generator with Evolving prompt and Coverage Feedback
by: Jasti, Kushal, et al.
Published: (2026)

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
by: Ju, Yiming, et al.
Published: (2024)

Quantum-Enhanced LLM Efficient Fine Tuning
by: Kong, Xiaofei, et al.
Published: (2025)

Secure Cross-Silo Synthetic Genomic Data Generation
by: Filienko, Daniil, et al.
Published: (2026)

Numerical study of unsteady bioconvective transport of oxytactic microorganisms over a stretching cone
by: Pentyala Srinivasa Rao, et al.
Published: (2025)

CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources
by: Pentyala, Sikha, et al.
Published: (2024)