Saved in:
| Main Authors: | Gruppi, Mauricio, Dan, Soham, Murugesan, Keerthiram, Chaudhury, Subhajit |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.10174 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning
by: Basu, Kinjal, et al.
Published: (2024)
by: Basu, Kinjal, et al.
Published: (2024)
Language Guided Exploration for RL Agents in Text Environments
by: Golchha, Hitesh, et al.
Published: (2024)
by: Golchha, Hitesh, et al.
Published: (2024)
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models
by: Basavatia, Shreyas, et al.
Published: (2024)
by: Basavatia, Shreyas, et al.
Published: (2024)
Needle in the Haystack for Memory Based Large Language Models
by: Nelson, Elliot, et al.
Published: (2024)
by: Nelson, Elliot, et al.
Published: (2024)
CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design
by: Neehal, Nafis, et al.
Published: (2024)
by: Neehal, Nafis, et al.
Published: (2024)
ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval
by: Yang, David H., et al.
Published: (2026)
by: Yang, David H., et al.
Published: (2026)
Tracking the Temporal Dynamics of News Coverage of Catastrophic and Violent Events
by: Lugos, Emily, et al.
Published: (2026)
by: Lugos, Emily, et al.
Published: (2026)
Towards Aligning Language Models with Textual Feedback
by: Lloret, Saüc Abadal, et al.
Published: (2024)
by: Lloret, Saüc Abadal, et al.
Published: (2024)
Large Language Models can be Strong Self-Detoxifiers
by: Ko, Ching-Yun, et al.
Published: (2024)
by: Ko, Ching-Yun, et al.
Published: (2024)
Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations
by: Villa, Danielle, et al.
Published: (2025)
by: Villa, Danielle, et al.
Published: (2025)
Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach
by: Fernando, Heshan, et al.
Published: (2022)
by: Fernando, Heshan, et al.
Published: (2022)
Breaking mBad! Supervised Fine-tuning for Cross-Lingual Detoxification
by: Beniwal, Himanshu, et al.
Published: (2025)
by: Beniwal, Himanshu, et al.
Published: (2025)
Generation Constraint Scaling Can Mitigate Hallucination
by: Kollias, Georgios, et al.
Published: (2024)
by: Kollias, Georgios, et al.
Published: (2024)
SentinelLMs: Encrypted Input Adaptation and Fine-tuning of Language Models for Private and Secure Inference
by: Mishra, Abhijit, et al.
Published: (2023)
by: Mishra, Abhijit, et al.
Published: (2023)
Can Memory-Augmented Language Models Generalize on Reasoning-in-a-Haystack Tasks?
by: Das, Payel, et al.
Published: (2025)
by: Das, Payel, et al.
Published: (2025)
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs
by: Basu, Kinjal, et al.
Published: (2024)
by: Basu, Kinjal, et al.
Published: (2024)
Mitigating Misalignment Contagion by Steering with Implicit Traits
by: Chang, Maria, et al.
Published: (2026)
by: Chang, Maria, et al.
Published: (2026)
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning
by: Ukarapol, Trapoom, et al.
Published: (2024)
by: Ukarapol, Trapoom, et al.
Published: (2024)
NG-Router: Graph-Supervised Multi-Agent Collaboration for Nutrition Question Answering
by: Shi, Kaiwen, et al.
Published: (2025)
by: Shi, Kaiwen, et al.
Published: (2025)
AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models
by: Guggilla, Chinnappa, et al.
Published: (2025)
by: Guggilla, Chinnappa, et al.
Published: (2025)
Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata
by: Yuan, Zhengqing, et al.
Published: (2025)
by: Yuan, Zhengqing, et al.
Published: (2025)
MEDVOC: Vocabulary Adaptation for Fine-tuning Pre-trained Language Models on Medical Text Summarization
by: Balde, Gunjan, et al.
Published: (2024)
by: Balde, Gunjan, et al.
Published: (2024)
How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning
by: Rehman, Tohida, et al.
Published: (2025)
by: Rehman, Tohida, et al.
Published: (2025)
Fine-tuning Large Language Models with Sequential Instructions
by: Hu, Hanxu, et al.
Published: (2024)
by: Hu, Hanxu, et al.
Published: (2024)
Sparse Matrix in Large Language Model Fine-tuning
by: He, Haoze, et al.
Published: (2024)
by: He, Haoze, et al.
Published: (2024)
MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning
by: Zhang, Jingfan, et al.
Published: (2024)
by: Zhang, Jingfan, et al.
Published: (2024)
Fine-tuning Large Language Models for Multigenerator, Multidomain, and Multilingual Machine-Generated Text Detection
by: Xiong, Feng, et al.
Published: (2024)
by: Xiong, Feng, et al.
Published: (2024)
Advancing Single and Multi-task Text Classification through Large Language Model Fine-tuning
by: Zhao, Hang, et al.
Published: (2024)
by: Zhao, Hang, et al.
Published: (2024)
Resource-Efficient Adaptation of Large Language Models for Text Embeddings via Prompt Engineering and Contrastive Fine-tuning
by: Roth, Benedikt, et al.
Published: (2025)
by: Roth, Benedikt, et al.
Published: (2025)
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
by: Orlikowski, Matthias, et al.
Published: (2025)
by: Orlikowski, Matthias, et al.
Published: (2025)
Assertion Detection Large Language Model In-context Learning LoRA Fine-tuning
by: Ji, Yuelyu, et al.
Published: (2024)
by: Ji, Yuelyu, et al.
Published: (2024)
Vocabulary-level Memory Efficiency for Language Model Fine-tuning
by: Williams, Miles, et al.
Published: (2023)
by: Williams, Miles, et al.
Published: (2023)
Investigating the Representation of Backchannels and Fillers in Fine-tuned Language Models
by: Wang, Yu, et al.
Published: (2025)
by: Wang, Yu, et al.
Published: (2025)
AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering
by: Zhang, Zheyuan, et al.
Published: (2025)
by: Zhang, Zheyuan, et al.
Published: (2025)
OjaKV: Context-Aware Online Low-Rank KV Cache Compression
by: Zhu, Yuxuan, et al.
Published: (2025)
by: Zhu, Yuxuan, et al.
Published: (2025)
Stage-wise Fine-tuning for Graph-to-Text Generation
by: Wang, Qingyun, et al.
Published: (2021)
by: Wang, Qingyun, et al.
Published: (2021)
Semi-supervised Fine-tuning for Large Language Models
by: Luo, Junyu, et al.
Published: (2024)
by: Luo, Junyu, et al.
Published: (2024)
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
by: Najafi, Saeed, et al.
Published: (2024)
by: Najafi, Saeed, et al.
Published: (2024)
Exploring Memorization in Fine-tuned Language Models
by: Zeng, Shenglai, et al.
Published: (2023)
by: Zeng, Shenglai, et al.
Published: (2023)
On the Utility of Domain-Adjacent Fine-Tuned Model Ensembles for Few-shot Problems
by: Alam, Md Ibrahim Ibne, et al.
Published: (2024)
by: Alam, Md Ibrahim Ibne, et al.
Published: (2024)
Similar Items
-
EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning
by: Basu, Kinjal, et al.
Published: (2024) -
Language Guided Exploration for RL Agents in Text Environments
by: Golchha, Hitesh, et al.
Published: (2024) -
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models
by: Basavatia, Shreyas, et al.
Published: (2024) -
Needle in the Haystack for Memory Based Large Language Models
by: Nelson, Elliot, et al.
Published: (2024) -
CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design
by: Neehal, Nafis, et al.
Published: (2024)