:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jaffe, Andrew, Reicin, Noah, Choi, Jinho D.
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.18924
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Enhancing LLM Instruction Following: An Evaluation-Driven Multi-Agentic Workflow for Prompt Instructions Optimization
by: Purpura, Alberto, et al.
Published: (2026)

Multi-Level Compositional Reasoning for Interactive Instruction Following
by: Bhambri, Suvaansh, et al.
Published: (2023)

Financial Instruction Following Evaluation (FIFE)
by: Matlin, Glenn, et al.
Published: (2025)

Boosting Instruction Following at Scale
by: Elder, Ben, et al.
Published: (2025)

M-IFEval: Multilingual Instruction-Following Evaluation
by: Dussolle, Antoine, et al.
Published: (2025)

Situated Instruction Following
by: Min, So Yeon, et al.
Published: (2024)

Online Continual Learning For Interactive Instruction Following Agents
by: Kim, Byeonghwi, et al.
Published: (2024)

Beyond Instruction Following: Evaluating Inferential Rule Following of Large Language Models
by: Sun, Wangtao, et al.
Published: (2024)

The Instruction Gap: LLMs get lost in Following Instruction
by: Tripathi, Vishesh, et al.
Published: (2025)

Instruction-Following Evaluation in Function Calling for Large Language Models
by: Skripko, Nikolai
Published: (2025)

Embodied Instruction Following in Unknown Environments
by: Wu, Zhenyu, et al.
Published: (2024)

MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
by: Lou, Renze, et al.
Published: (2023)

ReIFE: Re-evaluating Instruction-Following Evaluation
by: Liu, Yixin, et al.
Published: (2024)

Training with Pseudo-Code for Instruction Following
by: Kumar, Prince, et al.
Published: (2025)

WildIFEval: Instruction Following in the Wild
by: Lior, Gili, et al.
Published: (2025)

LsrIF: Enhancing Logic-Structured Instruction Following of Large Language Models
by: Ren, Qingyu, et al.
Published: (2026)

DIALEVAL: Automated Type-Theoretic Evaluation of LLM Instruction Following
by: Basta, Nardine, et al.
Published: (2026)

Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering
by: Adlakha, Vaibhav, et al.
Published: (2023)

MaXIFE: Multilingual and Cross-lingual Instruction Following Evaluation
by: Liu, Yile, et al.
Published: (2025)

LIFEBench: Evaluating Length Instruction Following in Large Language Models
by: Zhang, Wei, et al.
Published: (2025)

Neuro-Symbolic Verification on Instruction Following of LLMs
by: Su, Yiming, et al.
Published: (2026)

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments
by: Kim, Taewoong, et al.
Published: (2024)

Enhancing and Assessing Instruction-Following with Fine-Grained Instruction Variants
by: Yang, Jiuding, et al.
Published: (2024)

Deconstructing Instruction-Following: A New Benchmark for Granular Evaluation of Large Language Model Instruction Compliance Abilities
by: Purpura, Alberto, et al.
Published: (2026)

Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents
by: Kim, Byeonghwi, et al.
Published: (2023)

Zero-Shot Instruction Following in RL via Structured LTL Representations
by: Jackermeier, Mathias, et al.
Published: (2026)

Zero-Shot Instruction Following in RL via Structured LTL Representations
by: Giuri, Mattia, et al.
Published: (2025)

UltraIF: Advancing Instruction Following from the Wild
by: An, Kaikai, et al.
Published: (2025)

HREF: Human Response-Guided Evaluation of Instruction Following in Language Models
by: Lyu, Xinxi, et al.
Published: (2024)

RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models
by: Yan, Jianhao, et al.
Published: (2024)

Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions
by: Cassano, Federico, et al.
Published: (2023)

How Many Instructions Can LLMs Follow at Once?
by: Jaroslawicz, Daniel, et al.
Published: (2025)

Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following
by: Ren, Qingyu, et al.
Published: (2025)

Verifiably Following Complex Robot Instructions with Foundation Models
by: Quartey, Benedict, et al.
Published: (2024)

Revisiting the Reliability of Language Models in Instruction-Following
by: Dong, Jianshuo, et al.
Published: (2025)

Is In-Context Learning Sufficient for Instruction Following in LLMs?
by: Zhao, Hao, et al.
Published: (2024)

InFoBench: Evaluating Instruction Following Ability in Large Language Models
by: Qin, Yiwei, et al.
Published: (2024)

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models
by: Fu, Tingchen, et al.
Published: (2025)

LexInstructEval: Lexical Instruction Following Evaluation for Large Language Models
by: Ren, Huimin, et al.
Published: (2025)

Non-instructional Fine-tuning: Enabling Instruction-Following Capabilities in Pre-trained Language Models without Instruction-Following Data
by: Xie, Juncheng, et al.
Published: (2024)