:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tso, Joseph, Schmittou, Preston, Huynh, Quan, Hutchins, Jibran
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.22465
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

R-ConstraintBench: Evaluating LLMs on NP-Complete Scheduling
by: Jain, Raj, et al.
Published: (2025)

MCJudgeBench: A Benchmark for Constraint-Level Judge Evaluation in Multi-Constraint Instruction Following
by: Lee, Jaeyun, et al.
Published: (2026)

ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints
by: Handa, Divij, et al.
Published: (2024)

ORIGAMISPACE: Benchmarking Multimodal LLMs in Multi-Step Spatial Reasoning with Mathematical Constraints
by: Xu, Rui, et al.
Published: (2025)

An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)

BikeBench: A Bicycle Design Benchmark for Generative Models with Objectives and Constraints
by: Regenwetter, Lyle, et al.
Published: (2025)

Can Large Language Models Reason and Optimize Under Constraints?
by: Bernier, Fabien, et al.
Published: (2026)

Reasoning with Preference Constraints: A Benchmark for Language Models in Many-to-One Matching Markets
by: Fauchard, Marylou, et al.
Published: (2025)

ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
by: Shi, Weichun, et al.
Published: (2025)

Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences
by: Cheng, Quan
Published: (2026)

SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes
by: Li, Kuan, et al.
Published: (2026)

CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases
by: Xue, Xiaona, et al.
Published: (2026)

Direct Encoding of Declare Constraints in ASP
by: Chiariello, Francesco, et al.
Published: (2024)

PilotBench: A Benchmark for General Aviation Agents with Safety Constraints
by: Wu, Yalun, et al.
Published: (2026)

Hyperparameter Optimization of Constraint Programming Solvers
by: Haddad, Hedieh, et al.
Published: (2026)

Explainable Distributed Constraint Optimization Problems
by: Rachmut, Ben, et al.
Published: (2025)

DCP-Bench-Open: Evaluating LLMs for Constraint Modelling of Discrete Combinatorial Problems
by: Michailidis, Kostis, et al.
Published: (2025)

Hard Constraints Meet Soft Generation: Guaranteed Feasibility for LLM-based Combinatorial Optimization
by: Liu, Yang, et al.
Published: (2026)

TCP: a Benchmark for Temporal Constraint-Based Planning
by: Ding, Zifeng, et al.
Published: (2025)

Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints
by: Jehu-Appiah, Rodney
Published: (2026)

The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning
by: Li, Yubo, et al.
Published: (2026)

Constraint-Based Analysis of Reasoning Shortcuts in Neurosymbolic Learning
by: Takemura, Akihiro, et al.
Published: (2026)

Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning
by: Zhang, Xuan, et al.
Published: (2025)

LinAlg-Bench: A Forensic Benchmark Revealing Structural Failure Modes in LLM Mathematical Reasoning
by: Agarwal, Shradha, et al.
Published: (2026)

Omission Constraints Decay While Commission Constraints Persist in Long-Context LLM Agents
by: Gamage, Yeran
Published: (2026)

Generalizing Constraint Models in Constraint Acquisition
by: Tsouros, Dimos, et al.
Published: (2024)

KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs
by: Markowitz, Elan, et al.
Published: (2025)

CombiBench: Benchmarking LLM Capability for Combinatorial Mathematics
by: Liu, Junqi, et al.
Published: (2025)

LLM-Enhanced Bayesian Optimization for Efficient Analog Layout Constraint Generation
by: Chen, Guojin, et al.
Published: (2024)

Evaluating the Safety and Skill Reasoning of Large Reasoning Models Under Compute Constraints
by: Balaji, Adarsha, et al.
Published: (2025)

ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation
by: Yang, Zhuojie, et al.
Published: (2026)

FeynmanBench: Benchmarking Multimodal LLMs on Diagrammatic Physics Reasoning
by: Wang, Zeyu, et al.
Published: (2026)

MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning
by: Wang, Xukai, et al.
Published: (2025)

Living Off the LLM: How LLMs Will Change Adversary Tactics
by: Oesch, Sean, et al.
Published: (2025)

OckBench: Measuring the Efficiency of LLM Reasoning
by: Du, Zheng, et al.
Published: (2025)

Benchmarking Complex Instruction-Following with Multiple Constraints Composition
by: Wen, Bosi, et al.
Published: (2024)

MMR-Bench: A Comprehensive Benchmark for Multimodal LLM Routing
by: Ma, Haoxuan, et al.
Published: (2026)

TopoBench: Benchmarking LLMs on Hard Topological Reasoning
by: Maniparambil, Mayug, et al.
Published: (2026)

Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning
by: Han, Xinchen, et al.
Published: (2026)

Eidoku: A Neuro-Symbolic Verification Gate for LLM Reasoning via Structural Constraint Satisfaction
by: Miya, Shinobu
Published: (2025)