Saved in:
| Main Authors: | Tso, Joseph, Schmittou, Preston, Huynh, Quan, Hutchins, Jibran |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.22465 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
R-ConstraintBench: Evaluating LLMs on NP-Complete Scheduling
by: Jain, Raj, et al.
Published: (2025)
by: Jain, Raj, et al.
Published: (2025)
MCJudgeBench: A Benchmark for Constraint-Level Judge Evaluation in Multi-Constraint Instruction Following
by: Lee, Jaeyun, et al.
Published: (2026)
by: Lee, Jaeyun, et al.
Published: (2026)
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints
by: Handa, Divij, et al.
Published: (2024)
by: Handa, Divij, et al.
Published: (2024)
ORIGAMISPACE: Benchmarking Multimodal LLMs in Multi-Step Spatial Reasoning with Mathematical Constraints
by: Xu, Rui, et al.
Published: (2025)
by: Xu, Rui, et al.
Published: (2025)
An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)
by: Sun, Yi, et al.
Published: (2025)
BikeBench: A Bicycle Design Benchmark for Generative Models with Objectives and Constraints
by: Regenwetter, Lyle, et al.
Published: (2025)
by: Regenwetter, Lyle, et al.
Published: (2025)
Can Large Language Models Reason and Optimize Under Constraints?
by: Bernier, Fabien, et al.
Published: (2026)
by: Bernier, Fabien, et al.
Published: (2026)
Reasoning with Preference Constraints: A Benchmark for Language Models in Many-to-One Matching Markets
by: Fauchard, Marylou, et al.
Published: (2025)
by: Fauchard, Marylou, et al.
Published: (2025)
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
by: Shi, Weichun, et al.
Published: (2025)
by: Shi, Weichun, et al.
Published: (2025)
Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences
by: Cheng, Quan
Published: (2026)
by: Cheng, Quan
Published: (2026)
SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes
by: Li, Kuan, et al.
Published: (2026)
by: Li, Kuan, et al.
Published: (2026)
CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases
by: Xue, Xiaona, et al.
Published: (2026)
by: Xue, Xiaona, et al.
Published: (2026)
Direct Encoding of Declare Constraints in ASP
by: Chiariello, Francesco, et al.
Published: (2024)
by: Chiariello, Francesco, et al.
Published: (2024)
PilotBench: A Benchmark for General Aviation Agents with Safety Constraints
by: Wu, Yalun, et al.
Published: (2026)
by: Wu, Yalun, et al.
Published: (2026)
Hyperparameter Optimization of Constraint Programming Solvers
by: Haddad, Hedieh, et al.
Published: (2026)
by: Haddad, Hedieh, et al.
Published: (2026)
Explainable Distributed Constraint Optimization Problems
by: Rachmut, Ben, et al.
Published: (2025)
by: Rachmut, Ben, et al.
Published: (2025)
DCP-Bench-Open: Evaluating LLMs for Constraint Modelling of Discrete Combinatorial Problems
by: Michailidis, Kostis, et al.
Published: (2025)
by: Michailidis, Kostis, et al.
Published: (2025)
Hard Constraints Meet Soft Generation: Guaranteed Feasibility for LLM-based Combinatorial Optimization
by: Liu, Yang, et al.
Published: (2026)
by: Liu, Yang, et al.
Published: (2026)
TCP: a Benchmark for Temporal Constraint-Based Planning
by: Ding, Zifeng, et al.
Published: (2025)
by: Ding, Zifeng, et al.
Published: (2025)
Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints
by: Jehu-Appiah, Rodney
Published: (2026)
by: Jehu-Appiah, Rodney
Published: (2026)
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning
by: Li, Yubo, et al.
Published: (2026)
by: Li, Yubo, et al.
Published: (2026)
Constraint-Based Analysis of Reasoning Shortcuts in Neurosymbolic Learning
by: Takemura, Akihiro, et al.
Published: (2026)
by: Takemura, Akihiro, et al.
Published: (2026)
Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning
by: Zhang, Xuan, et al.
Published: (2025)
by: Zhang, Xuan, et al.
Published: (2025)
LinAlg-Bench: A Forensic Benchmark Revealing Structural Failure Modes in LLM Mathematical Reasoning
by: Agarwal, Shradha, et al.
Published: (2026)
by: Agarwal, Shradha, et al.
Published: (2026)
Omission Constraints Decay While Commission Constraints Persist in Long-Context LLM Agents
by: Gamage, Yeran
Published: (2026)
by: Gamage, Yeran
Published: (2026)
Generalizing Constraint Models in Constraint Acquisition
by: Tsouros, Dimos, et al.
Published: (2024)
by: Tsouros, Dimos, et al.
Published: (2024)
KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs
by: Markowitz, Elan, et al.
Published: (2025)
by: Markowitz, Elan, et al.
Published: (2025)
CombiBench: Benchmarking LLM Capability for Combinatorial Mathematics
by: Liu, Junqi, et al.
Published: (2025)
by: Liu, Junqi, et al.
Published: (2025)
LLM-Enhanced Bayesian Optimization for Efficient Analog Layout Constraint Generation
by: Chen, Guojin, et al.
Published: (2024)
by: Chen, Guojin, et al.
Published: (2024)
Evaluating the Safety and Skill Reasoning of Large Reasoning Models Under Compute Constraints
by: Balaji, Adarsha, et al.
Published: (2025)
by: Balaji, Adarsha, et al.
Published: (2025)
ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation
by: Yang, Zhuojie, et al.
Published: (2026)
by: Yang, Zhuojie, et al.
Published: (2026)
FeynmanBench: Benchmarking Multimodal LLMs on Diagrammatic Physics Reasoning
by: Wang, Zeyu, et al.
Published: (2026)
by: Wang, Zeyu, et al.
Published: (2026)
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning
by: Wang, Xukai, et al.
Published: (2025)
by: Wang, Xukai, et al.
Published: (2025)
Living Off the LLM: How LLMs Will Change Adversary Tactics
by: Oesch, Sean, et al.
Published: (2025)
by: Oesch, Sean, et al.
Published: (2025)
OckBench: Measuring the Efficiency of LLM Reasoning
by: Du, Zheng, et al.
Published: (2025)
by: Du, Zheng, et al.
Published: (2025)
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
by: Wen, Bosi, et al.
Published: (2024)
by: Wen, Bosi, et al.
Published: (2024)
MMR-Bench: A Comprehensive Benchmark for Multimodal LLM Routing
by: Ma, Haoxuan, et al.
Published: (2026)
by: Ma, Haoxuan, et al.
Published: (2026)
TopoBench: Benchmarking LLMs on Hard Topological Reasoning
by: Maniparambil, Mayug, et al.
Published: (2026)
by: Maniparambil, Mayug, et al.
Published: (2026)
Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning
by: Han, Xinchen, et al.
Published: (2026)
by: Han, Xinchen, et al.
Published: (2026)
Eidoku: A Neuro-Symbolic Verification Gate for LLM Reasoning via Structural Constraint Satisfaction
by: Miya, Shinobu
Published: (2025)
by: Miya, Shinobu
Published: (2025)
Similar Items
-
R-ConstraintBench: Evaluating LLMs on NP-Complete Scheduling
by: Jain, Raj, et al.
Published: (2025) -
MCJudgeBench: A Benchmark for Constraint-Level Judge Evaluation in Multi-Constraint Instruction Following
by: Lee, Jaeyun, et al.
Published: (2026) -
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints
by: Handa, Divij, et al.
Published: (2024) -
ORIGAMISPACE: Benchmarking Multimodal LLMs in Multi-Step Spatial Reasoning with Mathematical Constraints
by: Xu, Rui, et al.
Published: (2025) -
An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
by: Sun, Yi, et al.
Published: (2025)