:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Yu, Hongkun
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.03227
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Logic Sketch Prompting (LSP): A Deterministic and Interpretable Prompting Method
by: Tripathi, Satvik
Published: (2025)

Protecting Context and Prompts: Deterministic Security for Non-Deterministic AI
by: Rajagopalan, Mohan, et al.
Published: (2026)

PlotChain: Deterministic Checkpointed Evaluation of Multimodal LLMs on Engineering Plot Reading
by: Ravishankara, Mayank
Published: (2026)

On the Holographic Geometry of Deterministic Computation
by: Nye, Logan
Published: (2025)

TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation
by: Wang, Ranmin, et al.
Published: (2024)

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
by: Hu, Xavier, et al.
Published: (2026)

LLMs as Method Actors: A Model for Prompt Engineering and Architecture
by: Doyle, Colin
Published: (2024)

Scalable Solution Methods for Dec-POMDPs with Deterministic Dynamics
by: You, Yang, et al.
Published: (2025)

The Self-Execution Benchmark: Measuring LLMs' Attempts to Overcome Their Lack of Self-Execution
by: Ezra, Elon, et al.
Published: (2025)

Teaching LLMs to Learn Tool Trialing and Execution through Environment Interaction
by: Gao, Xingjie, et al.
Published: (2026)

Deterministic or probabilistic? The psychology of LLMs as random number generators
by: Coronado-Blázquez, Javier
Published: (2025)

Can LLMs Compress (and Decompress)? Evaluating Code Understanding and Execution via Invertibility
by: Maveli, Nickil, et al.
Published: (2026)

From Agent Loops to Deterministic Graphs: Execution Lineage for Reproducible AI-Native Work
by: Rosen, Josh, et al.
Published: (2026)

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
by: Sinha, Akshit, et al.
Published: (2025)

Executable Governance for AI: Translating Policies into Rules Using LLMs
by: Datla, Gautam Varma, et al.
Published: (2025)

Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs
by: Hua, Andong, et al.
Published: (2025)

On LLM-generated Logic Programs and their Inference Execution Methods
by: Tarau, Paul
Published: (2025)

Accelerated AI Inference via Dynamic Execution Methods
by: Barad, Haim, et al.
Published: (2024)

PromptKeeper: Safeguarding System Prompts for LLMs
by: Jiang, Zhifeng, et al.
Published: (2024)

The Illusion of Procedural Reasoning: Measuring Long-Horizon FSM Execution in LLMs
by: Samiei, Mahdi, et al.
Published: (2025)

Computing the Reachability Value of Posterior-Deterministic POMDPs
by: Fijalkow, Nathanaël, et al.
Published: (2026)

How Focused Are LLMs? A Quantitative Study via Repetitive Deterministic Prediction Tasks
by: Hou, Wanda, et al.
Published: (2025)

A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics
by: Bode, Jonas, et al.
Published: (2024)

Deterministic Computing Power Networking: Architecture, Technologies and Prospects
by: Jia, Qingmin, et al.
Published: (2024)

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs
by: Seleznyov, Mikhail, et al.
Published: (2025)

ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs
by: Thomas, Rohan Subramanian, et al.
Published: (2026)

Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning
by: Li, Xinran, et al.
Published: (2025)

On the Feasibility of Using MultiModal LLMs to Execute AR Social Engineering Attacks
by: Bi, Ting, et al.
Published: (2025)

LLMs for LLMs: A Structured Prompting Methodology for Long Legal Documents
by: Klem, Strahinja, et al.
Published: (2025)

Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition
by: Bumb, Mayank, et al.
Published: (2025)

Anchor-Controlled Generative Adversarial Network for High-Fidelity Electromagnetic and Structurally Diverse Metasurface Design
by: Zeng, Yunhui, et al.
Published: (2024)

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
by: Ning, Xuefei, et al.
Published: (2023)

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
by: Gehring, Jonas, et al.
Published: (2024)

Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs
by: Sakharova, Marina, et al.
Published: (2025)

Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs
by: Xie, Juncheng, et al.
Published: (2025)

TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks
by: Zhang, Qihai, et al.
Published: (2025)

Bring Your Own Prompts: Use-Case-Specific Bias and Fairness Evaluation for LLMs
by: Bouchard, Dylan
Published: (2024)

Evaluating Code Generation of LLMs in Advanced Computer Science Problems
by: Catir, Emir, et al.
Published: (2025)

What I cannot execute, I do not understand: Training and Evaluating LLMs on Program Execution Traces
by: Armengol-Estapé, Jordi, et al.
Published: (2025)

CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation
by: Yan, Weixiang, et al.
Published: (2023)