Saved in:
| Main Author: | Yu, Hongkun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.03227 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Logic Sketch Prompting (LSP): A Deterministic and Interpretable Prompting Method
by: Tripathi, Satvik
Published: (2025)
by: Tripathi, Satvik
Published: (2025)
Protecting Context and Prompts: Deterministic Security for Non-Deterministic AI
by: Rajagopalan, Mohan, et al.
Published: (2026)
by: Rajagopalan, Mohan, et al.
Published: (2026)
PlotChain: Deterministic Checkpointed Evaluation of Multimodal LLMs on Engineering Plot Reading
by: Ravishankara, Mayank
Published: (2026)
by: Ravishankara, Mayank
Published: (2026)
On the Holographic Geometry of Deterministic Computation
by: Nye, Logan
Published: (2025)
by: Nye, Logan
Published: (2025)
TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation
by: Wang, Ranmin, et al.
Published: (2024)
by: Wang, Ranmin, et al.
Published: (2024)
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
by: Hu, Xavier, et al.
Published: (2026)
by: Hu, Xavier, et al.
Published: (2026)
LLMs as Method Actors: A Model for Prompt Engineering and Architecture
by: Doyle, Colin
Published: (2024)
by: Doyle, Colin
Published: (2024)
Scalable Solution Methods for Dec-POMDPs with Deterministic Dynamics
by: You, Yang, et al.
Published: (2025)
by: You, Yang, et al.
Published: (2025)
The Self-Execution Benchmark: Measuring LLMs' Attempts to Overcome Their Lack of Self-Execution
by: Ezra, Elon, et al.
Published: (2025)
by: Ezra, Elon, et al.
Published: (2025)
Teaching LLMs to Learn Tool Trialing and Execution through Environment Interaction
by: Gao, Xingjie, et al.
Published: (2026)
by: Gao, Xingjie, et al.
Published: (2026)
Deterministic or probabilistic? The psychology of LLMs as random number generators
by: Coronado-Blázquez, Javier
Published: (2025)
by: Coronado-Blázquez, Javier
Published: (2025)
Can LLMs Compress (and Decompress)? Evaluating Code Understanding and Execution via Invertibility
by: Maveli, Nickil, et al.
Published: (2026)
by: Maveli, Nickil, et al.
Published: (2026)
From Agent Loops to Deterministic Graphs: Execution Lineage for Reproducible AI-Native Work
by: Rosen, Josh, et al.
Published: (2026)
by: Rosen, Josh, et al.
Published: (2026)
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs
by: Sinha, Akshit, et al.
Published: (2025)
by: Sinha, Akshit, et al.
Published: (2025)
Executable Governance for AI: Translating Policies into Rules Using LLMs
by: Datla, Gautam Varma, et al.
Published: (2025)
by: Datla, Gautam Varma, et al.
Published: (2025)
Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs
by: Hua, Andong, et al.
Published: (2025)
by: Hua, Andong, et al.
Published: (2025)
On LLM-generated Logic Programs and their Inference Execution Methods
by: Tarau, Paul
Published: (2025)
by: Tarau, Paul
Published: (2025)
Accelerated AI Inference via Dynamic Execution Methods
by: Barad, Haim, et al.
Published: (2024)
by: Barad, Haim, et al.
Published: (2024)
PromptKeeper: Safeguarding System Prompts for LLMs
by: Jiang, Zhifeng, et al.
Published: (2024)
by: Jiang, Zhifeng, et al.
Published: (2024)
The Illusion of Procedural Reasoning: Measuring Long-Horizon FSM Execution in LLMs
by: Samiei, Mahdi, et al.
Published: (2025)
by: Samiei, Mahdi, et al.
Published: (2025)
Computing the Reachability Value of Posterior-Deterministic POMDPs
by: Fijalkow, Nathanaël, et al.
Published: (2026)
by: Fijalkow, Nathanaël, et al.
Published: (2026)
How Focused Are LLMs? A Quantitative Study via Repetitive Deterministic Prediction Tasks
by: Hou, Wanda, et al.
Published: (2025)
by: Hou, Wanda, et al.
Published: (2025)
A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics
by: Bode, Jonas, et al.
Published: (2024)
by: Bode, Jonas, et al.
Published: (2024)
Deterministic Computing Power Networking: Architecture, Technologies and Prospects
by: Jia, Qingmin, et al.
Published: (2024)
by: Jia, Qingmin, et al.
Published: (2024)
When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs
by: Seleznyov, Mikhail, et al.
Published: (2025)
by: Seleznyov, Mikhail, et al.
Published: (2025)
ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs
by: Thomas, Rohan Subramanian, et al.
Published: (2026)
by: Thomas, Rohan Subramanian, et al.
Published: (2026)
Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning
by: Li, Xinran, et al.
Published: (2025)
by: Li, Xinran, et al.
Published: (2025)
On the Feasibility of Using MultiModal LLMs to Execute AR Social Engineering Attacks
by: Bi, Ting, et al.
Published: (2025)
by: Bi, Ting, et al.
Published: (2025)
LLMs for LLMs: A Structured Prompting Methodology for Long Legal Documents
by: Klem, Strahinja, et al.
Published: (2025)
by: Klem, Strahinja, et al.
Published: (2025)
Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition
by: Bumb, Mayank, et al.
Published: (2025)
by: Bumb, Mayank, et al.
Published: (2025)
Anchor-Controlled Generative Adversarial Network for High-Fidelity Electromagnetic and Structurally Diverse Metasurface Design
by: Zeng, Yunhui, et al.
Published: (2024)
by: Zeng, Yunhui, et al.
Published: (2024)
Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
by: Ning, Xuefei, et al.
Published: (2023)
by: Ning, Xuefei, et al.
Published: (2023)
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
by: Gehring, Jonas, et al.
Published: (2024)
by: Gehring, Jonas, et al.
Published: (2024)
Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs
by: Sakharova, Marina, et al.
Published: (2025)
by: Sakharova, Marina, et al.
Published: (2025)
Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs
by: Xie, Juncheng, et al.
Published: (2025)
by: Xie, Juncheng, et al.
Published: (2025)
TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks
by: Zhang, Qihai, et al.
Published: (2025)
by: Zhang, Qihai, et al.
Published: (2025)
Bring Your Own Prompts: Use-Case-Specific Bias and Fairness Evaluation for LLMs
by: Bouchard, Dylan
Published: (2024)
by: Bouchard, Dylan
Published: (2024)
Evaluating Code Generation of LLMs in Advanced Computer Science Problems
by: Catir, Emir, et al.
Published: (2025)
by: Catir, Emir, et al.
Published: (2025)
What I cannot execute, I do not understand: Training and Evaluating LLMs on Program Execution Traces
by: Armengol-Estapé, Jordi, et al.
Published: (2025)
by: Armengol-Estapé, Jordi, et al.
Published: (2025)
CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation
by: Yan, Weixiang, et al.
Published: (2023)
by: Yan, Weixiang, et al.
Published: (2023)
Similar Items
-
Logic Sketch Prompting (LSP): A Deterministic and Interpretable Prompting Method
by: Tripathi, Satvik
Published: (2025) -
Protecting Context and Prompts: Deterministic Security for Non-Deterministic AI
by: Rajagopalan, Mohan, et al.
Published: (2026) -
PlotChain: Deterministic Checkpointed Evaluation of Multimodal LLMs on Engineering Plot Reading
by: Ravishankara, Mayank
Published: (2026) -
On the Holographic Geometry of Deterministic Computation
by: Nye, Logan
Published: (2025) -
TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation
by: Wang, Ranmin, et al.
Published: (2024)