:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Cui, Langyuan, Ling, Chun Kai, Ng, Hwee Tou
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence Computer Science and Game Theory I.2.7; I.2.8
Online Access:	https://arxiv.org/abs/2602.01708
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment
by: Ansell, Rebecca, et al.
Published: (2026)

REPOT: Recoverable Program-of-Thought via Checkpoint Repair
by: Mazaheri, Parsa
Published: (2026)

HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment
by: Jang, Yoonjin, et al.
Published: (2026)

GuardVal: Dynamic Large Language Model Jailbreak Evaluation for Comprehensive Safety Testing
by: Zhang, Peiyan, et al.
Published: (2025)

Critical Insights into Leading Conversational AI Models
by: Kohli, Urja, et al.
Published: (2025)

ChatGPT4PCG Competition: Character-like Level Generation for Science Birds
by: Taveekitworachai, Pittawat, et al.
Published: (2023)

LLMs for Game Theory: Entropy-Guided In-Context Learning and Adaptive CoT Reasoning
by: Banfi, Tommaso Felice, et al.
Published: (2026)

Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents
by: Chua, Jaymari, et al.
Published: (2025)

Automated Theorem Provers Help Improve Large Language Model Reasoning
by: McGinness, Lachlan, et al.
Published: (2024)

Reinforced Language Models for Sequential Decision Making
by: Dilkes, Jim, et al.
Published: (2025)

REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models
by: Forniés-Tabuenca, Diego, et al.
Published: (2025)

ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs
by: Cui, Jian, et al.
Published: (2026)

Intention Communication and Hypothesis Likelihood in Game-Theoretic Motion Planning
by: Chahine, Makram, et al.
Published: (2022)

On Dynamic Programming Theory for Leader-Follower Stochastic Games
by: Dibangoye, Jilles Steeve, et al.
Published: (2025)

XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
by: Estevanell-Valladares, Ernesto L., et al.
Published: (2025)

CoupleEvo: Evolving Heuristics for Coupled Optimization Problems Using Large Language Models
by: Bömer, Thomas, et al.
Published: (2026)

Open-TI: Open Traffic Intelligence with Augmented Language Model
by: Da, Longchao, et al.
Published: (2023)

GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation
by: Ghandi, Taraneh, et al.
Published: (2026)

TrafficRAG: A Multimodal RAG Framework for Traffic Accident Liability Determination
by: Li, Xu, et al.
Published: (2026)

ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation
by: Taveekitworachai, Pittawat, et al.
Published: (2024)

Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning
by: Thil, Lucas-Andreï, et al.
Published: (2024)

TRACE: Temporal Rule-Anchored Chain-of-Evidence on Knowledge Graphs for Interpretable Stock Movement Prediction
by: Ding, Qianggang, et al.
Published: (2026)

Assisting humans in complex comparisons: automated information comparison at scale
by: Yuen, Truman, et al.
Published: (2024)

REVOLVE: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization
by: Zhang, Peiyan, et al.
Published: (2024)

GraphEval36K: Benchmarking Coding and Reasoning Capabilities of Large Language Models on Graph Datasets
by: Wu, Qiming, et al.
Published: (2024)

Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More
by: Frydenlund, Arvid
Published: (2025)

Can AI Assist in Olympiad Coding
by: Ren, Samuel
Published: (2025)

Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation
by: Koc, Vincent
Published: (2025)

Evaluating Large Language Models in a Complex Hidden Role Game
by: Bauer, Niklas
Published: (2026)

Resolving Action Bottleneck: Agentic Reinforcement Learning Informed by Token-Level Energy
by: He, Langzhou, et al.
Published: (2026)

Dynamic Policy Induction for Adaptive Prompt Optimization: Bridging the Efficiency-Accuracy Gap via Lightweight Reinforcement Learning
by: Xu, Jiexi
Published: (2025)

Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task
by: Ali, Hassan, et al.
Published: (2024)

Aligning LLMs on a Budget: Inference-Time Alignment with Heuristic Reward Models
by: Nakamura, Mason, et al.
Published: (2025)

Systematic Classification of Studies Investigating Social Media Conversations about Long COVID Using a Novel Zero-Shot Transformer Framework
by: Thakur, Nirmalya, et al.
Published: (2025)

Emoji Retrieval from Gibberish or Garbled Social Media Text: A Novel Methodology and A Case Study
by: Cui, Shuqi, et al.
Published: (2024)

FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation
by: Hildebrand, Samuel, et al.
Published: (2025)

Evo-DKD: Dual-Knowledge Decoding for Autonomous Ontology Evolution in Large Language Models
by: Raman, Vishal, et al.
Published: (2025)

An Explainable Collaborative Dialogue System using a Theory of Mind
by: Cohen, Philip R., et al.
Published: (2023)

Quantifying Public Response to COVID-19 Events: Introducing the Community Sentiment and Engagement Index
by: Thakur, Nirmalya, et al.
Published: (2024)

Retrieval Augmented Thought Process for Private Data Handling in Healthcare
by: Pouplin, Thomas, et al.
Published: (2024)