Saved in:
| Main Authors: | Cui, Langyuan, Ling, Chun Kai, Ng, Hwee Tou |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment
by: Ansell, Rebecca, et al.
Published: (2026)
by: Ansell, Rebecca, et al.
Published: (2026)
REPOT: Recoverable Program-of-Thought via Checkpoint Repair
by: Mazaheri, Parsa
Published: (2026)
by: Mazaheri, Parsa
Published: (2026)
HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment
by: Jang, Yoonjin, et al.
Published: (2026)
by: Jang, Yoonjin, et al.
Published: (2026)
GuardVal: Dynamic Large Language Model Jailbreak Evaluation for Comprehensive Safety Testing
by: Zhang, Peiyan, et al.
Published: (2025)
by: Zhang, Peiyan, et al.
Published: (2025)
Critical Insights into Leading Conversational AI Models
by: Kohli, Urja, et al.
Published: (2025)
by: Kohli, Urja, et al.
Published: (2025)
ChatGPT4PCG Competition: Character-like Level Generation for Science Birds
by: Taveekitworachai, Pittawat, et al.
Published: (2023)
by: Taveekitworachai, Pittawat, et al.
Published: (2023)
LLMs for Game Theory: Entropy-Guided In-Context Learning and Adaptive CoT Reasoning
by: Banfi, Tommaso Felice, et al.
Published: (2026)
by: Banfi, Tommaso Felice, et al.
Published: (2026)
Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents
by: Chua, Jaymari, et al.
Published: (2025)
by: Chua, Jaymari, et al.
Published: (2025)
Automated Theorem Provers Help Improve Large Language Model Reasoning
by: McGinness, Lachlan, et al.
Published: (2024)
by: McGinness, Lachlan, et al.
Published: (2024)
Reinforced Language Models for Sequential Decision Making
by: Dilkes, Jim, et al.
Published: (2025)
by: Dilkes, Jim, et al.
Published: (2025)
REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models
by: Forniés-Tabuenca, Diego, et al.
Published: (2025)
by: Forniés-Tabuenca, Diego, et al.
Published: (2025)
ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs
by: Cui, Jian, et al.
Published: (2026)
by: Cui, Jian, et al.
Published: (2026)
Intention Communication and Hypothesis Likelihood in Game-Theoretic Motion Planning
by: Chahine, Makram, et al.
Published: (2022)
by: Chahine, Makram, et al.
Published: (2022)
On Dynamic Programming Theory for Leader-Follower Stochastic Games
by: Dibangoye, Jilles Steeve, et al.
Published: (2025)
by: Dibangoye, Jilles Steeve, et al.
Published: (2025)
XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
by: Estevanell-Valladares, Ernesto L., et al.
Published: (2025)
by: Estevanell-Valladares, Ernesto L., et al.
Published: (2025)
CoupleEvo: Evolving Heuristics for Coupled Optimization Problems Using Large Language Models
by: Bömer, Thomas, et al.
Published: (2026)
by: Bömer, Thomas, et al.
Published: (2026)
Open-TI: Open Traffic Intelligence with Augmented Language Model
by: Da, Longchao, et al.
Published: (2023)
by: Da, Longchao, et al.
Published: (2023)
GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation
by: Ghandi, Taraneh, et al.
Published: (2026)
by: Ghandi, Taraneh, et al.
Published: (2026)
TrafficRAG: A Multimodal RAG Framework for Traffic Accident Liability Determination
by: Li, Xu, et al.
Published: (2026)
by: Li, Xu, et al.
Published: (2026)
ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation
by: Taveekitworachai, Pittawat, et al.
Published: (2024)
by: Taveekitworachai, Pittawat, et al.
Published: (2024)
Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning
by: Thil, Lucas-Andreï, et al.
Published: (2024)
by: Thil, Lucas-Andreï, et al.
Published: (2024)
TRACE: Temporal Rule-Anchored Chain-of-Evidence on Knowledge Graphs for Interpretable Stock Movement Prediction
by: Ding, Qianggang, et al.
Published: (2026)
by: Ding, Qianggang, et al.
Published: (2026)
Assisting humans in complex comparisons: automated information comparison at scale
by: Yuen, Truman, et al.
Published: (2024)
by: Yuen, Truman, et al.
Published: (2024)
REVOLVE: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization
by: Zhang, Peiyan, et al.
Published: (2024)
by: Zhang, Peiyan, et al.
Published: (2024)
GraphEval36K: Benchmarking Coding and Reasoning Capabilities of Large Language Models on Graph Datasets
by: Wu, Qiming, et al.
Published: (2024)
by: Wu, Qiming, et al.
Published: (2024)
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More
by: Frydenlund, Arvid
Published: (2025)
by: Frydenlund, Arvid
Published: (2025)
Can AI Assist in Olympiad Coding
by: Ren, Samuel
Published: (2025)
by: Ren, Samuel
Published: (2025)
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation
by: Koc, Vincent
Published: (2025)
by: Koc, Vincent
Published: (2025)
Evaluating Large Language Models in a Complex Hidden Role Game
by: Bauer, Niklas
Published: (2026)
by: Bauer, Niklas
Published: (2026)
Resolving Action Bottleneck: Agentic Reinforcement Learning Informed by Token-Level Energy
by: He, Langzhou, et al.
Published: (2026)
by: He, Langzhou, et al.
Published: (2026)
Dynamic Policy Induction for Adaptive Prompt Optimization: Bridging the Efficiency-Accuracy Gap via Lightweight Reinforcement Learning
by: Xu, Jiexi
Published: (2025)
by: Xu, Jiexi
Published: (2025)
Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task
by: Ali, Hassan, et al.
Published: (2024)
by: Ali, Hassan, et al.
Published: (2024)
Aligning LLMs on a Budget: Inference-Time Alignment with Heuristic Reward Models
by: Nakamura, Mason, et al.
Published: (2025)
by: Nakamura, Mason, et al.
Published: (2025)
Systematic Classification of Studies Investigating Social Media Conversations about Long COVID Using a Novel Zero-Shot Transformer Framework
by: Thakur, Nirmalya, et al.
Published: (2025)
by: Thakur, Nirmalya, et al.
Published: (2025)
Emoji Retrieval from Gibberish or Garbled Social Media Text: A Novel Methodology and A Case Study
by: Cui, Shuqi, et al.
Published: (2024)
by: Cui, Shuqi, et al.
Published: (2024)
FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation
by: Hildebrand, Samuel, et al.
Published: (2025)
by: Hildebrand, Samuel, et al.
Published: (2025)
Evo-DKD: Dual-Knowledge Decoding for Autonomous Ontology Evolution in Large Language Models
by: Raman, Vishal, et al.
Published: (2025)
by: Raman, Vishal, et al.
Published: (2025)
An Explainable Collaborative Dialogue System using a Theory of Mind
by: Cohen, Philip R., et al.
Published: (2023)
by: Cohen, Philip R., et al.
Published: (2023)
Quantifying Public Response to COVID-19 Events: Introducing the Community Sentiment and Engagement Index
by: Thakur, Nirmalya, et al.
Published: (2024)
by: Thakur, Nirmalya, et al.
Published: (2024)
Retrieval Augmented Thought Process for Private Data Handling in Healthcare
by: Pouplin, Thomas, et al.
Published: (2024)
by: Pouplin, Thomas, et al.
Published: (2024)
Similar Items
-
How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment
by: Ansell, Rebecca, et al.
Published: (2026) -
REPOT: Recoverable Program-of-Thought via Checkpoint Repair
by: Mazaheri, Parsa
Published: (2026) -
HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment
by: Jang, Yoonjin, et al.
Published: (2026) -
GuardVal: Dynamic Large Language Model Jailbreak Evaluation for Comprehensive Safety Testing
by: Zhang, Peiyan, et al.
Published: (2025) -
Critical Insights into Leading Conversational AI Models
by: Kohli, Urja, et al.
Published: (2025)