Saved in:
| Main Authors: | Shi, Zijing, Fang, Meng, Chen, Ling |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.16855 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in the Avalon Game
by: Shi, Zijing, et al.
Published: (2023)
by: Shi, Zijing, et al.
Published: (2023)
Spiral of Silence in Large Language Model Agents
by: Zhong, Mingze, et al.
Published: (2025)
by: Zhong, Mingze, et al.
Published: (2025)
Large Language Models Are Neurosymbolic Reasoners
by: Fang, Meng, et al.
Published: (2024)
by: Fang, Meng, et al.
Published: (2024)
Entropy-Based Decoding for Retrieval-Augmented Large Language Models
by: Qiu, Zexuan, et al.
Published: (2024)
by: Qiu, Zexuan, et al.
Published: (2024)
AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game
by: Chi, Yizhou, et al.
Published: (2024)
by: Chi, Yizhou, et al.
Published: (2024)
SpeechR: A Benchmark for Speech Reasoning in Large Audio-Language Models
by: Yang, Wanqi, et al.
Published: (2025)
by: Yang, Wanqi, et al.
Published: (2025)
TokenSHAP: Interpreting Large Language Models with Monte Carlo Shapley Value Estimation
by: Goldshmidt, Roni, et al.
Published: (2024)
by: Goldshmidt, Roni, et al.
Published: (2024)
Large Language Models as Agents in Two-Player Games
by: Liu, Yang, et al.
Published: (2024)
by: Liu, Yang, et al.
Published: (2024)
TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning
by: Hudi, Frederikus, et al.
Published: (2025)
by: Hudi, Frederikus, et al.
Published: (2025)
Design and Optimization of Reinforcement Learning-Based Agents in Text-Based Games
by: Wang, Haonan, et al.
Published: (2025)
by: Wang, Haonan, et al.
Published: (2025)
A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios
by: Feng, Xiachong, et al.
Published: (2024)
by: Feng, Xiachong, et al.
Published: (2024)
ERAGent: Enhancing Retrieval-Augmented Language Models with Improved Accuracy, Efficiency, and Personalization
by: Shi, Yunxiao, et al.
Published: (2024)
by: Shi, Yunxiao, et al.
Published: (2024)
TransportationGames: Benchmarking Transportation Knowledge of (Multimodal) Large Language Models
by: Zhang, Xue, et al.
Published: (2024)
by: Zhang, Xue, et al.
Published: (2024)
TextLap: Customizing Language Models for Text-to-Layout Planning
by: Chen, Jian, et al.
Published: (2024)
by: Chen, Jian, et al.
Published: (2024)
Recovering Mental Representations from Large Language Models with Markov Chain Monte Carlo
by: Zhu, Jian-Qiao, et al.
Published: (2024)
by: Zhu, Jian-Qiao, et al.
Published: (2024)
Diffusion Language Model Inference with Monte Carlo Tree Search
by: Huang, Zheng, et al.
Published: (2025)
by: Huang, Zheng, et al.
Published: (2025)
ClinicalAgent: Clinical Trial Multi-Agent System with Large Language Model-based Reasoning
by: Yue, Ling, et al.
Published: (2024)
by: Yue, Ling, et al.
Published: (2024)
CMCTS: A Constrained Monte Carlo Tree Search Framework for Mathematical Reasoning in Large Language Model
by: Lin, Qingwen, et al.
Published: (2025)
by: Lin, Qingwen, et al.
Published: (2025)
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
by: Loula, João, et al.
Published: (2025)
by: Loula, João, et al.
Published: (2025)
Ensembling Language Models with Sequential Monte Carlo
by: Chan, Robin Shing Moon, et al.
Published: (2026)
by: Chan, Robin Shing Moon, et al.
Published: (2026)
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
by: Yang, Wanqi, et al.
Published: (2025)
by: Yang, Wanqi, et al.
Published: (2025)
RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines
by: Yu, Pengfei, et al.
Published: (2025)
by: Yu, Pengfei, et al.
Published: (2025)
CRiskEval: A Chinese Multi-Level Risk Evaluation Benchmark Dataset for Large Language Models
by: Shi, Ling, et al.
Published: (2024)
by: Shi, Ling, et al.
Published: (2024)
Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models
by: Lee, Seongmin, et al.
Published: (2024)
by: Lee, Seongmin, et al.
Published: (2024)
ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities
by: Su, Ying, et al.
Published: (2024)
by: Su, Ying, et al.
Published: (2024)
Credence Calibration Game? Calibrating Large Language Models through Structured Play
by: Fang, Ke, et al.
Published: (2025)
by: Fang, Ke, et al.
Published: (2025)
Hazards in Daily Life? Enabling Robots to Proactively Detect and Resolve Anomalies
by: Song, Zirui, et al.
Published: (2024)
by: Song, Zirui, et al.
Published: (2024)
ProBench: Benchmarking Large Language Models in Competitive Programming
by: Yang, Lei, et al.
Published: (2025)
by: Yang, Lei, et al.
Published: (2025)
GameTraversalBenchmark: Evaluating Planning Abilities Of Large Language Models Through Traversing 2D Game Maps
by: Nasir, Muhammad Umair, et al.
Published: (2024)
by: Nasir, Muhammad Umair, et al.
Published: (2024)
RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents
by: Chen, Weizhe, et al.
Published: (2024)
by: Chen, Weizhe, et al.
Published: (2024)
When Language Overrules: Revealing Text Dominance in Multimodal Large Language Models
by: Wu, Huyu, et al.
Published: (2025)
by: Wu, Huyu, et al.
Published: (2025)
Progressive Document-level Text Simplification via Large Language Models
by: Fang, Dengzhao, et al.
Published: (2025)
by: Fang, Dengzhao, et al.
Published: (2025)
Game of Thought: Robust Information Seeking with Large Language Models Using Game Theory
by: Cui, Langyuan, et al.
Published: (2026)
by: Cui, Langyuan, et al.
Published: (2026)
MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models
by: Yu, Beibei, et al.
Published: (2024)
by: Yu, Beibei, et al.
Published: (2024)
Principled Gradient-based Markov Chain Monte Carlo for Text Generation
by: Du, Li, et al.
Published: (2023)
by: Du, Li, et al.
Published: (2023)
A Survey on Employing Large Language Models for Text-to-SQL Tasks
by: Shi, Liang, et al.
Published: (2024)
by: Shi, Liang, et al.
Published: (2024)
Can Large Language Models Master Complex Card Games?
by: Wang, Wei, et al.
Published: (2025)
by: Wang, Wei, et al.
Published: (2025)
MCTS-SQL: Light-Weight LLMs can Master the Text-to-SQL through Monte Carlo Tree Search
by: Yuan, Shuozhi, et al.
Published: (2025)
by: Yuan, Shuozhi, et al.
Published: (2025)
Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models
by: Wang, Danqing, et al.
Published: (2024)
by: Wang, Danqing, et al.
Published: (2024)
A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction
by: Cai, Erica, et al.
Published: (2023)
by: Cai, Erica, et al.
Published: (2023)
Similar Items
-
Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in the Avalon Game
by: Shi, Zijing, et al.
Published: (2023) -
Spiral of Silence in Large Language Model Agents
by: Zhong, Mingze, et al.
Published: (2025) -
Large Language Models Are Neurosymbolic Reasoners
by: Fang, Meng, et al.
Published: (2024) -
Entropy-Based Decoding for Retrieval-Augmented Large Language Models
by: Qiu, Zexuan, et al.
Published: (2024) -
AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game
by: Chi, Yizhou, et al.
Published: (2024)