Saved in:
| Main Authors: | Chen, Weiyi, Wang, Shuaixiong, Gao, Ziyun, Hu, Kaichun, Ni, Wangze, Di, Shimin, Zhang, Chen Jason, Chen, Lei |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.01046 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TravelAgent: An AI Assistant for Personalized Travel Planning
by: Chen, Aili, et al.
Published: (2024)
by: Chen, Aili, et al.
Published: (2024)
HarmMetric Eval: Benchmarking Metrics and Judges for LLM Harmfulness Assessment
by: Yang, Langqi, et al.
Published: (2025)
by: Yang, Langqi, et al.
Published: (2025)
When AI reviews science: Can we trust the referee?
by: Wang, Jialiang, et al.
Published: (2026)
by: Wang, Jialiang, et al.
Published: (2026)
DeepTravel: An End-to-End Agentic Reinforcement Learning Framework for Autonomous Travel Planning Agents
by: Ning, Yansong, et al.
Published: (2025)
by: Ning, Yansong, et al.
Published: (2025)
ChinaTravel: An Open-Ended Travel Planning Benchmark with Compositional Constraint Validation for Language Agents
by: Shao, Jie-Jing, et al.
Published: (2024)
by: Shao, Jie-Jing, et al.
Published: (2024)
RxnNano:Training Compact LLMs for Chemical Reaction and Retrosynthesis Prediction via Hierarchical Curriculum Learning
by: Li, Ran, et al.
Published: (2026)
by: Li, Ran, et al.
Published: (2026)
GroupTravelBench: Benchmarking LLM Agents on Multi-Person Travel Planning
by: Cheng, Xiang, et al.
Published: (2026)
by: Cheng, Xiang, et al.
Published: (2026)
Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning
by: Gundawar, Atharva, et al.
Published: (2024)
by: Gundawar, Atharva, et al.
Published: (2024)
HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel
by: Bui, The Viet, et al.
Published: (2026)
by: Bui, The Viet, et al.
Published: (2026)
Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example
by: Chen, Yanan, et al.
Published: (2024)
by: Chen, Yanan, et al.
Published: (2024)
ATLAS: Constraints-Aware Multi-Agent Collaboration for Real-World Travel Planning
by: Choi, Jihye, et al.
Published: (2025)
by: Choi, Jihye, et al.
Published: (2025)
TravelAgent: Generative Agents in the Built Environment
by: Noyman, Ariel, et al.
Published: (2024)
by: Noyman, Ariel, et al.
Published: (2024)
VeriTrip: A Verifiable Benchmark for Travel Planning Agents over Unstructured Web Corpora
by: Xu, Yuting, et al.
Published: (2026)
by: Xu, Yuting, et al.
Published: (2026)
TripTailor: A Real-World Benchmark for Personalized Travel Planning
by: Shen, Yuanzhe, et al.
Published: (2025)
by: Shen, Yuanzhe, et al.
Published: (2025)
Vaiage: A Multi-Agent Solution to Personalized Travel Planning
by: Liu, Binwen, et al.
Published: (2025)
by: Liu, Binwen, et al.
Published: (2025)
RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments
by: Fu, Yuchuan, et al.
Published: (2025)
by: Fu, Yuchuan, et al.
Published: (2025)
TripTide: A Benchmark for Adaptive Travel Planning under Disruptions
by: Karmakar, Priyanshu, et al.
Published: (2025)
by: Karmakar, Priyanshu, et al.
Published: (2025)
SRBench: A Comprehensive Benchmark for Sequential Recommendation with Large Language Models
by: Li, Jianhong, et al.
Published: (2026)
by: Li, Jianhong, et al.
Published: (2026)
RETAIL: Towards Real-world Travel Planning for Large Language Models
by: Deng, Bin, et al.
Published: (2025)
by: Deng, Bin, et al.
Published: (2025)
Learn to Tour: Operator Design For Solution Feasibility Mapping in Pickup-and-delivery Traveling Salesman Problem
by: Fang, Bowen, et al.
Published: (2024)
by: Fang, Bowen, et al.
Published: (2024)
TripCraft: A Benchmark for Spatio-Temporally Fine Grained Travel Planning
by: Chaudhuri, Soumyabrata, et al.
Published: (2025)
by: Chaudhuri, Soumyabrata, et al.
Published: (2025)
Beyond Itinerary Planning-A Real-World Benchmark for Multi-Turn and Tool-Using Travel Tasks
by: Cheng, Xiang, et al.
Published: (2025)
by: Cheng, Xiang, et al.
Published: (2025)
MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents
by: Dai, Yanqi, et al.
Published: (2024)
by: Dai, Yanqi, et al.
Published: (2024)
SurveyEval: Towards Comprehensive Evaluation of LLM-Generated Academic Surveys
by: Zhao, Jiahao, et al.
Published: (2025)
by: Zhao, Jiahao, et al.
Published: (2025)
InsightEval: An Expert-Curated Benchmark for Assessing Insight Discovery in LLM-Driven Data Agents
by: Zhu, Zhenghao, et al.
Published: (2025)
by: Zhu, Zhenghao, et al.
Published: (2025)
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
by: Xie, Jian, et al.
Published: (2024)
by: Xie, Jian, et al.
Published: (2024)
Revisiting the Travel Planning Capabilities of Large Language Models
by: Zhang, Bo-Wen, et al.
Published: (2026)
by: Zhang, Bo-Wen, et al.
Published: (2026)
MMCircuitEval: A Comprehensive Multimodal Circuit-Focused Benchmark for Evaluating LLMs
by: Zhao, Chenchen, et al.
Published: (2025)
by: Zhao, Chenchen, et al.
Published: (2025)
Be More Real: Travel Diary Generation Using LLM Agents and Individual Profiles
by: Li, Xuchuan, et al.
Published: (2024)
by: Li, Xuchuan, et al.
Published: (2024)
RxEval: A Prescription-Level Benchmark for Evaluating LLM Medication Recommendation
by: Chen, Shuhao, et al.
Published: (2026)
by: Chen, Shuhao, et al.
Published: (2026)
PEMANT: Persona-Enriched Multi-Agent Negotiation for Travel
by: Sun, Yuran, et al.
Published: (2026)
by: Sun, Yuran, et al.
Published: (2026)
Reinforcement Learning-based Non-Autoregressive Solver for Traveling Salesman Problems
by: Xiao, Yubin, et al.
Published: (2023)
by: Xiao, Yubin, et al.
Published: (2023)
AlphaEval: A Comprehensive and Efficient Evaluation Framework for Formula Alpha Mining
by: Ding, Hongjun, et al.
Published: (2025)
by: Ding, Hongjun, et al.
Published: (2025)
Search to Fine-tune Pre-trained Graph Neural Networks for Graph-level Tasks
by: Wang, Zhili, et al.
Published: (2023)
by: Wang, Zhili, et al.
Published: (2023)
Large Language Models in the Travel Domain: An Industrial Experience
by: Di Meglio, Sergio, et al.
Published: (2025)
by: Di Meglio, Sergio, et al.
Published: (2025)
IMAIA: Interactive Maps AI Assistant for Travel Planning and Geo-Spatial Intelligence
by: Deng, Jieren, et al.
Published: (2025)
by: Deng, Jieren, et al.
Published: (2025)
SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment
by: Jiang, Sihang, et al.
Published: (2026)
by: Jiang, Sihang, et al.
Published: (2026)
AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems
by: Wang, Weiyi, et al.
Published: (2026)
by: Wang, Weiyi, et al.
Published: (2026)
A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case
by: Meyer, Sonia, et al.
Published: (2024)
by: Meyer, Sonia, et al.
Published: (2024)
CodeFuse-CommitEval: Towards Benchmarking LLM's Power on Commit Message and Code Change Inconsistency Detection
by: Zhang, Qingyu, et al.
Published: (2025)
by: Zhang, Qingyu, et al.
Published: (2025)
Similar Items
-
TravelAgent: An AI Assistant for Personalized Travel Planning
by: Chen, Aili, et al.
Published: (2024) -
HarmMetric Eval: Benchmarking Metrics and Judges for LLM Harmfulness Assessment
by: Yang, Langqi, et al.
Published: (2025) -
When AI reviews science: Can we trust the referee?
by: Wang, Jialiang, et al.
Published: (2026) -
DeepTravel: An End-to-End Agentic Reinforcement Learning Framework for Autonomous Travel Planning Agents
by: Ning, Yansong, et al.
Published: (2025) -
ChinaTravel: An Open-Ended Travel Planning Benchmark with Compositional Constraint Validation for Language Agents
by: Shao, Jie-Jing, et al.
Published: (2024)