:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Taneja, Karan, Segal, Richard, Goodwin, Richard
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2401.05199
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

HALT: Hallucination Assessment via Log-probs as Time series
by: Shapiro, Ahmad, et al.
Published: (2026)

Jill Watson: A Virtual Teaching Assistant powered by ChatGPT
by: Taneja, Karan, et al.
Published: (2024)

Interpretable Contrastive Monte Carlo Tree Search Reasoning
by: Gao, Zitian, et al.
Published: (2024)

Diffusion Language Model Inference with Monte Carlo Tree Search
by: Huang, Zheng, et al.
Published: (2025)

SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation
by: Xu, Bin, et al.
Published: (2024)

Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search
by: Ryu, Sangwon, et al.
Published: (2025)

MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering
by: Xiong, Guanming, et al.
Published: (2025)

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
by: Wu, Fang, et al.
Published: (2025)

Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search
by: Li, Shuangtao, et al.
Published: (2025)

Empirical-MCTS: Continuous Agent Evolution via Dual-Experience Monte Carlo Tree Search
by: Lu, Hao, et al.
Published: (2026)

Graph-O1 : Monte Carlo Tree Search with Reinforcement Learning for Text-Attributed Graph Reasoning
by: Liu, Lihui
Published: (2025)

KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search
by: Luo, Haoran, et al.
Published: (2025)

MCTS-SQL: Light-Weight LLMs can Master the Text-to-SQL through Monte Carlo Tree Search
by: Yuan, Shuozhi, et al.
Published: (2025)

Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding
by: Liu, Jiacheng, et al.
Published: (2023)

Enhancing Action and Ingredient Modeling for Semantically Grounded Recipe Generation
by: Liu, Guoshan, et al.
Published: (2026)

Losses that Cook: Topological Optimal Transport for Structured Recipe Generation
by: Ottoborgo, Mattia, et al.
Published: (2026)

CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents
by: Sutawika, Lintang, et al.
Published: (2026)

Feedback-Aware Monte Carlo Tree Search for Efficient Information Seeking in Goal-Oriented Conversations
by: Chopra, Harshita, et al.
Published: (2025)

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
by: Zhang, Kaichen, et al.
Published: (2025)

MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
by: Wan, David, et al.
Published: (2025)

Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study
by: Vij, Anneketh, et al.
Published: (2025)

Removing RLHF Protections in GPT-4 via Fine-Tuning
by: Zhan, Qiusi, et al.
Published: (2023)

Vero: An Open RL Recipe for General Visual Reasoning
by: Sarch, Gabriel, et al.
Published: (2026)

The Aloe Family Recipe for Open and Specialized Healthcare LLMs
by: Garcia-Gasulla, Dario, et al.
Published: (2025)

MuDoC: An Interactive Multimodal Document-grounded Conversational AI System
by: Taneja, Karan, et al.
Published: (2025)

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
by: Xin, Huajian, et al.
Published: (2024)

A Recipe For Building a Compliant Real Estate Chatbot
by: Madani, Navid, et al.
Published: (2024)

Behavior Trees Enable Structured Programming of Language Model Agents
by: Kelley, Richard
Published: (2024)

Ensembling Language Models with Sequential Monte Carlo
by: Chan, Robin Shing Moon, et al.
Published: (2026)

English K_Quantization of LLMs Does Not Disproportionately Diminish Multilingual Performance
by: Borgersen, Karl Audun, et al.
Published: (2025)

Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation
by: Park, Jong Inn, et al.
Published: (2025)

Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for Multistep Reasoning
by: Huang, Bingning, et al.
Published: (2025)

Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat
by: Daynauth, Roland, et al.
Published: (2024)

Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation
by: Chen, Yifang, et al.
Published: (2024)

Ratchet: A Minimal Hygiene Recipe for Self-Evolving LLM Agents
by: Zhang, Xing, et al.
Published: (2026)

PersonaMatrix: A Recipe for Persona-Aware Evaluation of Legal Summarization
by: Pang, Tsz Fung, et al.
Published: (2025)

Structured Extraction of Real World Medical Knowledge using LLMs for Summarization and Search
by: Kim, Edward, et al.
Published: (2024)

Recovering Mental Representations from Large Language Models with Markov Chain Monte Carlo
by: Zhu, Jian-Qiao, et al.
Published: (2024)

InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models
by: Wang, Wenjun, et al.
Published: (2025)

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
by: Kang, Hao, et al.
Published: (2024)