Saved in:
| Main Authors: | Chi, Yizhou, Yang, Kevin, Klein, Dan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.05966 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval
by: Chi, Yizhou, et al.
Published: (2024)
by: Chi, Yizhou, et al.
Published: (2024)
RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment
by: Yang, Kevin, et al.
Published: (2023)
by: Yang, Kevin, et al.
Published: (2023)
SubSearch: Intermediate Rewards for Unsupervised Guided Reasoning in Complex Retrieval
by: Petcu, Roxana, et al.
Published: (2026)
by: Petcu, Roxana, et al.
Published: (2026)
Doing Experiments and Revising Rules with Natural Language and Probabilistic Reasoning
by: Piriyakulkij, Wasu Top, et al.
Published: (2024)
by: Piriyakulkij, Wasu Top, et al.
Published: (2024)
Reinforced Context Order Recovery for Adaptive Reasoning and Planning
by: Ma, Long, et al.
Published: (2025)
by: Ma, Long, et al.
Published: (2025)
Intermediate Languages Matter: Formal Choice Drives Neurosymbolic LLM Reasoning
by: Beiser, Alexander, et al.
Published: (2025)
by: Beiser, Alexander, et al.
Published: (2025)
American Sign Language Handshapes Reflect Pressures for Communicative Efficiency
by: Yin, Kayo, et al.
Published: (2024)
by: Yin, Kayo, et al.
Published: (2024)
When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks
by: Fleisig, Eve, et al.
Published: (2023)
by: Fleisig, Eve, et al.
Published: (2023)
Instructing the Architecture Search for Spatial-temporal Sequence Forecasting with LLM
by: Xue, Xin, et al.
Published: (2025)
by: Xue, Xin, et al.
Published: (2025)
Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
by: Jiao, Rui, et al.
Published: (2025)
by: Jiao, Rui, et al.
Published: (2025)
LRAS: Advanced Legal Reasoning with Agentic Search
by: Zhou, Yujin, et al.
Published: (2026)
by: Zhou, Yujin, et al.
Published: (2026)
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
by: Chen, Mingyang, et al.
Published: (2025)
by: Chen, Mingyang, et al.
Published: (2025)
Geo-Expert: Towards Expert-Level Geological Reasoning via Parameter-Efficient Fine-Tuning
by: Guo, Chenyou, et al.
Published: (2026)
by: Guo, Chenyou, et al.
Published: (2026)
Ghostbuster: Detecting Text Ghostwritten by Large Language Models
by: Verma, Vivek, et al.
Published: (2023)
by: Verma, Vivek, et al.
Published: (2023)
Balancing Quality and Variation: Spam Filtering Distorts Data Label Distributions
by: Fleisig, Eve, et al.
Published: (2025)
by: Fleisig, Eve, et al.
Published: (2025)
Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers
by: Yang, Wang, et al.
Published: (2026)
by: Yang, Wang, et al.
Published: (2026)
Emergent Search and Backtracking in Latent Reasoning Models
by: Cui, Jasmine, et al.
Published: (2026)
by: Cui, Jasmine, et al.
Published: (2026)
PromptTailor: Multi-turn Intent-Aligned Prompt Synthesis for Lightweight LLMs
by: Xu, Yizhou, et al.
Published: (2025)
by: Xu, Yizhou, et al.
Published: (2025)
WildSci: Advancing Scientific Reasoning from In-the-Wild Literature
by: Liu, Tengxiao, et al.
Published: (2026)
by: Liu, Tengxiao, et al.
Published: (2026)
What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features
by: Ki, Dayeon, et al.
Published: (2026)
by: Ki, Dayeon, et al.
Published: (2026)
Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration
by: He, Bowei, et al.
Published: (2026)
by: He, Bowei, et al.
Published: (2026)
R2-Write: Reflection and Revision for Open-Ended Writing with Deep Reasoning
by: Liu, Wanlong, et al.
Published: (2026)
by: Liu, Wanlong, et al.
Published: (2026)
SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation
by: Xu, Bin, et al.
Published: (2024)
by: Xu, Bin, et al.
Published: (2024)
Learning to Reason via Program Generation, Emulation, and Search
by: Weir, Nathaniel, et al.
Published: (2024)
by: Weir, Nathaniel, et al.
Published: (2024)
Enhancing LLM Reasoning with Reward-guided Tree Search
by: Jiang, Jinhao, et al.
Published: (2024)
by: Jiang, Jinhao, et al.
Published: (2024)
Interpretable Contrastive Monte Carlo Tree Search Reasoning
by: Gao, Zitian, et al.
Published: (2024)
by: Gao, Zitian, et al.
Published: (2024)
Efficient Reasoning with Hidden Thinking
by: Shen, Xuan, et al.
Published: (2025)
by: Shen, Xuan, et al.
Published: (2025)
Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought
by: Zhang, Zhen-Yu, et al.
Published: (2024)
by: Zhang, Zhen-Yu, et al.
Published: (2024)
BFS-PO: Best-First Search for Large Reasoning Models
by: Parascandolo, Fiorenzo, et al.
Published: (2026)
by: Parascandolo, Fiorenzo, et al.
Published: (2026)
Accelerating Large Language Model Reasoning via Speculative Search
by: Wang, Zhihai, et al.
Published: (2025)
by: Wang, Zhihai, et al.
Published: (2025)
A Study on Leveraging Search and Self-Feedback for Agent Reasoning
by: K, Karthikeyan, et al.
Published: (2025)
by: K, Karthikeyan, et al.
Published: (2025)
Novelty-based Tree-of-Thought Search for LLM Reasoning and Planning
by: Hamm, Leon, et al.
Published: (2026)
by: Hamm, Leon, et al.
Published: (2026)
ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning
by: Potamitis, Nearchos, et al.
Published: (2025)
by: Potamitis, Nearchos, et al.
Published: (2025)
ETR: Entropy Trend Reward for Efficient Chain-of-Thought Reasoning
by: Xiong, Xuan, et al.
Published: (2026)
by: Xiong, Xuan, et al.
Published: (2026)
MultiZebraLogic: A Multilingual Logical Reasoning Benchmark
by: Bruun, Sofie Helene, et al.
Published: (2025)
by: Bruun, Sofie Helene, et al.
Published: (2025)
No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning
by: Rajgaria, Abhishek, et al.
Published: (2025)
by: Rajgaria, Abhishek, et al.
Published: (2025)
Language Models Represent Beliefs of Self and Others
by: Zhu, Wentao, et al.
Published: (2024)
by: Zhu, Wentao, et al.
Published: (2024)
SIGMA: Search-Augmented On-Demand Knowledge Integration for Agentic Mathematical Reasoning
by: Asgarov, Ali, et al.
Published: (2025)
by: Asgarov, Ali, et al.
Published: (2025)
Robust Search with Uncertainty-Aware Value Models for Language Model Reasoning
by: Yu, Fei, et al.
Published: (2025)
by: Yu, Fei, et al.
Published: (2025)
ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search
by: Zhang, Yize, et al.
Published: (2025)
by: Zhang, Yize, et al.
Published: (2025)
Similar Items
-
CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval
by: Chi, Yizhou, et al.
Published: (2024) -
RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment
by: Yang, Kevin, et al.
Published: (2023) -
SubSearch: Intermediate Rewards for Unsupervised Guided Reasoning in Complex Retrieval
by: Petcu, Roxana, et al.
Published: (2026) -
Doing Experiments and Revising Rules with Natural Language and Probabilistic Reasoning
by: Piriyakulkij, Wasu Top, et al.
Published: (2024) -
Reinforced Context Order Recovery for Adaptive Reasoning and Planning
by: Ma, Long, et al.
Published: (2025)