Агуулга: :: Library Catalog

$Нүүр зураг$

-д хадгалсан:

Номзүйн дэлгэрэнгүй
Үндсэн зохиолчид:	Mr. Sankhadeep Debdas, Pawan Kumar
Формат:	Recurso digital
Хэл сонгох:
Хэвлэсэн:	Zenodo 2026
Онлайн хандалт:	https://doi.org/10.5281/zenodo.20123803
Шошгууд:	Шошго нэмэх Шошго байхгүй, Энэхүү баримтыг шошголох эхний хүн болох!

Агуулга:

Abstract The evaluation of Tree-of-Thought (ToT) reasoning and self-consistency is important when deploying lightweight language models (LLMs) that can produce capable results while operating on constrained hardware. Self-consistency decoders can enhance the robustness of chain-of-thought style inference for complex reasoning tasks through the use of selfconsistency, by allowing multiple pathways of diverse reasoning to be aggregated through sample-and-vote based methods. By enabling look-ahead, backtracking, and intentional processing of alternate solution paths, the concept of ToT generalizes these ideas to create a search tree that represents all the "thoughts" produced at the intermediate level of processing. This version provides a thorough description of both methods of deriving results from using lightweight language models through self-consistency (SC) and trees of thoughts (ToT) techniques according to an established benchmark suite consisting of three different types of arithmetic problems: word, logic, and combinatorial, and implementing various instances of SC and ToT methodologies (with varying numbers of parameters) on small models that can be checked for performance (accuracy, sample efficiency and latency) against alternative computational environments and the respective experiments were constructed using a combination of algorithmic and statistically-based methods to achieve “faithfulness” to machines' reasoning by employing an agreement between the methods' individual sample trajectories and ground-truth solutions. The result whereby self-tested consistency alone (trial and error) was greater in performance as compared to ToT were due only to circumstances with high “overhead” costs associated to searching tree-based data with additional overheads for ToT data structures permitting “non-trivial” savings achieved through incorrect reasoning, i.e., by escaping “local consistency” through incorrect use of its rules. Finally, we propose a series of guidelines to assist in the selection and tuning of SC and ToT for use in lightweight LLMs for applications requiring mathematical or logical reasoning, and a framework will be created to serve as an efficiency-based adaptive-toa-of-self-consistency model for future research endeavors. Keywords: Deep Learning, Large Language Model, Tree of Thought Reasoning, Neural Networks, Artificial Intelligence.

Ижил төстэй зүйлс