:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Radha, Santosh Kumar, Goktas, Oktay
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence Computation and Language Multiagent Systems
Online Access:	https://arxiv.org/abs/2410.08037
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning
by: Radha, Santosh Kumar, et al.
Published: (2024)

On the Reasoning Capacity of AI Models and How to Quantify It
by: Radha, Santosh Kumar, et al.
Published: (2025)

ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
by: Wan, Ziyu, et al.
Published: (2025)

DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration
by: Nourzad, Narjes, et al.
Published: (2025)

Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures
by: Pandey, Tushar, et al.
Published: (2025)

Exploring Natural Language-Based Strategies for Efficient Number Learning in Children through Reinforcement Learning
by: Mittra, Tirthankar
Published: (2024)

FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast
by: Bogdanov, Igor, et al.
Published: (2026)

MAC: Multi-Agent Constitution Learning
by: Thareja, Rushil, et al.
Published: (2026)

PLANET: A Collection of Benchmarks for Evaluating LLMs' Planning Capabilities
by: Li, Haoming, et al.
Published: (2025)

Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration
by: He, Zhixuan, et al.
Published: (2025)

Can We Predict Before Executing Machine Learning Agents?
by: Zheng, Jingsheng, et al.
Published: (2026)

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
by: Sarkar, Bidipta, et al.
Published: (2025)

Mnemosyne: An Unsupervised, Human-Inspired Long-Term Memory Architecture for Edge-Based LLMs
by: Jonelagadda, Aneesh, et al.
Published: (2025)

Multi-Objective Reinforcement Learning for Large Language Model Optimization: Visionary Perspective
by: Kong, Lingxiao, et al.
Published: (2025)

MLZero: A Multi-Agent System for End-to-end Machine Learning Automation
by: Fang, Haoyang, et al.
Published: (2025)

Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks
by: Lau, Gregory Kang Ruey, et al.
Published: (2024)

LENS: Learning Ensemble Confidence from Neural States for Multi-LLM Answer Integration
by: Guo, Jizhou
Published: (2025)

Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings
by: Taylor, Russell, et al.
Published: (2025)

RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation
by: Shen, Chengzhi, et al.
Published: (2026)

Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects
by: Mushtaq, Abdullah, et al.
Published: (2025)

Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset
by: Gabriel, Adrian Garret, et al.
Published: (2024)

Recursive Agent Optimization
by: Gandhi, Apurva, et al.
Published: (2026)

PLAGUE: Plug-and-play framework for Lifelong Adaptive Generation of Multi-turn Exploits
by: Bhuiya, Neeladri, et al.
Published: (2025)

Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning
by: Mo, Shentong
Published: (2026)

LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
by: Sun, Chuanneng, et al.
Published: (2024)

EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce
by: Yu, Minhyeong, et al.
Published: (2026)

Context, Reasoning, and Hierarchy: A Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP
by: Bogdanov, Igor, et al.
Published: (2026)

Toward Inclusive Educational AI: Auditing Frontier LLMs through a Multiplexity Lens
by: Mushtaq, Abdullah, et al.
Published: (2025)

StructMem: Structured Memory for Long-Horizon Behavior in LLMs
by: Xu, Buqiang, et al.
Published: (2026)

GenDB: The Next Generation of Query Processing -- Synthesized, Not Engineered
by: Lao, Jiale, et al.
Published: (2026)

Towards Emotionally Intelligent and Responsible Reinforcement Learning
by: Keerthana, Garapati, et al.
Published: (2025)

X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents
by: Rahman, Salman, et al.
Published: (2025)

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
by: Zhu, Yuqi, et al.
Published: (2025)

AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML
by: Trirat, Patara, et al.
Published: (2024)

Language Agents as Optimizable Graphs
by: Zhuge, Mingchen, et al.
Published: (2024)

DroidSpeak: KV Cache Sharing for Cross-LLM Communication and Multi-LLM Serving
by: Liu, Yuhan, et al.
Published: (2024)

Iterative Graph Alignment
by: Yu, Fangyuan, et al.
Published: (2024)

MARCO: Multi-Agent Real-time Chat Orchestration
by: Shrimal, Anubhav, et al.
Published: (2024)

TrustAgent: Towards Safe and Trustworthy LLM-based Agents
by: Hua, Wenyue, et al.
Published: (2024)

In-Context Environments Induce Evaluation-Awareness in Language Models
by: Chaudhary, Maheep
Published: (2026)