Saved in:
| Main Authors: | Radha, Santosh Kumar, Goktas, Oktay |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.08037 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning
by: Radha, Santosh Kumar, et al.
Published: (2024)
by: Radha, Santosh Kumar, et al.
Published: (2024)
On the Reasoning Capacity of AI Models and How to Quantify It
by: Radha, Santosh Kumar, et al.
Published: (2025)
by: Radha, Santosh Kumar, et al.
Published: (2025)
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
by: Wan, Ziyu, et al.
Published: (2025)
by: Wan, Ziyu, et al.
Published: (2025)
DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration
by: Nourzad, Narjes, et al.
Published: (2025)
by: Nourzad, Narjes, et al.
Published: (2025)
Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures
by: Pandey, Tushar, et al.
Published: (2025)
by: Pandey, Tushar, et al.
Published: (2025)
Exploring Natural Language-Based Strategies for Efficient Number Learning in Children through Reinforcement Learning
by: Mittra, Tirthankar
Published: (2024)
by: Mittra, Tirthankar
Published: (2024)
FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast
by: Bogdanov, Igor, et al.
Published: (2026)
by: Bogdanov, Igor, et al.
Published: (2026)
MAC: Multi-Agent Constitution Learning
by: Thareja, Rushil, et al.
Published: (2026)
by: Thareja, Rushil, et al.
Published: (2026)
PLANET: A Collection of Benchmarks for Evaluating LLMs' Planning Capabilities
by: Li, Haoming, et al.
Published: (2025)
by: Li, Haoming, et al.
Published: (2025)
Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration
by: He, Zhixuan, et al.
Published: (2025)
by: He, Zhixuan, et al.
Published: (2025)
Can We Predict Before Executing Machine Learning Agents?
by: Zheng, Jingsheng, et al.
Published: (2026)
by: Zheng, Jingsheng, et al.
Published: (2026)
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
by: Sarkar, Bidipta, et al.
Published: (2025)
by: Sarkar, Bidipta, et al.
Published: (2025)
Mnemosyne: An Unsupervised, Human-Inspired Long-Term Memory Architecture for Edge-Based LLMs
by: Jonelagadda, Aneesh, et al.
Published: (2025)
by: Jonelagadda, Aneesh, et al.
Published: (2025)
Multi-Objective Reinforcement Learning for Large Language Model Optimization: Visionary Perspective
by: Kong, Lingxiao, et al.
Published: (2025)
by: Kong, Lingxiao, et al.
Published: (2025)
MLZero: A Multi-Agent System for End-to-end Machine Learning Automation
by: Fang, Haoyang, et al.
Published: (2025)
by: Fang, Haoyang, et al.
Published: (2025)
Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks
by: Lau, Gregory Kang Ruey, et al.
Published: (2024)
by: Lau, Gregory Kang Ruey, et al.
Published: (2024)
LENS: Learning Ensemble Confidence from Neural States for Multi-LLM Answer Integration
by: Guo, Jizhou
Published: (2025)
by: Guo, Jizhou
Published: (2025)
Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings
by: Taylor, Russell, et al.
Published: (2025)
by: Taylor, Russell, et al.
Published: (2025)
RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation
by: Shen, Chengzhi, et al.
Published: (2026)
by: Shen, Chengzhi, et al.
Published: (2026)
Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects
by: Mushtaq, Abdullah, et al.
Published: (2025)
by: Mushtaq, Abdullah, et al.
Published: (2025)
Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset
by: Gabriel, Adrian Garret, et al.
Published: (2024)
by: Gabriel, Adrian Garret, et al.
Published: (2024)
Recursive Agent Optimization
by: Gandhi, Apurva, et al.
Published: (2026)
by: Gandhi, Apurva, et al.
Published: (2026)
PLAGUE: Plug-and-play framework for Lifelong Adaptive Generation of Multi-turn Exploits
by: Bhuiya, Neeladri, et al.
Published: (2025)
by: Bhuiya, Neeladri, et al.
Published: (2025)
Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning
by: Mo, Shentong
Published: (2026)
by: Mo, Shentong
Published: (2026)
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
by: Sun, Chuanneng, et al.
Published: (2024)
by: Sun, Chuanneng, et al.
Published: (2024)
EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce
by: Yu, Minhyeong, et al.
Published: (2026)
by: Yu, Minhyeong, et al.
Published: (2026)
Context, Reasoning, and Hierarchy: A Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP
by: Bogdanov, Igor, et al.
Published: (2026)
by: Bogdanov, Igor, et al.
Published: (2026)
Toward Inclusive Educational AI: Auditing Frontier LLMs through a Multiplexity Lens
by: Mushtaq, Abdullah, et al.
Published: (2025)
by: Mushtaq, Abdullah, et al.
Published: (2025)
StructMem: Structured Memory for Long-Horizon Behavior in LLMs
by: Xu, Buqiang, et al.
Published: (2026)
by: Xu, Buqiang, et al.
Published: (2026)
GenDB: The Next Generation of Query Processing -- Synthesized, Not Engineered
by: Lao, Jiale, et al.
Published: (2026)
by: Lao, Jiale, et al.
Published: (2026)
Towards Emotionally Intelligent and Responsible Reinforcement Learning
by: Keerthana, Garapati, et al.
Published: (2025)
by: Keerthana, Garapati, et al.
Published: (2025)
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents
by: Rahman, Salman, et al.
Published: (2025)
by: Rahman, Salman, et al.
Published: (2025)
Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
by: Zhu, Yuqi, et al.
Published: (2025)
by: Zhu, Yuqi, et al.
Published: (2025)
AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML
by: Trirat, Patara, et al.
Published: (2024)
by: Trirat, Patara, et al.
Published: (2024)
Language Agents as Optimizable Graphs
by: Zhuge, Mingchen, et al.
Published: (2024)
by: Zhuge, Mingchen, et al.
Published: (2024)
DroidSpeak: KV Cache Sharing for Cross-LLM Communication and Multi-LLM Serving
by: Liu, Yuhan, et al.
Published: (2024)
by: Liu, Yuhan, et al.
Published: (2024)
Iterative Graph Alignment
by: Yu, Fangyuan, et al.
Published: (2024)
by: Yu, Fangyuan, et al.
Published: (2024)
MARCO: Multi-Agent Real-time Chat Orchestration
by: Shrimal, Anubhav, et al.
Published: (2024)
by: Shrimal, Anubhav, et al.
Published: (2024)
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
In-Context Environments Induce Evaluation-Awareness in Language Models
by: Chaudhary, Maheep
Published: (2026)
by: Chaudhary, Maheep
Published: (2026)
Similar Items
-
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning
by: Radha, Santosh Kumar, et al.
Published: (2024) -
On the Reasoning Capacity of AI Models and How to Quantify It
by: Radha, Santosh Kumar, et al.
Published: (2025) -
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
by: Wan, Ziyu, et al.
Published: (2025) -
DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration
by: Nourzad, Narjes, et al.
Published: (2025) -
Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures
by: Pandey, Tushar, et al.
Published: (2025)