Library Catalog

Bibliographic Details
Main Authors:	Wang, Kuan; Lu, Yadong; Santacroce, Michael; Gong, Yeyun; Zhang, Chao; Shen, Yelong
Format:	Preprint
Published:	2023
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2310.01444

Similar Items

OmniParser for Pure Vision Based GUI Agent
by: Lu, Yadong, et al.
Published: (2024)

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
by: Gou, Zhibin, et al.
Published: (2023)

Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
by: Sun, Jiashuo, et al.
Published: (2023)

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
by: Gou, Zhibin, et al.
Published: (2023)

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
by: Huang, Yiming, et al.
Published: (2024)

Competition-Level Problems are Effective LLM Evaluators
by: Huang, Yiming, et al.
Published: (2023)

Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
by: Luo, Yi, et al.
Published: (2024)

Rho-1: Not All Tokens Are What You Need
by: Lin, Zhenghao, et al.
Published: (2024)

Exploring and Controlling Diversity in LLM-Agent Conversation
by: Chu, KuanChao, et al.
Published: (2024)

Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling
by: Shin, Haebin, et al.
Published: (2025)

Training Agents with Weakly Supervised Feedback from Large Language Models
by: Gong, Dihong, et al.
Published: (2024)

Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
by: Zhang, Xiaoying, et al.
Published: (2025)

Multi-LoRA Composition for Image Generation
by: Zhong, Ming, et al.
Published: (2024)

StreamAdapter: Efficient Test Time Adaptation from Contextual Streams
by: Muhtar, Dilxat, et al.
Published: (2024)

Enhancing Large Language Model Performance with Gradient-Based Parameter Selection
by: Li, Haoling, et al.
Published: (2024)

LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy
by: Zhang, Rongzhi, et al.
Published: (2024)

Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development
by: Shen, Ming, et al.
Published: (2025)

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
by: Zhang, Zhen, et al.
Published: (2025)

Process-based Self-Rewarding Language Models
by: Zhang, Shimao, et al.
Published: (2025)

Generative Prompt Internalization
by: Shin, Haebin, et al.
Published: (2024)

SynthAgent: Adapting Web Agents with Synthetic Supervision
by: Wang, Zhaoyang, et al.
Published: (2025)

How Does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective
by: Zhang, Shimao, et al.
Published: (2025)

Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training
by: Yang, Kailai, et al.
Published: (2025)

Multi-Agent Comedy Club: Investigating Community Discussion Effects on LLM Humor Generation
by: Hong, Shiwei, et al.
Published: (2026)

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
by: Rosset, Corby, et al.
Published: (2024)

AutoAgent: A Fully-Automated and Zero-Code Framework for LLM Agents
by: Tang, Jiabin, et al.
Published: (2025)

WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning
by: Zhuang, Yuchen, et al.
Published: (2025)

DynamixSFT: Dynamic Mixture Optimization of Instruction Tuning Collections
by: Shin, Haebin, et al.
Published: (2025)

GroundAct: Can LLM Agents Ground Actions in Environmental States?
by: Wang, Zixuan, et al.
Published: (2025)

SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories
by: Yu, Zhuoyun, et al.
Published: (2026)

Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection
by: He, Hongyi, et al.
Published: (2025)

Exploring the Necessity of Reasoning in LLM-based Agent Scenarios
by: Zhou, Xueyang, et al.
Published: (2025)

Optimization Techniques for Sentiment Analysis Based on LLM (GPT-3)
by: Zhan, Tong, et al.
Published: (2024)

One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment
by: Cai, Hongru, et al.
Published: (2026)

QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback-based Self-Correction
by: Huang, Xiang, et al.
Published: (2024)

Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry
by: Peng, Run, et al.
Published: (2025)

AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations
by: Verma, Gaurav, et al.
Published: (2024)

Confirming Correct, Missing the Rest: LLM Tutoring Agents Struggle Where Feedback Matters Most
by: Yasir, Tahreem, et al.
Published: (2026)

PlotGen: Multi-Agent LLM-based Scientific Data Visualization via Multimodal Feedback
by: Goswami, Kanika, et al.
Published: (2025)

Igniting Creative Writing in Small Language Models: LLM-as-a-Judge versus Multi-Agent Refined Rewards
by: Wei, Xiaolong, et al.
Published: (2025)