Saved in:
| Main Authors: | Liu, Yuchi, Singh, Jaskirat, Liu, Gaowen, Payani, Ali, Zheng, Liang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.20252 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Effective Training Data Synthesis for Improving MLLM Chart Understanding
by: Yang, Yuwei, et al.
Published: (2025)
by: Yang, Yuwei, et al.
Published: (2025)
FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments
by: Saeidi, Amir, et al.
Published: (2026)
by: Saeidi, Amir, et al.
Published: (2026)
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$-bench
by: Mishra, Venkatesh, et al.
Published: (2025)
by: Mishra, Venkatesh, et al.
Published: (2025)
Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
by: Mishra, Venkatesh, et al.
Published: (2025)
by: Mishra, Venkatesh, et al.
Published: (2025)
R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents
by: Jain, Naman, et al.
Published: (2025)
by: Jain, Naman, et al.
Published: (2025)
Enhancing Long Chain-of-Thought Reasoning through Multi-Path Plan Aggregation
by: Xiong, Siheng, et al.
Published: (2025)
by: Xiong, Siheng, et al.
Published: (2025)
Dynamic Optimizations of LLM Ensembles with Two-Stage Reinforcement Learning Agents
by: Tekin, Selim Furkan, et al.
Published: (2025)
by: Tekin, Selim Furkan, et al.
Published: (2025)
Prompt Mining for Language-based Human Mobility Forecasting
by: Xue, Hao, et al.
Published: (2024)
by: Xue, Hao, et al.
Published: (2024)
Which Words Matter Most in Zero-Shot Prompts?
by: Sadr, Nikta Gohari, et al.
Published: (2025)
by: Sadr, Nikta Gohari, et al.
Published: (2025)
Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
by: Huang, Baixiang, et al.
Published: (2025)
by: Huang, Baixiang, et al.
Published: (2025)
GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs
by: Song, Mingyang, et al.
Published: (2025)
by: Song, Mingyang, et al.
Published: (2025)
DiSA: Diffusion Step Annealing in Autoregressive Image Generation
by: Zhao, Qinyu, et al.
Published: (2025)
by: Zhao, Qinyu, et al.
Published: (2025)
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems
by: Lei, Bin, et al.
Published: (2024)
by: Lei, Bin, et al.
Published: (2024)
EpiBench: Benchmarking Multi-turn Research Workflows for Multimodal Agents
by: Dong, Xuan, et al.
Published: (2026)
by: Dong, Xuan, et al.
Published: (2026)
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction
by: Qian, Junlang, et al.
Published: (2025)
by: Qian, Junlang, et al.
Published: (2025)
Diverge to Induce Prompting: Multi-Rationale Induction for Zero-Shot Reasoning
by: Chen, Po-Chun, et al.
Published: (2026)
by: Chen, Po-Chun, et al.
Published: (2026)
Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering
by: Beliaev, Mark, et al.
Published: (2025)
by: Beliaev, Mark, et al.
Published: (2025)
MDBench: A Synthetic Multi-Document Reasoning Benchmark Generated with Knowledge Guidance
by: Peper, Joseph J., et al.
Published: (2025)
by: Peper, Joseph J., et al.
Published: (2025)
Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity
by: Nguyen, Dang, et al.
Published: (2025)
by: Nguyen, Dang, et al.
Published: (2025)
Autonoma: A Hierarchical Multi-Agent Framework for End-to-End Workflow Automation
by: Reda, Eslam, et al.
Published: (2026)
by: Reda, Eslam, et al.
Published: (2026)
Better Zero-Shot Reasoning with Role-Play Prompting
by: Kong, Aobo, et al.
Published: (2023)
by: Kong, Aobo, et al.
Published: (2023)
Toward Zero-Shot Instruction Following
by: Lou, Renze, et al.
Published: (2023)
by: Lou, Renze, et al.
Published: (2023)
Toward Automated Simulation Research Workflow through LLM Prompt Engineering Design
by: Liu, Zhihan, et al.
Published: (2024)
by: Liu, Zhihan, et al.
Published: (2024)
Personalized Federated Fine-tuning for Heterogeneous Data: An Automatic Rank Learning Approach via Two-Level LoRA
by: Hao, Jie, et al.
Published: (2025)
by: Hao, Jie, et al.
Published: (2025)
PLHF: Prompt Optimization with Few-Shot Human Feedback
by: Yang, Chun-Pai, et al.
Published: (2025)
by: Yang, Chun-Pai, et al.
Published: (2025)
Rank-and-Reason: Multi-Agent Collaboration Accelerates Zero-Shot Protein Mutation Prediction
by: Tan, Yang, et al.
Published: (2026)
by: Tan, Yang, et al.
Published: (2026)
Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model
by: Xiong, Siheng, et al.
Published: (2024)
by: Xiong, Siheng, et al.
Published: (2024)
Large Language Models Can Learn Temporal Reasoning
by: Xiong, Siheng, et al.
Published: (2024)
by: Xiong, Siheng, et al.
Published: (2024)
HiCoLoRA: Addressing Context-Prompt Misalignment via Hierarchical Collaborative LoRA for Zero-Shot DST
by: Zhang, Shuyu, et al.
Published: (2025)
by: Zhang, Shuyu, et al.
Published: (2025)
WorkTeam: Constructing Workflows from Natural Language with Multi-Agents
by: Liu, Hanchao, et al.
Published: (2025)
by: Liu, Hanchao, et al.
Published: (2025)
Towards Zero-Shot Multimodal Machine Translation
by: Futeral, Matthieu, et al.
Published: (2024)
by: Futeral, Matthieu, et al.
Published: (2024)
SG-FSM: A Self-Guiding Zero-Shot Prompting Paradigm for Multi-Hop Question Answering Based on Finite State Machine
by: Wang, Xiaochen, et al.
Published: (2024)
by: Wang, Xiaochen, et al.
Published: (2024)
FUSE-ing Language Models: Zero-Shot Adapter Discovery for Prompt Optimization Across Tokenizers
by: Williams, Joshua Nathaniel, et al.
Published: (2024)
by: Williams, Joshua Nathaniel, et al.
Published: (2024)
EduAgentQG: A Multi-Agent Workflow Framework for Personalized Question Generation
by: Jia, Rui, et al.
Published: (2025)
by: Jia, Rui, et al.
Published: (2025)
Communication to Completion: Modeling Collaborative Workflows with Intelligent Multi-Agent Communication
by: Lu, Yiming, et al.
Published: (2025)
by: Lu, Yiming, et al.
Published: (2025)
Zero-Shot Cross-Domain Code Search without Fine-Tuning
by: Liang, Keyu, et al.
Published: (2025)
by: Liang, Keyu, et al.
Published: (2025)
Zero-Shot Hierarchical Classification on the Common Procurement Vocabulary Taxonomy
by: Moiraghi, Federico, et al.
Published: (2024)
by: Moiraghi, Federico, et al.
Published: (2024)
Towards Zero-Shot, Controllable Dialog Planning with LLMs
by: Väth, Dirk, et al.
Published: (2024)
by: Väth, Dirk, et al.
Published: (2024)
FSM: A Finite State Machine Based Zero-Shot Prompting Paradigm for Multi-Hop Question Answering
by: Wang, Xiaochen, et al.
Published: (2024)
by: Wang, Xiaochen, et al.
Published: (2024)
A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition
by: Wang, Zihan, et al.
Published: (2025)
by: Wang, Zihan, et al.
Published: (2025)
Similar Items
-
Effective Training Data Synthesis for Improving MLLM Chart Understanding
by: Yang, Yuwei, et al.
Published: (2025) -
FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments
by: Saeidi, Amir, et al.
Published: (2026) -
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$-bench
by: Mishra, Venkatesh, et al.
Published: (2025) -
Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
by: Mishra, Venkatesh, et al.
Published: (2025) -
R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents
by: Jain, Naman, et al.
Published: (2025)