Saved in:
| Main Authors: | Parfenova, Angelina, Denzler, Alexander, Pfeffer, Juergen |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.00047 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Text Annotation via Inductive Coding: Comparing Human Experts to LLMs in Qualitative Data Analysis
by: Parfenova, Angelina, et al.
Published: (2025)
by: Parfenova, Angelina, et al.
Published: (2025)
From Quotes to Concepts: Axial Coding of Political Debates with Ensemble LMs
by: Parfenova, Angelina, et al.
Published: (2026)
by: Parfenova, Angelina, et al.
Published: (2026)
Automating the Information Extraction from Semi-Structured Interview Transcripts
by: Parfenova, Angelina
Published: (2024)
by: Parfenova, Angelina
Published: (2024)
Risk prediction of pathological gambling on social media
by: Parfenova, Angelina, et al.
Published: (2024)
by: Parfenova, Angelina, et al.
Published: (2024)
CONSCIENTIA: Can LLM Agents Learn to Strategize? Emergent Deception and Trust in a Multi-Agent NYC Simulation
by: Sinha, Aarush, et al.
Published: (2026)
by: Sinha, Aarush, et al.
Published: (2026)
Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents
by: Shao, Shuai, et al.
Published: (2025)
by: Shao, Shuai, et al.
Published: (2025)
When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis
by: Dietrich, Juergen
Published: (2026)
by: Dietrich, Juergen
Published: (2026)
Trust, Lies, and Long Memories: Emergent Social Dynamics and Reputation in Multi-Round Avalon with LLM Agents
by: Ellawela, Suveen
Published: (2026)
by: Ellawela, Suveen
Published: (2026)
Knots: A Large-Scale Multi-Agent Enhanced Expert-Annotated Dataset and LLM Prompt Optimization for NOTAM Semantic Parsing
by: Liu, Maoqi, et al.
Published: (2025)
by: Liu, Maoqi, et al.
Published: (2025)
The Inadequacy of Offline LLM Evaluations: A Need to Account for Personalization in Model Behavior
by: Wang, Angelina, et al.
Published: (2025)
by: Wang, Angelina, et al.
Published: (2025)
Hallucination as output-boundary misclassification: a composite abstention architecture for language models
by: Hintsanen, Angelina
Published: (2026)
by: Hintsanen, Angelina
Published: (2026)
Repurposing Annotation Guidelines to Instruct LLM Annotators: A Case Study
by: Kim, Kon Woo, et al.
Published: (2025)
by: Kim, Kon Woo, et al.
Published: (2025)
MAEBE: Multi-Agent Emergent Behavior Framework
by: Erisken, Sinem, et al.
Published: (2025)
by: Erisken, Sinem, et al.
Published: (2025)
Interpretable Emergent Language Using Inter-Agent Transformers
by: Bhardwaj, Mannan
Published: (2025)
by: Bhardwaj, Mannan
Published: (2025)
MIRIX: Multi-Agent Memory System for LLM-Based Agents
by: Wang, Yu, et al.
Published: (2025)
by: Wang, Yu, et al.
Published: (2025)
Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation
by: Choi, Juhwan, et al.
Published: (2024)
by: Choi, Juhwan, et al.
Published: (2024)
Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation
by: Lin, Minhua, et al.
Published: (2024)
by: Lin, Minhua, et al.
Published: (2024)
AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators
by: Ni, Jingwei, et al.
Published: (2024)
by: Ni, Jingwei, et al.
Published: (2024)
Insight Agents: An LLM-Based Multi-Agent System for Data Insights
by: Bai, Jincheng, et al.
Published: (2026)
by: Bai, Jincheng, et al.
Published: (2026)
An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering
by: Murphy, Alexander, et al.
Published: (2025)
by: Murphy, Alexander, et al.
Published: (2025)
Completing Missing Annotation: Multi-Agent Debate for Accurate and Scalable Relevant Assessment for IR Benchmarks
by: Ban, Minjeong, et al.
Published: (2026)
by: Ban, Minjeong, et al.
Published: (2026)
Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models
by: Dietrich, Juergen
Published: (2026)
by: Dietrich, Juergen
Published: (2026)
Asymmetric Actor-Critic for Multi-turn LLM Agents
by: Jiang, Shuli, et al.
Published: (2026)
by: Jiang, Shuli, et al.
Published: (2026)
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
by: Zhang, Zhiwei, et al.
Published: (2025)
by: Zhang, Zhiwei, et al.
Published: (2025)
Evaluating the Impact of LLM-Assisted Annotation in a Perspectivized Setting: the Case of FrameNet Annotation
by: Belcavello, Frederico, et al.
Published: (2025)
by: Belcavello, Frederico, et al.
Published: (2025)
Enhancing LLM-Based Data Annotation with Error Decomposition
by: Xu, Zhen, et al.
Published: (2026)
by: Xu, Zhen, et al.
Published: (2026)
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration
by: Wang, Zhenhailong, et al.
Published: (2023)
by: Wang, Zhenhailong, et al.
Published: (2023)
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction
by: Chun, Ye Eun, et al.
Published: (2025)
by: Chun, Ye Eun, et al.
Published: (2025)
Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents
by: Yehudai, Asaf, et al.
Published: (2026)
by: Yehudai, Asaf, et al.
Published: (2026)
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key?
by: Wang, Qineng, et al.
Published: (2024)
by: Wang, Qineng, et al.
Published: (2024)
Codebook-Injected Dialogue Segmentation for Multi-Utterance Constructs Annotation: LLM-Assisted and Gold-Label-Free Evaluation
by: Lee, Jinsook, et al.
Published: (2026)
by: Lee, Jinsook, et al.
Published: (2026)
AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration
by: Wang, Zhexuan, et al.
Published: (2025)
by: Wang, Zhexuan, et al.
Published: (2025)
LinguistAgent: A Reflective Multi-Model Platform for Automated Linguistic Annotation
by: Li, Bingru
Published: (2026)
by: Li, Bingru
Published: (2026)
Users as Annotators: LLM Preference Learning from Comparison Mode
by: Cai, Zhongze, et al.
Published: (2025)
by: Cai, Zhongze, et al.
Published: (2025)
SelectLLM: Can LLMs Select Important Instructions to Annotate?
by: Parkar, Ritik Sachin, et al.
Published: (2024)
by: Parkar, Ritik Sachin, et al.
Published: (2024)
Is Your LLM Really Mastering the Concept? A Multi-Agent Benchmark
by: Xu, Shuhang, et al.
Published: (2025)
by: Xu, Shuhang, et al.
Published: (2025)
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
by: Hu, Yuanzhe, et al.
Published: (2025)
by: Hu, Yuanzhe, et al.
Published: (2025)
Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey
by: Guan, Shengyue, et al.
Published: (2025)
by: Guan, Shengyue, et al.
Published: (2025)
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent
by: Shen, Weizhou, et al.
Published: (2024)
by: Shen, Weizhou, et al.
Published: (2024)
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System
by: Chen, Weize, et al.
Published: (2024)
by: Chen, Weize, et al.
Published: (2024)
Similar Items
-
Text Annotation via Inductive Coding: Comparing Human Experts to LLMs in Qualitative Data Analysis
by: Parfenova, Angelina, et al.
Published: (2025) -
From Quotes to Concepts: Axial Coding of Political Debates with Ensemble LMs
by: Parfenova, Angelina, et al.
Published: (2026) -
Automating the Information Extraction from Semi-Structured Interview Transcripts
by: Parfenova, Angelina
Published: (2024) -
Risk prediction of pathological gambling on social media
by: Parfenova, Angelina, et al.
Published: (2024) -
CONSCIENTIA: Can LLM Agents Learn to Strategize? Emergent Deception and Trust in a Multi-Agent NYC Simulation
by: Sinha, Aarush, et al.
Published: (2026)