Enregistré dans:
| Auteurs principaux: | Shakouri, David Ph., Cremers, Crit, Schiller, Niels O. |
|---|---|
| Format: | Preprint |
| Publié: |
2025
|
| Sujets: | |
| Accès en ligne: | https://arxiv.org/abs/2503.18702 |
| Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|
Documents similaires
A Knowledge-Based Language Model: Deducing Grammatical Knowledge in a Multi-Agent Language Acquisition Simulation
par: Shakouri, David Ph., et autres
Publié: (2025)
par: Shakouri, David Ph., et autres
Publié: (2025)
DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflows
par: Gao, Yuxuan, et autres
Publié: (2026)
par: Gao, Yuxuan, et autres
Publié: (2026)
SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use
par: Patel, Hitesh Laxmichand, et autres
Publié: (2025)
par: Patel, Hitesh Laxmichand, et autres
Publié: (2025)
Benefits and Limitations of Communication in Multi-Agent Reasoning
par: Rizvi-Martel, Michael, et autres
Publié: (2025)
par: Rizvi-Martel, Michael, et autres
Publié: (2025)
The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete
par: Großmann, Gerrit, et autres
Publié: (2025)
par: Großmann, Gerrit, et autres
Publié: (2025)
Towards More Human-like AI Communication: A Review of Emergent Communication Research
par: Brandizzi, Nicolo'
Publié: (2023)
par: Brandizzi, Nicolo'
Publié: (2023)
Adversarially Probing Cross-Family Sound Symbolism in 27 Languages
par: Sharma, Anika, et autres
Publié: (2025)
par: Sharma, Anika, et autres
Publié: (2025)
RPRA: Predicting an LLM-Judge for Efficient but Performant Inference
par: Ashley, Dylan R., et autres
Publié: (2026)
par: Ashley, Dylan R., et autres
Publié: (2026)
Subword-Based Comparative Linguistics across 242 Languages Using Wikipedia Glottosets
par: Chelombitko, Iaroslav, et autres
Publié: (2026)
par: Chelombitko, Iaroslav, et autres
Publié: (2026)
When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges
par: Darshan, Parth, et autres
Publié: (2026)
par: Darshan, Parth, et autres
Publié: (2026)
Pioneer Agent: Continual Improvement of Small Language Models in Production
par: Atreja, Dhruv, et autres
Publié: (2026)
par: Atreja, Dhruv, et autres
Publié: (2026)
Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems
par: Berman, Shmuel, et autres
Publié: (2024)
par: Berman, Shmuel, et autres
Publié: (2024)
Shallow Robustness, Deep Vulnerabilities: Multi-Turn Evaluation of Medical LLMs
par: Manczak, Blazej, et autres
Publié: (2025)
par: Manczak, Blazej, et autres
Publié: (2025)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
par: Fadli, Samih
Publié: (2025)
par: Fadli, Samih
Publié: (2025)
Conversational No-code, Multi-agentic Disease Module Identification and Drug Repurposing Prediction with ChatDRex
par: Süwer, Simon, et autres
Publié: (2025)
par: Süwer, Simon, et autres
Publié: (2025)
Grammatically-Guided Sparse Attention for Efficient and Interpretable Transformers
par: Pratyush, Spandan
Publié: (2026)
par: Pratyush, Spandan
Publié: (2026)
The Qualitative Laboratory: Theory Prototyping and Hypothesis Generation with Large Language Models
par: Draelants, Hugues
Publié: (2025)
par: Draelants, Hugues
Publié: (2025)
Mixed-Initiative Dialog for Human-Robot Collaborative Manipulation
par: Yu, Albert, et autres
Publié: (2025)
par: Yu, Albert, et autres
Publié: (2025)
ELMTEX: Fine-Tuning Large Language Models for Structured Clinical Information Extraction. A Case Study on Clinical Reports
par: Guluzade, Aynur, et autres
Publié: (2025)
par: Guluzade, Aynur, et autres
Publié: (2025)
Evaluating Large Language Models for IUCN Red List Species Information
par: Uryu, Shinya
Publié: (2025)
par: Uryu, Shinya
Publié: (2025)
Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries
par: Cacioli, Jon-Paul
Publié: (2026)
par: Cacioli, Jon-Paul
Publié: (2026)
When is dataset cartography ineffective? Using training dynamics does not improve robustness against Adversarial SQuAD
par: Mandal, Paul K.
Publié: (2025)
par: Mandal, Paul K.
Publié: (2025)
PAVE: A Cognitive Architecture for Legitimate Violation in Generative Agent Societies
par: Yehia, Ahmad, et autres
Publié: (2026)
par: Yehia, Ahmad, et autres
Publié: (2026)
Self-Emotion Blended Dialogue Generation in Social Simulation Agents
par: Zhang, Qiang, et autres
Publié: (2024)
par: Zhang, Qiang, et autres
Publié: (2024)
Exploring Model Invariance with Discrete Search for Ultra-Low-Bit Quantization
par: Wen, Yuqiao, et autres
Publié: (2025)
par: Wen, Yuqiao, et autres
Publié: (2025)
"Guinea Pig Trials" Utilizing GPT: A Novel Smart Agent-Based Modeling Approach for Studying Firm Competition and Collusion
par: Han, Xu, et autres
Publié: (2023)
par: Han, Xu, et autres
Publié: (2023)
Ask WhAI:Probing Belief Formation in Role-Primed LLM Agents
par: Moore, Keith, et autres
Publié: (2025)
par: Moore, Keith, et autres
Publié: (2025)
LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance
par: Shi, Jack Wei Lun, et autres
Publié: (2026)
par: Shi, Jack Wei Lun, et autres
Publié: (2026)
PRISMA: Preference-Reinforced Self-Training Approach for Interpretable Emotionally Intelligent Negotiation Dialogues
par: Kajare, Prajwal Vijay, et autres
Publié: (2026)
par: Kajare, Prajwal Vijay, et autres
Publié: (2026)
propella-1: Multi-Property Document Annotation for LLM Data Curation at Scale
par: Idahl, Maximilian, et autres
Publié: (2026)
par: Idahl, Maximilian, et autres
Publié: (2026)
Cost-Aware Model Selection for Text Classification: Multi-Objective Trade-offs Between Fine-Tuned Encoders and LLM Prompting in Production
par: Gonzalez, Alberto Andres Valdes
Publié: (2026)
par: Gonzalez, Alberto Andres Valdes
Publié: (2026)
Computational Economics in Large Language Models: Exploring Model Behavior and Incentive Design under Resource Constraints
par: Reddy, Sandeep, et autres
Publié: (2025)
par: Reddy, Sandeep, et autres
Publié: (2025)
Analysis of LLM as a grammatical feature tagger for African American English
par: Porwal, Rahul, et autres
Publié: (2025)
par: Porwal, Rahul, et autres
Publié: (2025)
The Rise of Verbal Tics in Large Language Models: A Systematic Analysis Across Frontier Models
par: Wu, Shuai, et autres
Publié: (2026)
par: Wu, Shuai, et autres
Publié: (2026)
Agentic Automation of BT-RADS Scoring: End-to-End Multi-Agent System for Standardized Brain Tumor Follow-up Assessment
par: Jabal, Mohamed Sobhi, et autres
Publié: (2026)
par: Jabal, Mohamed Sobhi, et autres
Publié: (2026)
Chain of Unit-Physics: A Primitive-Centric Approach to Scientific Code Synthesis
par: Sharma, Vansh, et autres
Publié: (2025)
par: Sharma, Vansh, et autres
Publié: (2025)
BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Training Data
par: Hsu, Brian, et autres
Publié: (2026)
par: Hsu, Brian, et autres
Publié: (2026)
Model selection meets clinical semantics: Optimizing ICD-10-CM prediction via LLM-as-Judge evaluation, redundancy-aware sampling, and section-aware fine-tuning
par: Dai, Hong-Jie, et autres
Publié: (2025)
par: Dai, Hong-Jie, et autres
Publié: (2025)
Step-Audio-R1 Technical Report
par: Tian, Fei, et autres
Publié: (2025)
par: Tian, Fei, et autres
Publié: (2025)
ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model
par: Pospieszny, Przemek, et autres
Publié: (2025)
par: Pospieszny, Przemek, et autres
Publié: (2025)
Documents similaires
-
A Knowledge-Based Language Model: Deducing Grammatical Knowledge in a Multi-Agent Language Acquisition Simulation
par: Shakouri, David Ph., et autres
Publié: (2025) -
DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflows
par: Gao, Yuxuan, et autres
Publié: (2026) -
SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use
par: Patel, Hitesh Laxmichand, et autres
Publié: (2025) -
Benefits and Limitations of Communication in Multi-Agent Reasoning
par: Rizvi-Martel, Michael, et autres
Publié: (2025) -
The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete
par: Großmann, Gerrit, et autres
Publié: (2025)