Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chernyshev, Konstantin, Artemova, Ekaterina, Zhukov, Viacheslav, Nerush, Maksim, Fedorova, Mariia, Repik, Iryna, Shapovalova, Olga, Sukhorosov, Aleksey, Dobrovolskii, Vladimir, Mikhailova, Natalia, Tilga, Sergei
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.01119
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

Tendem is a hybrid system where AI handles structured, repeatable work and Human Experts step in when the models fail or to verify results. Each result undergoes a comprehensive quality review before delivery to the Client. To assess Tendem's performance, we conducted a series of in-house evaluations on 94 real-world tasks, comparing it with AI-only agents and human-only workflows carried out by Upwork freelancers. The results show that Tendem consistently delivers higher-quality outputs with faster turnaround times. At the same time, its operational costs remain comparable to human-only execution. On third-party agentic benchmarks, Tendem's AI Agent (operating autonomously, without human involvement) performs near state-of-the-art on web browsing and tool-use tasks while demonstrating strong results in frontier domain knowledge and reasoning.

Similar Items