Library Catalog

Bibliographic Details
Main Authors:	Shlomi, Eliezer; Levy, Ido; Shapira, Eilam; Katz, Michael; Uziel, Guy; Shlomov, Segev; Mashkif, Nir; Reichart, Roi; Keren, Sarah
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.04557

Similar Items

TabAgent: A Framework for Replacing Agentic Generative Components with Tabular-Textual Classifiers
by: Levy, Ido, et al.
Published: (2026)

TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields
by: Arazi, Alan, et al.
Published: (2025)

The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents
by: Shapira, Eilam, et al.
Published: (2026)

Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling
by: Shapira, Eilam, et al.
Published: (2026)

Can LLMs Replace Economic Choice Prediction Labs? The Case of Language-based Persuasion Games
by: Shapira, Eilam, et al.
Published: (2024)

Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning
by: Dymkiewicz, Kajetan, et al.
Published: (2025)

From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production
by: Shlomov, Segev, et al.
Published: (2025)

Towards Enterprise-Ready Computer Using Generalist Agent
by: Marreed, Sami, et al.
Published: (2025)

GLEE: A Unified Framework and Benchmark for Language-based Economic Environments
by: Shapira, Eilam, et al.
Published: (2024)

SNAP: Semantic Stories for Next Activity Prediction
by: Oved, Alon, et al.
Published: (2024)

Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation
by: Shapira, Eilam, et al.
Published: (2023)

What's the Plan? Evaluating and Developing Planning-Aware Techniques for Language Models
by: Hirsch, Eran, et al.
Published: (2024)

Governance by Construction for Generalist Agents
by: Shlomov, Segev, et al.
Published: (2026)

Multi-Review Fusion-in-Context
by: Slobodkin, Aviv, et al.
Published: (2024)

MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image
by: Arazi, Alan, et al.
Published: (2026)

From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
by: Shlomov, Segev, et al.
Published: (2024)

Text2Model: Text-based Model Induction for Zero-shot Image Classification
by: Amosy, Ohad, et al.
Published: (2022)

On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
by: Calderon, Nitay, et al.
Published: (2024)

A Systematic Review of NLP for Dementia -- Tasks, Datasets and Opportunities
by: Peled-Cohen, Lotem, et al.
Published: (2024)

An Information-Theoretic Approach to Identifying Formulaic Clusters in Textual Data
by: Yoffe, Gideon, et al.
Published: (2025)

The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs
by: Calderon, Nitay, et al.
Published: (2025)

Multi-Domain Explainability of Preferences
by: Calderon, Nitay, et al.
Published: (2025)

CRISP: Complex Reasoning with Interpretable Step-based Plans
by: Vetzler, Matan, et al.
Published: (2025)

A Unifying Scheme for Extractive Content Selection Tasks
by: Amar, Shmuel, et al.
Published: (2025)

Motivation in Large Language Models
by: Nahum, Omer, et al.
Published: (2026)

The Power of Summary-Source Alignments
by: Ernst, Ori, et al.
Published: (2024)

DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models
by: Ventura, Mor, et al.
Published: (2025)

ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
by: Levy, Ido, et al.
Published: (2024)

AgentFixer: From Failure Detection to Fix Recommendations in LLM Agentic Systems
by: Mulian, Hadar, et al.
Published: (2026)

LIBERTy: A Causal Framework for Benchmarking Concept-Based Explanations of LLMs with Structural Counterfactuals
by: Toker, Gilat, et al.
Published: (2026)

Systematic Biases in LLM Simulations of Debates
by: Taubenfeld, Amir, et al.
Published: (2024)

Consensus or Conflict? Fine-Grained Evaluation of Conflicting Answers in Question-Answering
by: Nachshoni, Eviatar, et al.
Published: (2025)

Multilinguality at the Edge: Developing Language Models for the Global South
by: Miranda, Lester James V., et al.
Published: (2026)

Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance
by: Nahum, Omer, et al.
Published: (2024)

EncodeRec: An Embedding Backbone for Recommendation Systems
by: Hadad, Guy, et al.
Published: (2026)

Can LLMs Learn Macroeconomic Narratives from Social Media?
by: Gueta, Almog, et al.
Published: (2024)

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation
by: Nakash, Itay, et al.
Published: (2025)

Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs
by: Peisakhovsky, Yehonatan, et al.
Published: (2025)

Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models
by: Ventura, Mor, et al.
Published: (2023)

NL-Eye: Abductive NLI for Images
by: Ventura, Mor, et al.
Published: (2024)