Saved in:
| Main Authors: | Shlomi, Eliezer, Levy, Ido, Shapira, Eilam, Katz, Michael, Uziel, Guy, Shlomov, Segev, Mashkif, Nir, Reichart, Roi, Keren, Sarah |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04557 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TabAgent: A Framework for Replacing Agentic Generative Components with Tabular-Textual Classifiers
by: Levy, Ido, et al.
Published: (2026)
by: Levy, Ido, et al.
Published: (2026)
TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields
by: Arazi, Alan, et al.
Published: (2025)
by: Arazi, Alan, et al.
Published: (2025)
The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents
by: Shapira, Eilam, et al.
Published: (2026)
by: Shapira, Eilam, et al.
Published: (2026)
Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling
by: Shapira, Eilam, et al.
Published: (2026)
by: Shapira, Eilam, et al.
Published: (2026)
Can LLMs Replace Economic Choice Prediction Labs? The Case of Language-based Persuasion Games
by: Shapira, Eilam, et al.
Published: (2024)
by: Shapira, Eilam, et al.
Published: (2024)
Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning
by: Dymkiewicz, Kajetan, et al.
Published: (2025)
by: Dymkiewicz, Kajetan, et al.
Published: (2025)
From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production
by: Shlomov, Segev, et al.
Published: (2025)
by: Shlomov, Segev, et al.
Published: (2025)
Towards Enterprise-Ready Computer Using Generalist Agent
by: Marreed, Sami, et al.
Published: (2025)
by: Marreed, Sami, et al.
Published: (2025)
GLEE: A Unified Framework and Benchmark for Language-based Economic Environments
by: Shapira, Eilam, et al.
Published: (2024)
by: Shapira, Eilam, et al.
Published: (2024)
SNAP: Semantic Stories for Next Activity Prediction
by: Oved, Alon, et al.
Published: (2024)
by: Oved, Alon, et al.
Published: (2024)
Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation
by: Shapira, Eilam, et al.
Published: (2023)
by: Shapira, Eilam, et al.
Published: (2023)
What's the Plan? Evaluating and Developing Planning-Aware Techniques for Language Models
by: Hirsch, Eran, et al.
Published: (2024)
by: Hirsch, Eran, et al.
Published: (2024)
Governance by Construction for Generalist Agents
by: Shlomov, Segev, et al.
Published: (2026)
by: Shlomov, Segev, et al.
Published: (2026)
Multi-Review Fusion-in-Context
by: Slobodkin, Aviv, et al.
Published: (2024)
by: Slobodkin, Aviv, et al.
Published: (2024)
MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image
by: Arazi, Alan, et al.
Published: (2026)
by: Arazi, Alan, et al.
Published: (2026)
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
by: Shlomov, Segev, et al.
Published: (2024)
by: Shlomov, Segev, et al.
Published: (2024)
Text2Model: Text-based Model Induction for Zero-shot Image Classification
by: Amosy, Ohad, et al.
Published: (2022)
by: Amosy, Ohad, et al.
Published: (2022)
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
by: Calderon, Nitay, et al.
Published: (2024)
by: Calderon, Nitay, et al.
Published: (2024)
A Systematic Review of NLP for Dementia -- Tasks, Datasets and Opportunities
by: Peled-Cohen, Lotem, et al.
Published: (2024)
by: Peled-Cohen, Lotem, et al.
Published: (2024)
An Information-Theoretic Approach to Identifying Formulaic Clusters in Textual Data
by: Yoffe, Gideon, et al.
Published: (2025)
by: Yoffe, Gideon, et al.
Published: (2025)
The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs
by: Calderon, Nitay, et al.
Published: (2025)
by: Calderon, Nitay, et al.
Published: (2025)
Multi-Domain Explainability of Preferences
by: Calderon, Nitay, et al.
Published: (2025)
by: Calderon, Nitay, et al.
Published: (2025)
CRISP: Complex Reasoning with Interpretable Step-based Plans
by: Vetzler, Matan, et al.
Published: (2025)
by: Vetzler, Matan, et al.
Published: (2025)
A Unifying Scheme for Extractive Content Selection Tasks
by: Amar, Shmuel, et al.
Published: (2025)
by: Amar, Shmuel, et al.
Published: (2025)
Motivation in Large Language Models
by: Nahum, Omer, et al.
Published: (2026)
by: Nahum, Omer, et al.
Published: (2026)
The Power of Summary-Source Alignments
by: Ernst, Ori, et al.
Published: (2024)
by: Ernst, Ori, et al.
Published: (2024)
DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models
by: Ventura, Mor, et al.
Published: (2025)
by: Ventura, Mor, et al.
Published: (2025)
ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
by: Levy, Ido, et al.
Published: (2024)
by: Levy, Ido, et al.
Published: (2024)
AgentFixer: From Failure Detection to Fix Recommendations in LLM Agentic Systems
by: Mulian, Hadar, et al.
Published: (2026)
by: Mulian, Hadar, et al.
Published: (2026)
LIBERTy: A Causal Framework for Benchmarking Concept-Based Explanations of LLMs with Structural Counterfactuals
by: Toker, Gilat, et al.
Published: (2026)
by: Toker, Gilat, et al.
Published: (2026)
Systematic Biases in LLM Simulations of Debates
by: Taubenfeld, Amir, et al.
Published: (2024)
by: Taubenfeld, Amir, et al.
Published: (2024)
Consensus or Conflict? Fine-Grained Evaluation of Conflicting Answers in Question-Answering
by: Nachshoni, Eviatar, et al.
Published: (2025)
by: Nachshoni, Eviatar, et al.
Published: (2025)
Multilinguality at the Edge: Developing Language Models for the Global South
by: Miranda, Lester James V., et al.
Published: (2026)
by: Miranda, Lester James V., et al.
Published: (2026)
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance
by: Nahum, Omer, et al.
Published: (2024)
by: Nahum, Omer, et al.
Published: (2024)
EncodeRec: An Embedding Backbone for Recommendation Systems
by: Hadad, Guy, et al.
Published: (2026)
by: Hadad, Guy, et al.
Published: (2026)
Can LLMs Learn Macroeconomic Narratives from Social Media?
by: Gueta, Almog, et al.
Published: (2024)
by: Gueta, Almog, et al.
Published: (2024)
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation
by: Nakash, Itay, et al.
Published: (2025)
by: Nakash, Itay, et al.
Published: (2025)
Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs
by: Peisakhovsky, Yehonatan, et al.
Published: (2025)
by: Peisakhovsky, Yehonatan, et al.
Published: (2025)
Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models
by: Ventura, Mor, et al.
Published: (2023)
by: Ventura, Mor, et al.
Published: (2023)
NL-Eye: Abductive NLI for Images
by: Ventura, Mor, et al.
Published: (2024)
by: Ventura, Mor, et al.
Published: (2024)
Similar Items
-
TabAgent: A Framework for Replacing Agentic Generative Components with Tabular-Textual Classifiers
by: Levy, Ido, et al.
Published: (2026) -
TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields
by: Arazi, Alan, et al.
Published: (2025) -
The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents
by: Shapira, Eilam, et al.
Published: (2026) -
Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling
by: Shapira, Eilam, et al.
Published: (2026) -
Can LLMs Replace Economic Choice Prediction Labs? The Case of Language-based Persuasion Games
by: Shapira, Eilam, et al.
Published: (2024)