:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sweed, Nir, Hakim, Hanit, Wolfson, Ben, Lifshitz, Hila, Shahaf, Dafna
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2509.05072
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

One Joke to Rule them All? On the (Im)possibility of Generalizing Humor
by: Turgeman, Mor, et al.
Published: (2025)

ParallelPARC: A Scalable Pipeline for Generating Natural-Language Analogies
by: Sultan, Oren, et al.
Published: (2024)

Visual Editing with LLM-based Tool Chaining: An Efficient Distillation Approach for Real-Time Applications
by: Sultan, Oren, et al.
Published: (2024)

InterFeat: A Pipeline for Finding Interesting Scientific Features
by: Ofer, Dan, et al.
Published: (2025)

Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
by: Mizrahi, Moran, et al.
Published: (2025)

LLMs versus the Halting Problem: Characterizing Program Termination Reasoning
by: Sultan, Oren, et al.
Published: (2026)

Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models
by: Pope, Quintin, et al.
Published: (2026)

Answering Questions by Meta-Reasoning over Multiple Chains of Thought
by: Yoran, Ori, et al.
Published: (2023)

Weakly Supervised Text-to-SQL Parsing through Question Decomposition
by: Wolfson, Tomer, et al.
Published: (2021)

Making Retrieval-Augmented Language Models Robust to Irrelevant Context
by: Yoran, Ori, et al.
Published: (2023)

MUSE: Machine Unlearning Six-Way Evaluation for Language Models
by: Shi, Weijia, et al.
Published: (2024)

Brevity Constraints Reverse Performance Hierarchies in Language Models
by: Hakim, MD Azizul
Published: (2026)

Generating Tables from the Parametric Knowledge of Language Models
by: Berkovitch, Yevgeni, et al.
Published: (2024)

Understanding Enthymemes in Argument Maps: Bridging Argument Mining and Logic-based Argumentation
by: Ben-Naim, Jonathan, et al.
Published: (2024)

EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline
by: Chen, Peter Baile, et al.
Published: (2025)

CUPCase: Clinically Uncommon Patient Cases and Diagnoses Dataset
by: Perets, Oriel, et al.
Published: (2025)

Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
by: Bhambri, Siddhant, et al.
Published: (2025)

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models
by: Yan, Siyu, et al.
Published: (2025)

Simplex-Optimized Hybrid Ensemble for Large Language Model Text Detection Under Generative Distribution Drif
by: Kristanto, Sepyan Purnama, et al.
Published: (2025)

Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation
by: Huang, Sukai, et al.
Published: (2024)

BenchOverflow: Measuring Overflow in Large Language Models via Plain-Text Prompts
by: Feiglin, Erin, et al.
Published: (2026)

MMFactory: A Universal Solution Search Engine for Vision-Language Tasks
by: Fan, Wan-Cyuan, et al.
Published: (2024)

MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling
by: Limisiewicz, Tomasz, et al.
Published: (2024)

Next Token Perception Score: Analytical Assessment of your LLM Perception Skills
by: Cheng, Yu-Ang, et al.
Published: (2025)

TextMineX: Data, Evaluation Framework and Ontology-guided LLM Pipeline for Humanitarian Mine Action
by: Zhou, Chenyue, et al.
Published: (2025)

Advancing NLP Security by Leveraging LLMs as Adversarial Engines
by: Srinivasan, Sudarshan, et al.
Published: (2024)

A Linguistic Analysis of Spontaneous Thoughts: Investigating Experiences of Déjà Vu, Unexpected Thoughts, and Involuntary Autobiographical Memories
by: Venkatesha, Videep, et al.
Published: (2025)

Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space
by: Chen, Zhiliang, et al.
Published: (2025)

XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
by: Dong, Yixin, et al.
Published: (2024)

MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents
by: Wolfson, Tomer, et al.
Published: (2025)

VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning
by: Singh, Vikash, et al.
Published: (2026)

From Reasoning to Generalization: Knowledge-Augmented LLMs for ARC Benchmark
by: Lei, Chao, et al.
Published: (2025)

RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA
by: Yang, Ruiyi, et al.
Published: (2025)

RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines
by: Yu, Pengfei, et al.
Published: (2025)

RATIONALYST: Mining Implicit Rationales for Process Supervision of Reasoning
by: Jiang, Dongwei, et al.
Published: (2024)

Prompt Mining for Language-based Human Mobility Forecasting
by: Xue, Hao, et al.
Published: (2024)

What Makes an Ideal Quote? Recommending "Unexpected yet Rational" Quotations via Novelty
by: Zhang, Bowei, et al.
Published: (2025)

CELL your Model: Contrastive Explanations for Large Language Models
by: Luss, Ronny, et al.
Published: (2024)

On Languaging a Simulation Engine
by: Liu, Han, et al.
Published: (2024)

ProteinJEPA: Latent prediction complements protein language models
by: Ofer, Dan, et al.
Published: (2026)