Saved in:
| Main Authors: | Sweed, Nir, Hakim, Hanit, Wolfson, Ben, Lifshitz, Hila, Shahaf, Dafna |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.05072 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
One Joke to Rule them All? On the (Im)possibility of Generalizing Humor
by: Turgeman, Mor, et al.
Published: (2025)
by: Turgeman, Mor, et al.
Published: (2025)
ParallelPARC: A Scalable Pipeline for Generating Natural-Language Analogies
by: Sultan, Oren, et al.
Published: (2024)
by: Sultan, Oren, et al.
Published: (2024)
Visual Editing with LLM-based Tool Chaining: An Efficient Distillation Approach for Real-Time Applications
by: Sultan, Oren, et al.
Published: (2024)
by: Sultan, Oren, et al.
Published: (2024)
InterFeat: A Pipeline for Finding Interesting Scientific Features
by: Ofer, Dan, et al.
Published: (2025)
by: Ofer, Dan, et al.
Published: (2025)
Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
by: Mizrahi, Moran, et al.
Published: (2025)
by: Mizrahi, Moran, et al.
Published: (2025)
LLMs versus the Halting Problem: Characterizing Program Termination Reasoning
by: Sultan, Oren, et al.
Published: (2026)
by: Sultan, Oren, et al.
Published: (2026)
Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models
by: Pope, Quintin, et al.
Published: (2026)
by: Pope, Quintin, et al.
Published: (2026)
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
by: Yoran, Ori, et al.
Published: (2023)
by: Yoran, Ori, et al.
Published: (2023)
Weakly Supervised Text-to-SQL Parsing through Question Decomposition
by: Wolfson, Tomer, et al.
Published: (2021)
by: Wolfson, Tomer, et al.
Published: (2021)
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
by: Yoran, Ori, et al.
Published: (2023)
by: Yoran, Ori, et al.
Published: (2023)
MUSE: Machine Unlearning Six-Way Evaluation for Language Models
by: Shi, Weijia, et al.
Published: (2024)
by: Shi, Weijia, et al.
Published: (2024)
Brevity Constraints Reverse Performance Hierarchies in Language Models
by: Hakim, MD Azizul
Published: (2026)
by: Hakim, MD Azizul
Published: (2026)
Generating Tables from the Parametric Knowledge of Language Models
by: Berkovitch, Yevgeni, et al.
Published: (2024)
by: Berkovitch, Yevgeni, et al.
Published: (2024)
Understanding Enthymemes in Argument Maps: Bridging Argument Mining and Logic-based Argumentation
by: Ben-Naim, Jonathan, et al.
Published: (2024)
by: Ben-Naim, Jonathan, et al.
Published: (2024)
EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline
by: Chen, Peter Baile, et al.
Published: (2025)
by: Chen, Peter Baile, et al.
Published: (2025)
CUPCase: Clinically Uncommon Patient Cases and Diagnoses Dataset
by: Perets, Oriel, et al.
Published: (2025)
by: Perets, Oriel, et al.
Published: (2025)
Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
by: Bhambri, Siddhant, et al.
Published: (2025)
by: Bhambri, Siddhant, et al.
Published: (2025)
MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models
by: Yan, Siyu, et al.
Published: (2025)
by: Yan, Siyu, et al.
Published: (2025)
Simplex-Optimized Hybrid Ensemble for Large Language Model Text Detection Under Generative Distribution Drif
by: Kristanto, Sepyan Purnama, et al.
Published: (2025)
by: Kristanto, Sepyan Purnama, et al.
Published: (2025)
Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation
by: Huang, Sukai, et al.
Published: (2024)
by: Huang, Sukai, et al.
Published: (2024)
BenchOverflow: Measuring Overflow in Large Language Models via Plain-Text Prompts
by: Feiglin, Erin, et al.
Published: (2026)
by: Feiglin, Erin, et al.
Published: (2026)
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks
by: Fan, Wan-Cyuan, et al.
Published: (2024)
by: Fan, Wan-Cyuan, et al.
Published: (2024)
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling
by: Limisiewicz, Tomasz, et al.
Published: (2024)
by: Limisiewicz, Tomasz, et al.
Published: (2024)
Next Token Perception Score: Analytical Assessment of your LLM Perception Skills
by: Cheng, Yu-Ang, et al.
Published: (2025)
by: Cheng, Yu-Ang, et al.
Published: (2025)
TextMineX: Data, Evaluation Framework and Ontology-guided LLM Pipeline for Humanitarian Mine Action
by: Zhou, Chenyue, et al.
Published: (2025)
by: Zhou, Chenyue, et al.
Published: (2025)
Advancing NLP Security by Leveraging LLMs as Adversarial Engines
by: Srinivasan, Sudarshan, et al.
Published: (2024)
by: Srinivasan, Sudarshan, et al.
Published: (2024)
A Linguistic Analysis of Spontaneous Thoughts: Investigating Experiences of Déjà Vu, Unexpected Thoughts, and Involuntary Autobiographical Memories
by: Venkatesha, Videep, et al.
Published: (2025)
by: Venkatesha, Videep, et al.
Published: (2025)
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space
by: Chen, Zhiliang, et al.
Published: (2025)
by: Chen, Zhiliang, et al.
Published: (2025)
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
by: Dong, Yixin, et al.
Published: (2024)
by: Dong, Yixin, et al.
Published: (2024)
MoNaCo: More Natural and Complex Questions for Reasoning Across Dozens of Documents
by: Wolfson, Tomer, et al.
Published: (2025)
by: Wolfson, Tomer, et al.
Published: (2025)
VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning
by: Singh, Vikash, et al.
Published: (2026)
by: Singh, Vikash, et al.
Published: (2026)
From Reasoning to Generalization: Knowledge-Augmented LLMs for ARC Benchmark
by: Lei, Chao, et al.
Published: (2025)
by: Lei, Chao, et al.
Published: (2025)
RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA
by: Yang, Ruiyi, et al.
Published: (2025)
by: Yang, Ruiyi, et al.
Published: (2025)
RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines
by: Yu, Pengfei, et al.
Published: (2025)
by: Yu, Pengfei, et al.
Published: (2025)
RATIONALYST: Mining Implicit Rationales for Process Supervision of Reasoning
by: Jiang, Dongwei, et al.
Published: (2024)
by: Jiang, Dongwei, et al.
Published: (2024)
Prompt Mining for Language-based Human Mobility Forecasting
by: Xue, Hao, et al.
Published: (2024)
by: Xue, Hao, et al.
Published: (2024)
What Makes an Ideal Quote? Recommending "Unexpected yet Rational" Quotations via Novelty
by: Zhang, Bowei, et al.
Published: (2025)
by: Zhang, Bowei, et al.
Published: (2025)
CELL your Model: Contrastive Explanations for Large Language Models
by: Luss, Ronny, et al.
Published: (2024)
by: Luss, Ronny, et al.
Published: (2024)
On Languaging a Simulation Engine
by: Liu, Han, et al.
Published: (2024)
by: Liu, Han, et al.
Published: (2024)
ProteinJEPA: Latent prediction complements protein language models
by: Ofer, Dan, et al.
Published: (2026)
by: Ofer, Dan, et al.
Published: (2026)
Similar Items
-
One Joke to Rule them All? On the (Im)possibility of Generalizing Humor
by: Turgeman, Mor, et al.
Published: (2025) -
ParallelPARC: A Scalable Pipeline for Generating Natural-Language Analogies
by: Sultan, Oren, et al.
Published: (2024) -
Visual Editing with LLM-based Tool Chaining: An Efficient Distillation Approach for Real-Time Applications
by: Sultan, Oren, et al.
Published: (2024) -
InterFeat: A Pipeline for Finding Interesting Scientific Features
by: Ofer, Dan, et al.
Published: (2025) -
Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
by: Mizrahi, Moran, et al.
Published: (2025)