Saved in:
| Main Authors: | Mama, Eden, Sheri, Liel, Aperstein, Yehudit, Apartsin, Alexander |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.11803 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Controlled Synthetic Benchmark for Educational Aspect-Based Sentiment Analysis
by: Aperstein, Yehudit, et al.
Published: (2026)
by: Aperstein, Yehudit, et al.
Published: (2026)
IRC-Bench: Recognizing Entities from Contextual Cues in First-Person Reminiscences
by: Aperstein, Yehudit, et al.
Published: (2026)
by: Aperstein, Yehudit, et al.
Published: (2026)
Toward a Benchmark for Controllable Simulation of Imperfect Students with Large Language Models
by: Apartsin, Alexander, et al.
Published: (2026)
by: Apartsin, Alexander, et al.
Published: (2026)
SeaAlert: Critical Information Extraction From Maritime Distress Communications with Large Language Models
by: Atia, Tomer, et al.
Published: (2026)
by: Atia, Tomer, et al.
Published: (2026)
From Joy to Fear: A Benchmark of Emotion Estimation in Pop Song Lyrics
by: Dahary, Shay, et al.
Published: (2025)
by: Dahary, Shay, et al.
Published: (2025)
When Curiosity Signals Danger: Predicting Health Crises Through Online Medication Inquiries
by: Goncharok, Dvora, et al.
Published: (2025)
by: Goncharok, Dvora, et al.
Published: (2025)
Reliable Extraction of Clinical Follow-Up Instructions: A Hybrid Neural-Symbolic Pipeline
by: Laufer, Michal, et al.
Published: (2026)
by: Laufer, Michal, et al.
Published: (2026)
Reading Between the Lines: Classifying Resume Seniority with Large Language Models
by: Cohen, Matan, et al.
Published: (2025)
by: Cohen, Matan, et al.
Published: (2025)
An Interpretable Benchmark for Clickbait Detection and Tactic Attribution
by: Nofar, Lihi, et al.
Published: (2025)
by: Nofar, Lihi, et al.
Published: (2025)
CalexNet: Soft Cascade-Aligned Training and Calibration for Lightweight Early-Exit Branches
by: Aperstein, Yehudit, et al.
Published: (2025)
by: Aperstein, Yehudit, et al.
Published: (2025)
Explainable Semantic Text Relations: A Question-Answering Framework for Comparing Document Content
by: Aperstein, Yehudit, et al.
Published: (2025)
by: Aperstein, Yehudit, et al.
Published: (2025)
LLM-guided headline rewriting for clickability enhancement without clickbait
by: Aperstein, Yehudit, et al.
Published: (2026)
by: Aperstein, Yehudit, et al.
Published: (2026)
DGLD: Domain-Gated Latent Diffusion for the Discovery of Novel Energetic Materials
by: Aperstein, Yehudit, et al.
Published: (2026)
by: Aperstein, Yehudit, et al.
Published: (2026)
Do Large Language Models Need Intent? Revisiting Response Generation Strategies for Service Assistant
by: Bolshinsky, Inbal, et al.
Published: (2025)
by: Bolshinsky, Inbal, et al.
Published: (2025)
Code Review Without Borders: Evaluating Synthetic vs. Real Data for Review Recommendation
by: Cohen, Yogev, et al.
Published: (2025)
by: Cohen, Yogev, et al.
Published: (2025)
Stitching the Story: Creating Panoramic Incident Summaries from Body-Worn Footage
by: Cohen, Dor, et al.
Published: (2025)
by: Cohen, Dor, et al.
Published: (2025)
Beyond Words: Interjection Classification for Improved Human-Computer Interaction
by: Goren, Yaniv, et al.
Published: (2025)
by: Goren, Yaniv, et al.
Published: (2025)
Acting on the Unseen: Communication-Free Collaborative Filtering for Decentralized Multi-Robot Task Allocation
by: Apartsin, Alexander, et al.
Published: (2026)
by: Apartsin, Alexander, et al.
Published: (2026)
Mapping License Plate Recoverability Under Extreme Viewing Angles for Oppor-tunistic Urban Sensing
by: Adamenko, Igor, et al.
Published: (2026)
by: Adamenko, Igor, et al.
Published: (2026)
Multi-pathology Chest X-ray Classification with Rejection Mechanisms
by: Aperstein, Yehudit, et al.
Published: (2025)
by: Aperstein, Yehudit, et al.
Published: (2025)
Enhancing Classification of Streaming Data with Image Distillation
by: Khatib, Rwad, et al.
Published: (2025)
by: Khatib, Rwad, et al.
Published: (2025)
PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization
by: Lahiany, Assaf, et al.
Published: (2025)
by: Lahiany, Assaf, et al.
Published: (2025)
Learning from Child-Directed Speech in Two-Language Scenarios: A French-English Case Study
by: Binyamin, Liel, et al.
Published: (2026)
by: Binyamin, Liel, et al.
Published: (2026)
Improving Deep Tabular Learning
by: Sarafian, Sivan, et al.
Published: (2025)
by: Sarafian, Sivan, et al.
Published: (2025)
From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs
by: Guo, Xiaoyong, et al.
Published: (2026)
by: Guo, Xiaoyong, et al.
Published: (2026)
Extracting Biomedical Entities from Noisy Audio Transcripts
by: Ebadi, Nima, et al.
Published: (2024)
by: Ebadi, Nima, et al.
Published: (2024)
Recognition of Abnormal Events in Surveillance Videos using Weakly Supervised Dual-Encoder Models
by: Tsfaty, Noam, et al.
Published: (2025)
by: Tsfaty, Noam, et al.
Published: (2025)
WildSpeech-Bench: Benchmarking End-to-End SpeechLLMs in the Wild
by: Zhang, Linhao, et al.
Published: (2025)
by: Zhang, Linhao, et al.
Published: (2025)
XAI-Based Detection of Adversarial Attacks on Deepfake Detectors
by: Pinhasov, Ben, et al.
Published: (2024)
by: Pinhasov, Ben, et al.
Published: (2024)
From Surveys to Narratives: Rethinking Cultural Value Adaptation in LLMs
by: Adilazuarda, Muhammad Farid, et al.
Published: (2025)
by: Adilazuarda, Muhammad Farid, et al.
Published: (2025)
How AI Forecasts AI Jobs: Benchmarking LLM Predictions of Labor Market Changes
by: Osborn, Sheri, et al.
Published: (2025)
by: Osborn, Sheri, et al.
Published: (2025)
When Can We Trust LLMs in Mental Health? Large-Scale Benchmarks for Reliable LLM Evaluation
by: Badawi, Abeer, et al.
Published: (2025)
by: Badawi, Abeer, et al.
Published: (2025)
MediEval: A Unified Medical Benchmark for Patient-Contextual and Knowledge-Grounded Reasoning in LLMs
by: Qu, Zhan, et al.
Published: (2025)
by: Qu, Zhan, et al.
Published: (2025)
InsightFlow: LLM-Driven Synthesis of Patient Narratives for Mental Health into Causal Models
by: Gupta, Shreya, et al.
Published: (2026)
by: Gupta, Shreya, et al.
Published: (2026)
We Should Chart an Atlas of All the World's Models
by: Horwitz, Eliahu, et al.
Published: (2025)
by: Horwitz, Eliahu, et al.
Published: (2025)
Mapping News Narratives Using LLMs and Narrative-Structured Text Embeddings
by: Elfes, Jan
Published: (2024)
by: Elfes, Jan
Published: (2024)
Narrative Landscape: Mapping Narrative Dispositions Across LLMs
by: Jung, Donghoon, et al.
Published: (2026)
by: Jung, Donghoon, et al.
Published: (2026)
Beyond Idealized Patients: Evaluating LLMs under Challenging Patient Behaviors in Medical Consultations
by: Li, Yahan, et al.
Published: (2026)
by: Li, Yahan, et al.
Published: (2026)
ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark
by: Nguyen, Tung X., et al.
Published: (2026)
by: Nguyen, Tung X., et al.
Published: (2026)
Towards Reliable Medical LLMs: Benchmarking and Enhancing Confidence Estimation of Large Language Models in Medical Consultation
by: Ren, Zhiyao, et al.
Published: (2026)
by: Ren, Zhiyao, et al.
Published: (2026)
Similar Items
-
A Controlled Synthetic Benchmark for Educational Aspect-Based Sentiment Analysis
by: Aperstein, Yehudit, et al.
Published: (2026) -
IRC-Bench: Recognizing Entities from Contextual Cues in First-Person Reminiscences
by: Aperstein, Yehudit, et al.
Published: (2026) -
Toward a Benchmark for Controllable Simulation of Imperfect Students with Large Language Models
by: Apartsin, Alexander, et al.
Published: (2026) -
SeaAlert: Critical Information Extraction From Maritime Distress Communications with Large Language Models
by: Atia, Tomer, et al.
Published: (2026) -
From Joy to Fear: A Benchmark of Emotion Estimation in Pop Song Lyrics
by: Dahary, Shay, et al.
Published: (2025)