Saved in:
| Main Authors: | Davoudi, Saeedeh, Iranmanesh, Reihaneh, Frieder, Ophir, Goharian, Nazli |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.30599 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TARAZ: Persian Short-Answer Question Benchmark for Cultural Evaluation of Language Models
by: Iranmanesh, Reihaneh, et al.
Published: (2026)
by: Iranmanesh, Reihaneh, et al.
Published: (2026)
Genetic Approach to Mitigate Hallucination in Generative IR
by: Kulkarni, Hrishikesh, et al.
Published: (2024)
by: Kulkarni, Hrishikesh, et al.
Published: (2024)
Intercept Cancer: Cancer Pre-Screening with Large Scale Healthcare Foundation Models
by: Sun, Liwen, et al.
Published: (2025)
by: Sun, Liwen, et al.
Published: (2025)
Learning to Rank Salient Content for Query-focused Summarization
by: Sotudeh, Sajad, et al.
Published: (2024)
by: Sotudeh, Sajad, et al.
Published: (2024)
LexBoost: Improving Lexical Document Retrieval with Nearest Neighbors
by: Kulkarni, Hrishikesh, et al.
Published: (2024)
by: Kulkarni, Hrishikesh, et al.
Published: (2024)
Benchmarking Hindi LLMs: A New Suite of Datasets and a Comparative Analysis
by: Kamath, Anusha, et al.
Published: (2025)
by: Kamath, Anusha, et al.
Published: (2025)
Generating Text from Uniform Meaning Representation
by: Markle, Emma, et al.
Published: (2025)
by: Markle, Emma, et al.
Published: (2025)
GRIT: Graph-based Recall Improvement for Task-oriented E-commerce Queries
by: Kulkarni, Hrishikesh, et al.
Published: (2025)
by: Kulkarni, Hrishikesh, et al.
Published: (2025)
LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs
by: Davoodi, Arash Gholami, et al.
Published: (2024)
by: Davoodi, Arash Gholami, et al.
Published: (2024)
ControBench: An Interaction-Aware Benchmark for Controversial Discourse Analysis on Social Networks
by: Thuy, Ta Thanh, et al.
Published: (2026)
by: Thuy, Ta Thanh, et al.
Published: (2026)
A Large-Scale Benchmark for Evaluating Large Language Models on Medical Question Answering in Romanian
by: Rogoz, Ana-Cristina, et al.
Published: (2025)
by: Rogoz, Ana-Cristina, et al.
Published: (2025)
Neural Isomorphic Fields: A Transformer-based Algebraic Numerical Embedding
by: Sadeghi, Hamidreza, et al.
Published: (2026)
by: Sadeghi, Hamidreza, et al.
Published: (2026)
Geometry-Aware Decoding with Wasserstein-Regularized Truncation and Mass Penalties for Large Language Models
by: Davoodi, Arash Gholami, et al.
Published: (2026)
by: Davoodi, Arash Gholami, et al.
Published: (2026)
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models
by: Jin, Zhuoran, et al.
Published: (2024)
by: Jin, Zhuoran, et al.
Published: (2024)
DRAMA: Domain Retrieval using Adaptive Module Allocation
by: Kasela, Pranav, et al.
Published: (2026)
by: Kasela, Pranav, et al.
Published: (2026)
A Benchmark Suite of Reddit-Derived Datasets for Mental Health Detection
by: Hasan, Khalid, et al.
Published: (2026)
by: Hasan, Khalid, et al.
Published: (2026)
BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation
by: Kim, Eunsu, et al.
Published: (2025)
by: Kim, Eunsu, et al.
Published: (2025)
Rethinking Machine Unlearning for Large Language Models
by: Liu, Sijia, et al.
Published: (2024)
by: Liu, Sijia, et al.
Published: (2024)
Does Unlearning Truly Unlearn? A Black Box Evaluation of LLM Unlearning Methods
by: Doshi, Jai, et al.
Published: (2024)
by: Doshi, Jai, et al.
Published: (2024)
KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding
by: Hwang, Bokwang, et al.
Published: (2025)
by: Hwang, Bokwang, et al.
Published: (2025)
Large Language Model Unlearning
by: Yao, Yuanshun, et al.
Published: (2023)
by: Yao, Yuanshun, et al.
Published: (2023)
Large Language Models for Mathematicians
by: Frieder, Simon, et al.
Published: (2023)
by: Frieder, Simon, et al.
Published: (2023)
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models
by: Lin, Yujie, et al.
Published: (2026)
by: Lin, Yujie, et al.
Published: (2026)
Dissecting Fine-Tuning Unlearning in Large Language Models
by: Hong, Yihuai, et al.
Published: (2024)
by: Hong, Yihuai, et al.
Published: (2024)
Are Large Language Models Good Temporal Graph Learners?
by: Huang, Shenyang, et al.
Published: (2025)
by: Huang, Shenyang, et al.
Published: (2025)
Align-then-Unlearn: Embedding Alignment for LLM Unlearning
by: Spohn, Philipp, et al.
Published: (2025)
by: Spohn, Philipp, et al.
Published: (2025)
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis
by: Wang, Xu, et al.
Published: (2025)
by: Wang, Xu, et al.
Published: (2025)
Offset Unlearning for Large Language Models
by: Huang, James Y., et al.
Published: (2024)
by: Huang, James Y., et al.
Published: (2024)
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
by: Li, Nathaniel, et al.
Published: (2024)
by: Li, Nathaniel, et al.
Published: (2024)
A Neuro-inspired Interpretation of Unlearning in Large Language Models through Sample-level Unlearning Difficulty
by: Feng, Xiaohua, et al.
Published: (2025)
by: Feng, Xiaohua, et al.
Published: (2025)
Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models
by: Mekala, Anmol, et al.
Published: (2024)
by: Mekala, Anmol, et al.
Published: (2024)
Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models
by: Yamashita, Tomoya, et al.
Published: (2025)
by: Yamashita, Tomoya, et al.
Published: (2025)
Towards Detecting Contextual Real-Time Toxicity for In-Game Chat
by: Yang, Zachary, et al.
Published: (2023)
by: Yang, Zachary, et al.
Published: (2023)
Hierarchical Federated Unlearning for Large Language Models
by: Zhong, Yisheng, et al.
Published: (2025)
by: Zhong, Yisheng, et al.
Published: (2025)
Soft Prompting for Unlearning in Large Language Models
by: Bhaila, Karuna, et al.
Published: (2024)
by: Bhaila, Karuna, et al.
Published: (2024)
Multi-Objective Large Language Model Unlearning
by: Pan, Zibin, et al.
Published: (2024)
by: Pan, Zibin, et al.
Published: (2024)
Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization
by: Guo, Phillip, et al.
Published: (2024)
by: Guo, Phillip, et al.
Published: (2024)
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
by: Guldimann, Philipp, et al.
Published: (2024)
by: Guldimann, Philipp, et al.
Published: (2024)
EvasionBench: A Large-Scale Benchmark for Detecting Managerial Evasion in Earnings Call Q&A
by: Ma, Shijian, et al.
Published: (2026)
by: Ma, Shijian, et al.
Published: (2026)
A Closer Look at Machine Unlearning for Large Language Models
by: Yuan, Xiaojian, et al.
Published: (2024)
by: Yuan, Xiaojian, et al.
Published: (2024)
Similar Items
-
TARAZ: Persian Short-Answer Question Benchmark for Cultural Evaluation of Language Models
by: Iranmanesh, Reihaneh, et al.
Published: (2026) -
Genetic Approach to Mitigate Hallucination in Generative IR
by: Kulkarni, Hrishikesh, et al.
Published: (2024) -
Intercept Cancer: Cancer Pre-Screening with Large Scale Healthcare Foundation Models
by: Sun, Liwen, et al.
Published: (2025) -
Learning to Rank Salient Content for Query-focused Summarization
by: Sotudeh, Sajad, et al.
Published: (2024) -
LexBoost: Improving Lexical Document Retrieval with Nearest Neighbors
by: Kulkarni, Hrishikesh, et al.
Published: (2024)