Saved in:
| Main Authors: | Afzal, Anum, Chalumattu, Ribin, Matthes, Florian, Mascarell, Laura |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.11591 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset
by: Mascarell, Laura, et al.
Published: (2024)
by: Mascarell, Laura, et al.
Published: (2024)
Can Smaller LLMs do better? Unlocking Cross-Domain Potential through Parameter-Efficient Fine-Tuning for Text Summarization
by: Afzal, Anum, et al.
Published: (2025)
by: Afzal, Anum, et al.
Published: (2025)
FActBench: A Benchmark for Fine-grained Automatic Evaluation of LLM-Generated Text in the Medical Domain
by: Afzal, Anum, et al.
Published: (2025)
by: Afzal, Anum, et al.
Published: (2025)
JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
by: Afzal, Anum, et al.
Published: (2025)
by: Afzal, Anum, et al.
Published: (2025)
Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop
by: Afzal, Anum, et al.
Published: (2024)
by: Afzal, Anum, et al.
Published: (2024)
Knowing Before Saying: LLM Representations Encode Information About Chain-of-Thought Success Before Completion
by: Afzal, Anum, et al.
Published: (2025)
by: Afzal, Anum, et al.
Published: (2025)
AdaptEval: A Benchmark for Evaluating Large Language Models on Code Snippet Adaptation
by: Zhang, Tanghaoran, et al.
Published: (2026)
by: Zhang, Tanghaoran, et al.
Published: (2026)
Enhancing Answer Attribution for Faithful Text Generation with Large Language Models
by: Vladika, Juraj, et al.
Published: (2024)
by: Vladika, Juraj, et al.
Published: (2024)
Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting
by: Meisenbacher, Stephen, et al.
Published: (2024)
by: Meisenbacher, Stephen, et al.
Published: (2024)
A Comparative Analysis of Conversational Large Language Models in Knowledge-Based Text Generation
by: Schneider, Phillip, et al.
Published: (2024)
by: Schneider, Phillip, et al.
Published: (2024)
Facts Fade Fast: Evaluating Memorization of Outdated Medical Knowledge in Large Language Models
by: Vladika, Juraj, et al.
Published: (2025)
by: Vladika, Juraj, et al.
Published: (2025)
Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches
by: Afzal, Anum, et al.
Published: (2026)
by: Afzal, Anum, et al.
Published: (2026)
Adapted Large Language Models Can Outperform Medical Experts in Clinical Text Summarization
by: Van Veen, Dave, et al.
Published: (2023)
by: Van Veen, Dave, et al.
Published: (2023)
DP-MLM: Differentially Private Text Rewriting Using Masked Language Models
by: Meisenbacher, Stephen, et al.
Published: (2024)
by: Meisenbacher, Stephen, et al.
Published: (2024)
MedREQAL: Examining Medical Knowledge Recall of Large Language Models via Question Answering
by: Vladika, Juraj, et al.
Published: (2024)
by: Vladika, Juraj, et al.
Published: (2024)
With Privacy, Size Matters: On the Importance of Dataset Size in Differentially Private Text Rewriting
by: Meisenbacher, Stephen, et al.
Published: (2025)
by: Meisenbacher, Stephen, et al.
Published: (2025)
Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data
by: Afzal, Anum, et al.
Published: (2024)
by: Afzal, Anum, et al.
Published: (2024)
Just Rewrite It Again: A Post-Processing Method for Enhanced Semantic Similarity and Privacy Preservation of Differentially Private Rewritten Text
by: Meisenbacher, Stephen, et al.
Published: (2024)
by: Meisenbacher, Stephen, et al.
Published: (2024)
A Systematic Exploration of Text Decomposition and Budget Distribution in Differentially Private Text Obfuscation
by: Meisenbacher, Stephen, et al.
Published: (2026)
by: Meisenbacher, Stephen, et al.
Published: (2026)
Evaluating Large Language Models in Semantic Parsing for Conversational Question Answering over Knowledge Graphs
by: Schneider, Phillip, et al.
Published: (2024)
by: Schneider, Phillip, et al.
Published: (2024)
Comparing Knowledge Sources for Open-Domain Scientific Claim Verification
by: Vladika, Juraj, et al.
Published: (2024)
by: Vladika, Juraj, et al.
Published: (2024)
On the Impact of Noise in Differentially Private Text Rewriting
by: Meisenbacher, Stephen, et al.
Published: (2025)
by: Meisenbacher, Stephen, et al.
Published: (2025)
NLP-KG: A System for Exploratory Search of Scientific Literature in Natural Language Processing
by: Schopf, Tim, et al.
Published: (2024)
by: Schopf, Tim, et al.
Published: (2024)
Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise: A Case Study on Chinese Legal Domain
by: wan, Zhen, et al.
Published: (2023)
by: wan, Zhen, et al.
Published: (2023)
StrucText-Eval: Evaluating Large Language Model's Reasoning Ability in Structure-Rich Text
by: Gu, Zhouhong, et al.
Published: (2024)
by: Gu, Zhouhong, et al.
Published: (2024)
1-Diffractor: Efficient and Utility-Preserving Text Obfuscation Leveraging Word-Level Metric Differential Privacy
by: Meisenbacher, Stephen, et al.
Published: (2024)
by: Meisenbacher, Stephen, et al.
Published: (2024)
Scaling Up Summarization: Leveraging Large Language Models for Long Text Extractive Summarization
by: Hemamou, Léo, et al.
Published: (2024)
by: Hemamou, Léo, et al.
Published: (2024)
FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models
by: Guo, Xin, et al.
Published: (2023)
by: Guo, Xin, et al.
Published: (2023)
Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?
by: Fonseca, Marcio, et al.
Published: (2024)
by: Fonseca, Marcio, et al.
Published: (2024)
SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains
by: Ramesh, Krithika, et al.
Published: (2025)
by: Ramesh, Krithika, et al.
Published: (2025)
Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective
by: Vladika, Juraj, et al.
Published: (2024)
by: Vladika, Juraj, et al.
Published: (2024)
On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems
by: Vladika, Juraj, et al.
Published: (2025)
by: Vladika, Juraj, et al.
Published: (2025)
Evaluation of Large Language Models for Summarization Tasks in the Medical Domain: A Narrative Review
by: Croxford, Emma, et al.
Published: (2024)
by: Croxford, Emma, et al.
Published: (2024)
FinEval-KR: A Financial Domain Evaluation Framework for Large Language Models' Knowledge and Reasoning
by: Dou, Shaoyu, et al.
Published: (2025)
by: Dou, Shaoyu, et al.
Published: (2025)
Investigating User Perspectives on Differentially Private Text Privatization
by: Meisenbacher, Stephen, et al.
Published: (2025)
by: Meisenbacher, Stephen, et al.
Published: (2025)
Adapting Large Language Models to Domains via Reading Comprehension
by: Cheng, Daixuan, et al.
Published: (2023)
by: Cheng, Daixuan, et al.
Published: (2023)
MTQ-Eval: Multilingual Text Quality Evaluation for Language Models
by: Pokharel, Rhitabrat, et al.
Published: (2025)
by: Pokharel, Rhitabrat, et al.
Published: (2025)
LLM-as-a-Judge for Privacy Evaluation? Exploring the Alignment of Human and LLM Perceptions of Privacy in Textual Data
by: Meisenbacher, Stephen, et al.
Published: (2025)
by: Meisenbacher, Stephen, et al.
Published: (2025)
An Evaluation of Large Language Models on Text Summarization Tasks Using Prompt Engineering Techniques
by: Aly, Walid Mohamed, et al.
Published: (2025)
by: Aly, Walid Mohamed, et al.
Published: (2025)
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages
by: Kammakomati, Mehant, et al.
Published: (2024)
by: Kammakomati, Mehant, et al.
Published: (2024)
Similar Items
-
German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset
by: Mascarell, Laura, et al.
Published: (2024) -
Can Smaller LLMs do better? Unlocking Cross-Domain Potential through Parameter-Efficient Fine-Tuning for Text Summarization
by: Afzal, Anum, et al.
Published: (2025) -
FActBench: A Benchmark for Fine-grained Automatic Evaluation of LLM-Generated Text in the Medical Domain
by: Afzal, Anum, et al.
Published: (2025) -
JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
by: Afzal, Anum, et al.
Published: (2025) -
Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop
by: Afzal, Anum, et al.
Published: (2024)