| Main Author: | Moez, Catherine |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.00964 |
Similar Items
ScoreRAG: A Retrieval-Augmented Generation Framework with Consistency-Relevance Scoring and Structured Summarization for News Generation
by: Lin, Pei-Yun, et al.
Published: (2025)
Contrasting Linguistic Patterns in Human and LLM-Generated News Text
by: Muñoz-Ortiz, Alberto, et al.
Published: (2023)
NOTAI.AI: Explainable Detection of Machine-Generated Text via Curvature and Feature Attribution
by: Breneur, Oleksandr Marchenko, et al.
Published: (2026)
Comparison of Modern Multilingual Text Embedding Techniques for Hate Speech Detection Task
by: Vaiciukynas, Evaldas, et al.
Published: (2026)
Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents
by: Günther, Michael, et al.
Published: (2023)
Profiling German Text Simplification with Interpretable Model-Fingerprints
by: Klöser, Lars, et al.
Published: (2026)
Relating Word Embedding Gender Biases to Gender Gaps: A Cross-Cultural Analysis
by: Friedman, Scott, et al.
Published: (2026)
Transparent but Powerful: Explainability, Accuracy, and Generalizability in ADHD Detection from Social Media Data
by: Wiechmann, D., et al.
Published: (2024)
A Survey of Text Watermarking in the Era of Large Language Models
by: Liu, Aiwei, et al.
Published: (2023)
Multiplication in Multimodal LLMs: Computation with Text, Image, and Audio Inputs
by: Balter, Samuel G., et al.
Published: (2026)
SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity
by: Kim, Jaemin, et al.
Published: (2024)
Co-NAML-LSTUR: A Combined Model with Attentive Multi-View Learning and Long- and Short-term User Representations for News Recommendation
by: Nguyen, Minh Hoang, et al.
Published: (2025)
Prior-based Noisy Text Data Filtering: Fast and Strong Alternative For Perplexity
by: Seo, Yeongbin, et al.
Published: (2025)
Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5
by: Nguyen, Minh Hoang, et al.
Published: (2025)
PaperAudit-Bench: Benchmarking Error Detection in Research Papers for Critical Automated Peer Review
by: Tu, Songjun, et al.
Published: (2026)
Technical Report on the Pangram AI-Generated Text Classifier
by: Emi, Bradley, et al.
Published: (2024)
Make Literature-Based Discovery Great Again through Reproducible Pipelines
by: Cestnik, Bojan, et al.
Published: (2025)
Understanding the Effects of RLHF on the Quality and Detectability of LLM-Generated Texts
by: Xu, Beining, et al.
Published: (2025)
A Survey on Natural Language Counterfactual Generation
by: Wang, Yongjie, et al.
Published: (2024)
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models
by: Günther, Michael, et al.
Published: (2024)
Adaptive Steering and Remasking for Safe Generation in Diffusion Language Models
by: Lee, Yejin, et al.
Published: (2026)
When Retrieval Succeeds and Fails: Rethinking Retrieval-Augmented Generation for LLMs
by: Wang, Yongjie, et al.
Published: (2025)
From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation
by: Oduwole, Mardiyyah, et al.
Published: (2025)
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey
by: Fang, Xi, et al.
Published: (2024)
Culturally-Nuanced Story Generation for Reasoning in Low-Resource Languages: The Case of Javanese and Sundanese
by: Pranida, Salsabila Zahirah, et al.
Published: (2025)
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts
by: Elahi, Kazi Toufique, et al.
Published: (2024)
Mitigating Position-Shift Failures in Text-Based Modular Arithmetic via Position Curriculum and Template Diversity
by: Yudin, Nikolay
Published: (2026)
Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings
by: Mohr, Isabelle, et al.
Published: (2024)
A-VERT: Agnostic Verification with Embedding Ranking Targets
by: Aguirre, Nicolás, et al.
Published: (2025)
ADE: Adaptive Dictionary Embeddings -- Scaling Multi-Anchor Representations to Large Language Models
by: Demirci, Orhan, et al.
Published: (2026)
Efficient Code Embeddings from Code Generation Models
by: Kryvosheieva, Daria, et al.
Published: (2025)
Towards Probabilistic Question Answering Over Tabular Data
by: Shen, Chen, et al.
Published: (2025)
Retrieval-Based Multi-Label Legal Annotation: Extensible, Data-Efficient and Hallucination-Free
by: Zhang, Li, et al.
Published: (2026)
The Knesset Corpus: An Annotated Corpus of Hebrew Parliamentary Proceedings
by: Goldin, Gili, et al.
Published: (2024)
Towards Effective and Efficient Continual Pre-training of Large Language Models
by: Chen, Jie, et al.
Published: (2024)
The Unlikely Duel: Evaluating Creative Writing in LLMs through a Unique Scenario
by: Gómez-Rodríguez, Carlos, et al.
Published: (2024)
Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
by: Liu, Aiwei, et al.
Published: (2024)
The Superalignment of Superhuman Intelligence with Large Language Models
by: Huang, Minlie, et al.
Published: (2024)
Train-Attention: Meta-Learning Where to Focus in Continual Knowledge Learning
by: Seo, Yeongbin, et al.
Published: (2024)
Experimentation in Content Moderation using RWKV
by: Yildirim, Umut, et al.
Published: (2024)