:: Library Catalog

Bibliographic Details
Main Authors:	Osuji, Chinonso Cynthia; Ferreira, Thiago Castro; Davis, Brian
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2402.08496

Similar Items

Long-context Reference-based MT Quality Estimation
by: Haq, Sami Ul, et al.
Published: (2025)

Enhancing Text Generation in Joint NLG/NLU Learning Through Curriculum Learning, Semi-Supervised Training, and Advanced Optimization Techniques
by: Shaik, Rahimanuddin, et al.
Published: (2024)

Direct-Scoring NLG Evaluators Can Use Pairwise Comparisons Too
by: Lawrence, Logan, et al.
Published: (2025)

Synthetic vs. Gold: The Role of LLM Generated Labels and Data in Cyberbullying Detection
by: Kazemi, Arefeh, et al.
Published: (2025)

Systematic Literature Review: Computational Approaches for Humour Style Classification
by: Kenneth, Mary Ogbuka, et al.
Published: (2024)

A Sea of Words: An In-Depth Analysis of Anchors for Text Data
by: Lopardo, Gianluigi, et al.
Published: (2022)

Text2Data: Low-Resource Data Generation with Textual Control
by: Wang, Shiyu, et al.
Published: (2024)

A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
by: Laskar, Md Tahmid Rahman, et al.
Published: (2024)

Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data
by: Kim, Yubin, et al.
Published: (2024)

How to Synthesize Text Data without Model Collapse?
by: Zhu, Xuekai, et al.
Published: (2024)

Enhance Multi-domain Sentiment Analysis of Review Texts through Prompting Strategies
by: Wang, Yajing, et al.
Published: (2023)

Text Detoxification: Data Efficiency, Semantic Preservation and Model Generalization
by: Yu, Jing, et al.
Published: (2025)

Comparing Feature Importance and Rule Extraction for Interpretability on Text Data
by: Lopardo, Gianluigi, et al.
Published: (2022)

The Human Factor in Detecting Errors of Large Language Models: A Systematic Literature Review and Future Research Directions
by: Schiller, Christian A.
Published: (2024)

Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts
by: Zhang, Yifan, et al.
Published: (2024)

How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data
by: Niklaus, Joel, et al.
Published: (2026)

Factual Inconsistency in Data-to-Text Generation Scales Exponentially with LLM Size: A Statistical Validation
by: Mahapatra, Joy, et al.
Published: (2025)

Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls
by: Kang, Feiyang, et al.
Published: (2025)

You Can Generate It Again: Data-to-Text Generation with Verification and Correction Prompting
by: Ren, Xuan, et al.
Published: (2023)

On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices
by: Pecher, Branislav, et al.
Published: (2024)

Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
by: Idrissi-Yaghir, Ahmad, et al.
Published: (2024)

Computational Measurement of Political Positions: A Review of Text-Based Ideal Point Estimation Algorithms
by: Parschan, Patrick, et al.
Published: (2025)

TextGrad: Automatic "Differentiation" via Text
by: Yuksekgonul, Mert, et al.
Published: (2024)

Why mask diffusion does not work
by: Sun, Haocheng, et al.
Published: (2025)

Generative Artificial Intelligence: A Systematic Review and Applications
by: Sengar, Sandeep Singh, et al.
Published: (2024)

Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation
by: Lu, Yen-Ju, et al.
Published: (2025)

Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data
by: Guo, Yuting, et al.
Published: (2024)

Efficient Systematic Reviews: Literature Filtering with Transformers & Transfer Learning
by: Hawkins, John, et al.
Published: (2024)

IndiText Boost: Text Augmentation for Low Resource India Languages
by: Litake, Onkar, et al.
Published: (2024)

Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding
by: Xiao, Feng, et al.
Published: (2025)

BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation
by: Zhu, Alan, et al.
Published: (2025)

A Systematic Survey on Large Language Models for Algorithm Design
by: Liu, Fei, et al.
Published: (2024)

A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
by: Eisape, Tiwalayo, et al.
Published: (2023)

Text2Freq: Learning Series Patterns from Text via Frequency Domain
by: Lo, Ming-Chih, et al.
Published: (2024)

Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain
by: García-Ferrero, Iker, et al.
Published: (2024)

TextReg: Mitigating Prompt Distributional Overfitting via Regularized Text-Space Optimization
by: Fu, Lucheng, et al.
Published: (2026)

Text Diffusion with Reinforced Conditioning
by: Liu, Yuxuan, et al.
Published: (2024)

Rep2Text: Decoding Full Text from a Single LLM Token Representation
by: Zhao, Haiyan, et al.
Published: (2025)

A General Framework for Producing Interpretable Semantic Text Embeddings
by: Sun, Yiqun, et al.
Published: (2024)

Text Classification Under Class Distribution Shift: A Survey
by: Costache, Adriana Valentina, et al.
Published: (2025)