Saved in:
| Main Authors: | Osuji, Chinonso Cynthia, Ferreira, Thiago Castro, Davis, Brian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.08496 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Long-context Reference-based MT Quality Estimation
by: Haq, Sami Ul, et al.
Published: (2025)
by: Haq, Sami Ul, et al.
Published: (2025)
Enhancing Text Generation in Joint NLG/NLU Learning Through Curriculum Learning, Semi-Supervised Training, and Advanced Optimization Techniques
by: Shaik, Rahimanuddin, et al.
Published: (2024)
by: Shaik, Rahimanuddin, et al.
Published: (2024)
Direct-Scoring NLG Evaluators Can Use Pairwise Comparisons Too
by: Lawrence, Logan, et al.
Published: (2025)
by: Lawrence, Logan, et al.
Published: (2025)
Synthetic vs. Gold: The Role of LLM Generated Labels and Data in Cyberbullying Detection
by: Kazemi, Arefeh, et al.
Published: (2025)
by: Kazemi, Arefeh, et al.
Published: (2025)
Systematic Literature Review: Computational Approaches for Humour Style Classification
by: Kenneth, Mary Ogbuka, et al.
Published: (2024)
by: Kenneth, Mary Ogbuka, et al.
Published: (2024)
A Sea of Words: An In-Depth Analysis of Anchors for Text Data
by: Lopardo, Gianluigi, et al.
Published: (2022)
by: Lopardo, Gianluigi, et al.
Published: (2022)
Text2Data: Low-Resource Data Generation with Textual Control
by: Wang, Shiyu, et al.
Published: (2024)
by: Wang, Shiyu, et al.
Published: (2024)
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
by: Laskar, Md Tahmid Rahman, et al.
Published: (2024)
by: Laskar, Md Tahmid Rahman, et al.
Published: (2024)
Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data
by: Kim, Yubin, et al.
Published: (2024)
by: Kim, Yubin, et al.
Published: (2024)
How to Synthesize Text Data without Model Collapse?
by: Zhu, Xuekai, et al.
Published: (2024)
by: Zhu, Xuekai, et al.
Published: (2024)
Enhance Multi-domain Sentiment Analysis of Review Texts through Prompting Strategies
by: Wang, Yajing, et al.
Published: (2023)
by: Wang, Yajing, et al.
Published: (2023)
Text Detoxification: Data Efficiency, Semantic Preservation and Model Generalization
by: Yu, Jing, et al.
Published: (2025)
by: Yu, Jing, et al.
Published: (2025)
Comparing Feature Importance and Rule Extraction for Interpretability on Text Data
by: Lopardo, Gianluigi, et al.
Published: (2022)
by: Lopardo, Gianluigi, et al.
Published: (2022)
The Human Factor in Detecting Errors of Large Language Models: A Systematic Literature Review and Future Research Directions
by: Schiller, Christian A.
Published: (2024)
by: Schiller, Christian A.
Published: (2024)
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts
by: Zhang, Yifan, et al.
Published: (2024)
by: Zhang, Yifan, et al.
Published: (2024)
How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data
by: Niklaus, Joel, et al.
Published: (2026)
by: Niklaus, Joel, et al.
Published: (2026)
Factual Inconsistency in Data-to-Text Generation Scales Exponentially with LLM Size: A Statistical Validation
by: Mahapatra, Joy, et al.
Published: (2025)
by: Mahapatra, Joy, et al.
Published: (2025)
Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls
by: Kang, Feiyang, et al.
Published: (2025)
by: Kang, Feiyang, et al.
Published: (2025)
You Can Generate It Again: Data-to-Text Generation with Verification and Correction Prompting
by: Ren, Xuan, et al.
Published: (2023)
by: Ren, Xuan, et al.
Published: (2023)
On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices
by: Pecher, Branislav, et al.
Published: (2024)
by: Pecher, Branislav, et al.
Published: (2024)
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
by: Idrissi-Yaghir, Ahmad, et al.
Published: (2024)
by: Idrissi-Yaghir, Ahmad, et al.
Published: (2024)
Computational Measurement of Political Positions: A Review of Text-Based Ideal Point Estimation Algorithms
by: Parschan, Patrick, et al.
Published: (2025)
by: Parschan, Patrick, et al.
Published: (2025)
TextGrad: Automatic "Differentiation" via Text
by: Yuksekgonul, Mert, et al.
Published: (2024)
by: Yuksekgonul, Mert, et al.
Published: (2024)
Why mask diffusion does not work
by: Sun, Haocheng, et al.
Published: (2025)
by: Sun, Haocheng, et al.
Published: (2025)
Generative Artificial Intelligence: A Systematic Review and Applications
by: Sengar, Sandeep Singh, et al.
Published: (2024)
by: Sengar, Sandeep Singh, et al.
Published: (2024)
Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation
by: Lu, Yen-Ju, et al.
Published: (2025)
by: Lu, Yen-Ju, et al.
Published: (2025)
Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data
by: Guo, Yuting, et al.
Published: (2024)
by: Guo, Yuting, et al.
Published: (2024)
Efficient Systematic Reviews: Literature Filtering with Transformers & Transfer Learning
by: Hawkins, John, et al.
Published: (2024)
by: Hawkins, John, et al.
Published: (2024)
IndiText Boost: Text Augmentation for Low Resource India Languages
by: Litake, Onkar, et al.
Published: (2024)
by: Litake, Onkar, et al.
Published: (2024)
Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding
by: Xiao, Feng, et al.
Published: (2025)
by: Xiao, Feng, et al.
Published: (2025)
BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation
by: Zhu, Alan, et al.
Published: (2025)
by: Zhu, Alan, et al.
Published: (2025)
A Systematic Survey on Large Language Models for Algorithm Design
by: Liu, Fei, et al.
Published: (2024)
by: Liu, Fei, et al.
Published: (2024)
A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
by: Eisape, Tiwalayo, et al.
Published: (2023)
by: Eisape, Tiwalayo, et al.
Published: (2023)
Text2Freq: Learning Series Patterns from Text via Frequency Domain
by: Lo, Ming-Chih, et al.
Published: (2024)
by: Lo, Ming-Chih, et al.
Published: (2024)
Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain
by: García-Ferrero, Iker, et al.
Published: (2024)
by: García-Ferrero, Iker, et al.
Published: (2024)
TextReg: Mitigating Prompt Distributional Overfitting via Regularized Text-Space Optimization
by: Fu, Lucheng, et al.
Published: (2026)
by: Fu, Lucheng, et al.
Published: (2026)
Text Diffusion with Reinforced Conditioning
by: Liu, Yuxuan, et al.
Published: (2024)
by: Liu, Yuxuan, et al.
Published: (2024)
Rep2Text: Decoding Full Text from a Single LLM Token Representation
by: Zhao, Haiyan, et al.
Published: (2025)
by: Zhao, Haiyan, et al.
Published: (2025)
A General Framework for Producing Interpretable Semantic Text Embeddings
by: Sun, Yiqun, et al.
Published: (2024)
by: Sun, Yiqun, et al.
Published: (2024)
Text Classification Under Class Distribution Shift: A Survey
by: Costache, Adriana Valentina, et al.
Published: (2025)
by: Costache, Adriana Valentina, et al.
Published: (2025)
Similar Items
-
Long-context Reference-based MT Quality Estimation
by: Haq, Sami Ul, et al.
Published: (2025) -
Enhancing Text Generation in Joint NLG/NLU Learning Through Curriculum Learning, Semi-Supervised Training, and Advanced Optimization Techniques
by: Shaik, Rahimanuddin, et al.
Published: (2024) -
Direct-Scoring NLG Evaluators Can Use Pairwise Comparisons Too
by: Lawrence, Logan, et al.
Published: (2025) -
Synthetic vs. Gold: The Role of LLM Generated Labels and Data in Cyberbullying Detection
by: Kazemi, Arefeh, et al.
Published: (2025) -
Systematic Literature Review: Computational Approaches for Humour Style Classification
by: Kenneth, Mary Ogbuka, et al.
Published: (2024)