| Main Authors: | Miller, Justin K., Alexander, Tristram J. |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.07278 |
Similar Items
Towards interpretable models for language proficiency assessment: Predicting the CEFR level of Estonian learner texts
by: Allkivi, Kais
Published: (2026)
State space models can express n-gram languages
by: Nandakumar, Vinoth, et al.
Published: (2023)
Inference acceleration for large language models using "stairs" assisted greedy generation
by: Grigaliūnas, Domas, et al.
Published: (2024)
Mechanistic interpretability of large language models with applications to the financial services industry
by: Golgoon, Ashkan, et al.
Published: (2024)
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)
Context-Aware Clustering using Large Language Models
by: Tipirneni, Sindhu, et al.
Published: (2024)
Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting
by: Richter-Pechanski, Phillip, et al.
Published: (2024)
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
by: Aponte, Ryan, et al.
Published: (2024)
Enhancing Traffic Accident Classifications: Application of NLP Methods for City Safety
by: Özeren, Enes, et al.
Published: (2025)
Engineering A Large Language Model From Scratch
by: Oketunji, Abiodun Finbarrs
Published: (2024)
Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers?
by: Weber, Manuel, et al.
Published: (2025)
Human-like fleeting memory improves language learning but impairs reading time prediction in transformer language models
by: Thamma, Abishek, et al.
Published: (2025)
Thread Detection and Response Generation using Transformers with Prompt Optimisation
by: T, Kevin Joshua, et al.
Published: (2024)
Plain language adaptations of biomedical text using LLMs: Comparision of evaluation metrics
by: Kocbek, Primoz, et al.
Published: (2025)
LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance
by: Shi, Jack Wei Lun, et al.
Published: (2026)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
Unleashing the potential of prompt engineering for large language models
by: Chen, Banghao, et al.
Published: (2023)
DAEDRA: A language model for predicting outcomes in passive pharmacovigilance reporting
by: von Csefalvay, Chris
Published: (2024)
When Models Can't Follow: Testing Instruction Adherence Across 256 LLMs
by: Young, Richard J., et al.
Published: (2025)
Representation-Aware Unlearning via Activation Signatures: From Suppression to Entity-Signature Erasure
by: Mahmood, Syed Naveed, et al.
Published: (2026)
$\text{Memory}^3$: Language Modeling with Explicit Memory
by: Yang, Hongkang, et al.
Published: (2024)
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
by: Hanna, Michael, et al.
Published: (2024)
Integrating Expert Labels into LLM-based Emission Goal Detection: Example Selection vs Automatic Prompt Design
by: Wrzalik, Marco, et al.
Published: (2024)
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
by: Aars, Corinne, et al.
Published: (2024)
Scaling Laws for Forgetting When Fine-Tuning Large Language Models
by: Kalajdzievski, Damjan
Published: (2024)
The Role of Language Imbalance in Cross-lingual Generalisation: Insights from Cloned Language Experiments
by: Schäfer, Anton, et al.
Published: (2024)
Ensemble Language Models for Multilingual Sentiment Analysis
by: Hasan, Md Arid
Published: (2024)
Linguistically-Informed Multilingual Instruction Tuning: Is There an Optimal Set of Languages to Tune?
by: Soykan, Gürkan, et al.
Published: (2024)
MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering
by: Ness, Robert Osazuwa, et al.
Published: (2024)
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
by: Luo, Yuqi, et al.
Published: (2024)
$S^3$ -- Semantic Signal Separation
by: Kardos, Márton, et al.
Published: (2024)
MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning
by: Wang, Xujia, et al.
Published: (2024)
Chain and Causal Attention for Efficient Entity Tracking
by: Fagnou, Erwan, et al.
Published: (2024)
Efficient Solutions For An Intriguing Failure of LLMs: Long Context Window Does Not Mean LLMs Can Analyze Long Sequences Flawlessly
by: Hosseini, Peyman, et al.
Published: (2024)
On the Effect of (Near) Duplicate Subwords in Language Modelling
by: Schäfer, Anton, et al.
Published: (2024)
BabyLlama-2: Ensemble-Distilled Models Consistently Outperform Teachers With Limited Data
by: Tastet, Jean-Loup, et al.
Published: (2024)
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs
by: Feucht, Sheridan, et al.
Published: (2024)
The Open Source Advantage in Large Language Models (LLMs)
by: Manchanda, Jiya, et al.
Published: (2024)
MMSciBench: Benchmarking Language Models on Chinese Multimodal Scientific Problems
by: Ye, Xinwu, et al.
Published: (2025)