:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Miller, Justin K., Alexander, Tristram J.
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Machine Learning I.2.7
Online Access:	https://arxiv.org/abs/2405.07278
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards interpretable models for language proficiency assessment: Predicting the CEFR level of Estonian learner texts
by: Allkivi, Kais
Published: (2026)

State space models can express n-gram languages
by: Nandakumar, Vinoth, et al.
Published: (2023)

Inference acceleration for large language models using "stairs" assisted greedy generation
by: Grigaliūnas, Domas, et al.
Published: (2024)

Mechanistic interpretability of large language models with applications to the financial services industry
by: Golgoon, Ashkan, et al.
Published: (2024)

Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)

Context-Aware Clustering using Large Language Models
by: Tipirneni, Sindhu, et al.
Published: (2024)

Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting
by: Richter-Pechanski, Phillip, et al.
Published: (2024)

A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
by: Aponte, Ryan, et al.
Published: (2024)

Enhancing Traffic Accident Classifications: Application of NLP Methods for City Safety
by: Özeren, Enes, et al.
Published: (2025)

Engineering A Large Language Model From Scratch
by: Oketunji, Abiodun Finbarrs
Published: (2024)

Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)

Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers?
by: Weber, Manuel, et al.
Published: (2025)

Human-like fleeting memory improves language learning but impairs reading time prediction in transformer language models
by: Thamma, Abishek, et al.
Published: (2025)

Thread Detection and Response Generation using Transformers with Prompt Optimisation
by: T, Kevin Joshua, et al.
Published: (2024)

Plain language adaptations of biomedical text using LLMs: Comparision of evaluation metrics
by: Kocbek, Primoz, et al.
Published: (2025)

LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance
by: Shi, Jack Wei Lun, et al.
Published: (2026)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)

Unleashing the potential of prompt engineering for large language models
by: Chen, Banghao, et al.
Published: (2023)

DAEDRA: A language model for predicting outcomes in passive pharmacovigilance reporting
by: von Csefalvay, Chris
Published: (2024)

When Models Can't Follow: Testing Instruction Adherence Across 256 LLMs
by: Young, Richard J., et al.
Published: (2025)

Representation-Aware Unlearning via Activation Signatures: From Suppression to Entity-Signature Erasure
by: Mahmood, Syed Naveed, et al.
Published: (2026)

$\text{Memory}^3$: Language Modeling with Explicit Memory
by: Yang, Hongkang, et al.
Published: (2024)

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
by: Hanna, Michael, et al.
Published: (2024)

Integrating Expert Labels into LLM-based Emission Goal Detection: Example Selection vs Automatic Prompt Design
by: Wrzalik, Marco, et al.
Published: (2024)

Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
by: Aars, Corinne, et al.
Published: (2024)

Scaling Laws for Forgetting When Fine-Tuning Large Language Models
by: Kalajdzievski, Damjan
Published: (2024)

The Role of Language Imbalance in Cross-lingual Generalisation: Insights from Cloned Language Experiments
by: Schäfer, Anton, et al.
Published: (2024)

Ensemble Language Models for Multilingual Sentiment Analysis
by: Hasan, Md Arid
Published: (2024)

Linguistically-Informed Multilingual Instruction Tuning: Is There an Optimal Set of Languages to Tune?
by: Soykan, Gürkan, et al.
Published: (2024)

MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering
by: Ness, Robert Osazuwa, et al.
Published: (2024)

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
by: Luo, Yuqi, et al.
Published: (2024)

$S^3$ -- Semantic Signal Separation
by: Kardos, Márton, et al.
Published: (2024)

MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning
by: Wang, Xujia, et al.
Published: (2024)

Chain and Causal Attention for Efficient Entity Tracking
by: Fagnou, Erwan, et al.
Published: (2024)

Efficient Solutions For An Intriguing Failure of LLMs: Long Context Window Does Not Mean LLMs Can Analyze Long Sequences Flawlessly
by: Hosseini, Peyman, et al.
Published: (2024)

On the Effect of (Near) Duplicate Subwords in Language Modelling
by: Schäfer, Anton, et al.
Published: (2024)

BabyLlama-2: Ensemble-Distilled Models Consistently Outperform Teachers With Limited Data
by: Tastet, Jean-Loup, et al.
Published: (2024)

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs
by: Feucht, Sheridan, et al.
Published: (2024)

The Open Source Advantage in Large Language Models (LLMs)
by: Manchanda, Jiya, et al.
Published: (2024)

MMSciBench: Benchmarking Language Models on Chinese Multimodal Scientific Problems
by: Ye, Xinwu, et al.
Published: (2025)