:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kardos, Márton, Kostkan, Jan, Vermillet, Arnault-Quentin, Nielbo, Kristoffer, Enevoldsen, Kenneth, Rocca, Roberta
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Computation and Language I.2.7
Online Access:	https://arxiv.org/abs/2406.09556
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Topeax -- An Improved Clustering Topic Model with Density Peak Detection and Lexical-Semantic Term Importance
by: Kardos, Márton
Published: (2026)

Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
by: Nielsen, Dan Saattrup, et al.
Published: (2024)

Resonant Context Anchoring: Decoupling Attention Routing and Signal Gain at Inference Time
by: Zhao, Mingkuan, et al.
Published: (2026)

Geometric Deviation as an Unsupervised Pre-Generation Reliability Signal: Probing LLM Representations for Answerability
by: Du, Yucheng
Published: (2026)

Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)

topicwizard -- a Modern, Model-agnostic Framework for Topic Model Visualization and Interpretation
by: Kardos, Márton, et al.
Published: (2025)

Improving the Capabilities of Large Language Model Based Marketing Analytics Copilots With Semantic Search And Fine-Tuning
by: Gao, Yilin, et al.
Published: (2024)

Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
by: Walker, Nicholas
Published: (2024)

Path-Lock Expert: Separating Reasoning Mode in Hybrid Thinking via Architecture-Level Separation
by: Wang, Shouren, et al.
Published: (2026)

Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)

Engineering A Large Language Model From Scratch
by: Oketunji, Abiodun Finbarrs
Published: (2024)

GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge
by: Dugan, Liam, et al.
Published: (2025)

Weakly Supervised Distillation of Hallucination Signals into Transformer Representations
by: Salehmohamed, Shoaib Sadiq, et al.
Published: (2026)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
by: Xu, Shuyao, et al.
Published: (2025)

Semantic Convergence: Investigating Shared Representations Across Scaled LLMs
by: Son, Daniel, et al.
Published: (2025)

In-Context Fixation: When Demonstrated Labels Override Semantics in Few-Shot Classification
by: Liu, Ming
Published: (2026)

Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks
by: Yıldırım, Alper, et al.
Published: (2025)

MMSciBench: Benchmarking Language Models on Chinese Multimodal Scientific Problems
by: Ye, Xinwu, et al.
Published: (2025)

Influence-driven Curriculum Learning for Pre-training on Limited Data
by: Schoenegger, Loris, et al.
Published: (2025)

Towards Latent Diffusion Suitable For Text
by: Midavaine, Nesta, et al.
Published: (2026)

SpectralLoRA: Is Low-Frequency Structure Sufficient for LoRA Adaptation? A Spectral Analysis of Weight Updates
by: Singh, Rajveer
Published: (2026)

BLP-2023 Task 2: Sentiment Analysis
by: Hasan, Md. Arid, et al.
Published: (2023)

FIM-LoRA: Task-Informative Rank Allocation for LoRA via Calibration-Time Gradient-Variance Estimation
by: Sathyavageeswaran, Ramakrishnan
Published: (2026)

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
by: Hanna, Michael, et al.
Published: (2024)

Integrating Expert Labels into LLM-based Emission Goal Detection: Example Selection vs Automatic Prompt Design
by: Wrzalik, Marco, et al.
Published: (2024)

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
by: Zhang, Zhaowei, et al.
Published: (2025)

Where Should LoRA Go? Component-Type Placement in Hybrid Language Models
by: Borobia, Hector, et al.
Published: (2026)

Interpreto: An Explainability Library for Transformers
by: Poché, Antonin, et al.
Published: (2025)

RightNow-Arabic-0.5B-Turbo: An Open Sub-1B Arabic Language Model via Vocabulary Injection and Edge-First Deployment
by: Jaber, Jaber, et al.
Published: (2026)

Structured-Sparse Attention for Entity Tracking with Subquadratic Sequence Complexity
by: Zhao, Hangyue, et al.
Published: (2026)

Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
by: Aars, Corinne, et al.
Published: (2024)

WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training
by: Tian, Changxin, et al.
Published: (2025)

PowLU: An Activation Function for Stable Pre-Training of LLMs
by: Jiang, Peijie, et al.
Published: (2026)

Reconstructing Syllable Sequences in Abugida Scripts with Incomplete Inputs
by: Thu, Ye Kyaw, et al.
Published: (2025)

Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)

Scaling Laws for Forgetting When Fine-Tuning Large Language Models
by: Kalajdzievski, Damjan
Published: (2024)

Structure-Guided Entity Resolution: Fine-Tuning LLMs for Robust Name Matching in Complex Linguistic Contexts
by: Chourasia, Shivam, et al.
Published: (2026)

Lon-ea at SemEval-2023 Task 11: A Comparison of Activation Functions for Soft and Hard Label Prediction
by: Hosseini, Peyman, et al.
Published: (2023)

A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
by: Aponte, Ryan, et al.
Published: (2024)