| Main Authors: | Kardos, Márton, Kostkan, Jan, Vermillet, Arnault-Quentin, Nielbo, Kristoffer, Enevoldsen, Kenneth, Rocca, Roberta |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.09556 |
Similar Items
Topeax -- An Improved Clustering Topic Model with Density Peak Detection and Lexical-Semantic Term Importance
by: Kardos, Márton
Published: (2026)
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
by: Nielsen, Dan Saattrup, et al.
Published: (2024)
Resonant Context Anchoring: Decoupling Attention Routing and Signal Gain at Inference Time
by: Zhao, Mingkuan, et al.
Published: (2026)
Geometric Deviation as an Unsupervised Pre-Generation Reliability Signal: Probing LLM Representations for Answerability
by: Du, Yucheng
Published: (2026)
Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
by: Oketunji, Abiodun Finbarrs
Published: (2023)
topicwizard -- a Modern, Model-agnostic Framework for Topic Model Visualization and Interpretation
by: Kardos, Márton, et al.
Published: (2025)
Improving the Capabilities of Large Language Model Based Marketing Analytics Copilots With Semantic Search And Fine-Tuning
by: Gao, Yilin, et al.
Published: (2024)
Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
by: Walker, Nicholas
Published: (2024)
Path-Lock Expert: Separating Reasoning Mode in Hybrid Thinking via Architecture-Level Separation
by: Wang, Shouren, et al.
Published: (2026)
Large Language Model (LLM) Bias Index -- LLMBI
by: Oketunji, Abiodun Finbarrs, et al.
Published: (2023)
Engineering A Large Language Model From Scratch
by: Oketunji, Abiodun Finbarrs
Published: (2024)
GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge
by: Dugan, Liam, et al.
Published: (2025)
Weakly Supervised Distillation of Hallucination Signals into Transformer Representations
by: Salehmohamed, Shoaib Sadiq, et al.
Published: (2026)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
by: Xu, Shuyao, et al.
Published: (2025)
Semantic Convergence: Investigating Shared Representations Across Scaled LLMs
by: Son, Daniel, et al.
Published: (2025)
In-Context Fixation: When Demonstrated Labels Override Semantics in Few-Shot Classification
by: Liu, Ming
Published: (2026)
Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks
by: Yıldırım, Alper, et al.
Published: (2025)
MMSciBench: Benchmarking Language Models on Chinese Multimodal Scientific Problems
by: Ye, Xinwu, et al.
Published: (2025)
Influence-driven Curriculum Learning for Pre-training on Limited Data
by: Schoenegger, Loris, et al.
Published: (2025)
Towards Latent Diffusion Suitable For Text
by: Midavaine, Nesta, et al.
Published: (2026)
SpectralLoRA: Is Low-Frequency Structure Sufficient for LoRA Adaptation? A Spectral Analysis of Weight Updates
by: Singh, Rajveer
Published: (2026)
BLP-2023 Task 2: Sentiment Analysis
by: Hasan, Md. Arid, et al.
Published: (2023)
FIM-LoRA: Task-Informative Rank Allocation for LoRA via Calibration-Time Gradient-Variance Estimation
by: Sathyavageeswaran, Ramakrishnan
Published: (2026)
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms
by: Hanna, Michael, et al.
Published: (2024)
Integrating Expert Labels into LLM-based Emission Goal Detection: Example Selection vs Automatic Prompt Design
by: Wrzalik, Marco, et al.
Published: (2024)
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
by: Zhang, Zhaowei, et al.
Published: (2025)
Where Should LoRA Go? Component-Type Placement in Hybrid Language Models
by: Borobia, Hector, et al.
Published: (2026)
Interpreto: An Explainability Library for Transformers
by: Poché, Antonin, et al.
Published: (2025)
RightNow-Arabic-0.5B-Turbo: An Open Sub-1B Arabic Language Model via Vocabulary Injection and Edge-First Deployment
by: Jaber, Jaber, et al.
Published: (2026)
Structured-Sparse Attention for Entity Tracking with Subquadratic Sequence Complexity
by: Zhao, Hangyue, et al.
Published: (2026)
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
by: Aars, Corinne, et al.
Published: (2024)
WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training
by: Tian, Changxin, et al.
Published: (2025)
PowLU: An Activation Function for Stable Pre-Training of LLMs
by: Jiang, Peijie, et al.
Published: (2026)
Reconstructing Syllable Sequences in Abugida Scripts with Incomplete Inputs
by: Thu, Ye Kyaw, et al.
Published: (2025)
Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)
Scaling Laws for Forgetting When Fine-Tuning Large Language Models
by: Kalajdzievski, Damjan
Published: (2024)
Structure-Guided Entity Resolution: Fine-Tuning LLMs for Robust Name Matching in Complex Linguistic Contexts
by: Chourasia, Shivam, et al.
Published: (2026)
Lon-ea at SemEval-2023 Task 11: A Comparison of Activation Functions for Soft and Hard Label Prediction
by: Hosseini, Peyman, et al.
Published: (2023)
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
by: Aponte, Ryan, et al.
Published: (2024)