| Main Authors: | Czinczoll, Tamara; Hönes, Christoph; Schall, Maximilian; de Melo, Gerard |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.17682 |
Similar Items
CommitBench: A Benchmark for Commit Message Generation
by: Schall, Maximilian, et al.
Published: (2024)
Query-Level Uncertainty in Large Language Models
by: Chen, Lihu, et al.
Published: (2025)
Learning to Predict Usage Options of Product Reviews with LLM-Generated Labels
by: Kohlenberg, Leo, et al.
Published: (2024)
NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning
by: Yi, Xin, et al.
Published: (2024)
TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning
by: von Klinski, Maximilian, et al.
Published: (2026)
ChuLo: Chunk-Level Key Information Representation for Long Document Understanding
by: Li, Yan, et al.
Published: (2024)
GraphLSS: Integrating Lexical, Structural, and Semantic Features for Long Document Extractive Summarization
by: Bugueño, Margarita, et al.
Published: (2024)
NeoBERT: A Next-Generation BERT
by: Breton, Lola Le, et al.
Published: (2025)
MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations
by: Yadav, Hemant, et al.
Published: (2024)
Connecting the Dots: What Graph-Based Text Representations Work Best for Text Classification Using Graph Neural Networks?
by: Bugueño, Margarita, et al.
Published: (2023)
Rethinking Graph-Based Document Classification: Learning Data-Driven Structures Beyond Heuristic Approaches
by: Bugueño, Margarita, et al.
Published: (2025)
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining
by: Liang, Wen, et al.
Published: (2024)
Adapting Large Language Models for Document-Level Machine Translation
by: Wu, Minghao, et al.
Published: (2024)
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT
by: Cho, Cheol Jun, et al.
Published: (2023)
Are Expert-Level Language Models Expert-Level Annotators?
by: Tseng, Yu-Min, et al.
Published: (2024)
FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models
by: Dobler, Konstantin, et al.
Published: (2023)
General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level
by: Shi, Bingkang, et al.
Published: (2023)
An Encoder-Integrated PhoBERT with Graph Attention for Vietnamese Token-Level Classification
by: Nguyen, Ba-Quang
Published: (2025)
UniBERT: Adversarial Training for Language-Universal Representations
by: Avram, Andrei-Marius, et al.
Published: (2025)
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation
by: Ramos, Miguel Moura, et al.
Published: (2025)
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
by: Shao, Chenze, et al.
Published: (2024)
Representation Deficiency in Masked Language Modeling
by: Meng, Yu, et al.
Published: (2023)
Efficient Parallelization Layouts for Large-Scale Distributed Model Training
by: Hagemann, Johannes, et al.
Published: (2023)
DOREMI: Optimizing Long Tail Predictions in Document-Level Relation Extraction
by: Menotti, Laura, et al.
Published: (2026)
Language Adaptation on a Tight Academic Compute Budget: Tokenizer Swapping Works and Pure bfloat16 Is Enough
by: Dobler, Konstantin, et al.
Published: (2024)
ANGO: A Next-Level Evaluation Benchmark For Generation-Oriented Language Models In Chinese Domain
by: Wang, Bingchao
Published: (2024)
Chinese ModernBERT with Whole-Word Masking
by: Zhao, Zeyu, et al.
Published: (2025)
AutoRE: Document-Level Relation Extraction with Large Language Models
by: Xue, Lilong, et al.
Published: (2024)
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers
by: Clavié, Benjamin, et al.
Published: (2025)
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts
by: Su, Zhenpeng, et al.
Published: (2024)
Within-Document Event Coreference with BERT-Based Contextualized Representations
by: Ahmed, Shafiuddin Rehan, et al.
Published: (2021)
ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists
by: Ruan, Jie, et al.
Published: (2025)
Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT
by: Yamauchi, Kazuki, et al.
Published: (2024)
Efficiently Exploring Large Language Models for Document-Level Machine Translation with In-context Learning
by: Cui, Menglong, et al.
Published: (2024)
Mask-guided BERT for Few Shot Text Classification
by: Liao, Wenxiong, et al.
Published: (2023)
K-Level Reasoning: Establishing Higher Order Beliefs in Large Language Models for Strategic Reasoning
by: Zhang, Yadong, et al.
Published: (2024)
Shaping Human-AI Collaboration: Varied Scaffolding Levels in Co-writing with Language Models
by: Dhillon, Paramveer S., et al.
Published: (2024)
MaBERT: A Padding Safe Interleaved Transformer Mamba Hybrid Encoder for Efficient Extended Context Masked Language Modeling
by: Kim, Jinwoong, et al.
Published: (2026)
Enhancing Next-Generation Language Models with Knowledge Graphs: Extending Claude, Mistral IA, and GPT-4 via KG-BERT
by: Chaabene, Nour El Houda Ben, et al.
Published: (2025)
SwissBERT: The Multilingual Language Model for Switzerland
by: Vamvas, Jannis, et al.
Published: (2023)