| Main Authors: | Courtois, Martin, Ostendorff, Malte, Hennig, Leonhard, Rehm, Georg |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2406.06366 |
Similar Items
Investigating Gender Bias in Turkish Language Models
by: Caglidil, Orhun Mersin, et al.
Published: (2024)
Reward Modeling with Weak Supervision for Language Models
by: Hauptvogel, Ben, et al.
Published: (2024)
HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure Information
by: Ruan, Qian, et al.
Published: (2022)
Entity Linking using LLMs for Automated Product Carbon Footprint Estimation
by: Castle, Steffen, et al.
Published: (2025)
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data
by: Borisova, Ekaterina, et al.
Published: (2025)
SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing
by: Foppiano, Luca, et al.
Published: (2025)
Multilingual European Language Models: Benchmarking Approaches and Challenges
by: Barth, Fabio, et al.
Published: (2025)
Are Multilingual Language Models an Off-ramp for Under-resourced Languages? Will we arrive at Digital Language Equality in Europe in 2030?
by: Rehm, Georg, et al.
Published: (2025)
PyTorch-IE: Fast and Reproducible Prototyping for Information Extraction
by: Binder, Arne, et al.
Published: (2024)
LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations
by: Wang, Qianli, et al.
Published: (2024)
Positional Attention for Efficient BERT-Based Named Entity Recognition
by: Sun, Mo, et al.
Published: (2025)
UniBERT: Adversarial Training for Language-Universal Representations
by: Avram, Andrei-Marius, et al.
Published: (2025)
How desirable is alignment between LLMs and linguistically diverse human users?
by: Knoeferle, Pia, et al.
Published: (2025)
Analyzing Multi-Head Attention on Trojan BERT Models
by: Wang, Jingwei
Published: (2024)
ClimateCheck 2026: Scientific Fact-Checking and Disinformation Narrative Classification of Climate-related Claims
by: Ahmad, Raia Abu, et al.
Published: (2026)
SwissBERT: The Multilingual Language Model for Switzerland
by: Vamvas, Jannis, et al.
Published: (2023)
Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition
by: Piñeiro-Martín, Andrés, et al.
Published: (2024)
GottBERT: a pure German Language Model
by: Scheible, Raphael, et al.
Published: (2020)
Sliding Window Attention Training for Efficient Large Language Models
by: Fu, Zichuan, et al.
Published: (2025)
KyrgyzBERT: A Compact, Efficient Language Model for Kyrgyz NLP
by: Metinov, Adilet, et al.
Published: (2025)
Extending Translate-Train for ColBERT-X to African Language CLIR
by: Yang, Eugene, et al.
Published: (2024)
ConfliBERT: A Language Model for Political Conflict
by: Brandt, Patrick T., et al.
Published: (2024)
SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation
by: Lv, Changze, et al.
Published: (2023)
Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution
by: Anikina, Tatiana, et al.
Published: (2025)
MEDBERT.de: A Comprehensive German BERT Model for the Medical Domain
by: Bressem, Keno K., et al.
Published: (2023)
Comparative Study of Pre-Trained BERT and Large Language Models for Code-Mixed Named Entity Recognition
by: Shirke, Mayur, et al.
Published: (2025)
Analysis of Argument Structure Constructions in the Large Language Model BERT
by: Ramezani, Pegah, et al.
Published: (2024)
Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax
by: Zaitova, Iuliia, et al.
Published: (2025)
ManufactuBERT: Efficient Continual Pretraining for Manufacturing
by: Armingaud, Robin, et al.
Published: (2025)
BERT-LSH: Reducing Absolute Compute For Attention
by: Li, Zezheng, et al.
Published: (2024)
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
by: Lv, Xingtai, et al.
Published: (2024)
Training-free Context-adaptive Attention for Efficient Long Context Modeling
by: You, Zeng, et al.
Published: (2025)
Data Processing for the OpenGPT-X Model Family
by: Brandizzi, Nicolo', et al.
Published: (2024)
Detecting Bias in Large Language Models: Fine-tuned KcBERT
by: Lee, J. K., et al.
Published: (2024)
CultureBERT: Measuring Corporate Culture With Transformer-Based Language Models
by: Koch, Sebastian, et al.
Published: (2022)
Patent Language Model Pretraining with ModernBERT
by: Yousefiramandi, Amirhossein, et al.
Published: (2025)
BERT-LID: Leveraging BERT to Improve Spoken Language Identification
by: Nie, Yuting, et al.
Published: (2022)
Enhancing BERT Fine-Tuning for Sentiment Analysis in Lower-Resourced Languages
by: Kubík, Jozef, et al.
Published: (2025)
ModernBERT is More Efficient than Conventional BERT for Chest CT Findings Classification in Japanese Radiology Reports
by: Yamagishi, Yosuke, et al.
Published: (2025)
An Encoder-Integrated PhoBERT with Graph Attention for Vietnamese Token-Level Classification
by: Nguyen, Ba-Quang
Published: (2025)