| Main Author: | Li, Wenxi |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.04887 |
Similar Items
Pretraining Language Models for Diachronic Linguistic Change Discovery
by: Fittschen, Elisabeth, et al.
Published: (2025)
Towards Linguistically-informed Representations for English as a Second or Foreign Language: Review, Construction and Application
by: Li, Wenxi, et al.
Published: (2026)
DA-Cramming: Enhancing Cost-Effective Language Model Pretraining with Dependency Agreement Integration
by: Kuo, Martin, et al.
Published: (2023)
DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining
by: Yan, Yutong, et al.
Published: (2026)
A Comprehensive Evaluation of Semantic Relation Knowledge of Pretrained Language Models and Humans
by: Cao, Zhihan, et al.
Published: (2024)
Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
by: Ghosh, Poulami, et al.
Published: (2024)
Multilingual Knowledge Graph Completion from Pretrained Language Models with Knowledge Constraints
by: Song, Ran, et al.
Published: (2024)
Knowledge of Pretrained Language Models on Surface Information of Tokens
by: Hiraoka, Tatsuya, et al.
Published: (2024)
Do Language Models Encode Knowledge of Linguistic Constraint Violations?
by: Hardy, et al.
Published: (2026)
Large Language Models as Code Executors: An Exploratory Study
by: Lyu, Chenyang, et al.
Published: (2024)
Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model
by: Chen, Xiaolin, et al.
Published: (2022)
Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models
by: Nie, Ercong, et al.
Published: (2024)
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
by: Ruis, Laura, et al.
Published: (2024)
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
by: Gottesman, Daniela, et al.
Published: (2025)
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models
by: Bayazit, Deniz, et al.
Published: (2023)
The American Sign Language Knowledge Graph: Infusing ASL Models with Linguistic Knowledge
by: Kezar, Lee, et al.
Published: (2024)
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
by: Kim, Jiyeon, et al.
Published: (2024)
IKnow: Instruction-Knowledge-Aware Continual Pretraining for Effective Domain Adaptation
by: Zhang, Tianyi, et al.
Published: (2025)
Demographic and Linguistic Bias Evaluation in Omnimodal Language Models
by: Elobaid, Alaa
Published: (2026)
Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance
by: Ueda, Kentaro, et al.
Published: (2025)
Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models
by: Zhou, Xinyu, et al.
Published: (2024)
Linguistically Informed Evaluation of Multilingual ASR for African Languages
by: Chen, Fei-Yueh, et al.
Published: (2026)
Are Large Language Models Effective Knowledge Graph Constructors?
by: Chen, Ruirui, et al.
Published: (2025)
Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal Contamination
by: Salido, Eva Sánchez, et al.
Published: (2024)
CLEAR: A Comprehensive Linguistic Evaluation of Argument Rewriting by Large Language Models
by: Huber, Thomas, et al.
Published: (2025)
Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings
by: Gao, Lingyu
Published: (2024)
UrBLiMP: A Benchmark for Evaluating the Linguistic Competence of Large Language Models in Urdu
by: Adeeba, Farah, et al.
Published: (2025)
CxMP: A Linguistic Minimal-Pair Benchmark for Evaluating Constructional Understanding in Language Models
by: Oba, Miyu, et al.
Published: (2026)
A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives
by: Li, Zihao, et al.
Published: (2024)
Finding Challenging Metaphors that Confuse Pretrained Language Models
by: Li, Yucheng, et al.
Published: (2024)
Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models
by: Huang, Yukun, et al.
Published: (2025)
Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction
by: Naik, Atharva, et al.
Published: (2024)
Pretraining Language Models Using Translationese
by: Doshi, Meet, et al.
Published: (2024)
Geographic Adaptation of Pretrained Language Models
by: Hofmann, Valentin, et al.
Published: (2022)
Hire a Linguist!: Learning Endangered Languages with In-Context Linguistic Descriptions
by: Zhang, Kexun, et al.
Published: (2024)
How Do Large Language Models Acquire Factual Knowledge During Pretraining?
by: Chang, Hoyeon, et al.
Published: (2024)
No Universal Courtesy: A Cross-Linguistic, Multi-Model Study of Politeness Effects on LLMs Using the PLUM Corpus
by: Mehta, Hitesh, et al.
Published: (2026)
XPERT: Expert Knowledge Transfer for Effective Training of Language Models
by: Liu, Chang, et al.
Published: (2026)
Evaluating and Mitigating Linguistic Discrimination in Large Language Models
by: Dong, Guoliang, et al.
Published: (2024)
Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models
by: Linzbach, Stephan, et al.
Published: (2024)