:: Library Catalog

Bibliographic Details
Main Author:	Li, Wenxi
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.04887

Similar Items

Pretraining Language Models for Diachronic Linguistic Change Discovery
by: Fittschen, Elisabeth, et al.
Published: (2025)

Towards Linguistically-informed Representations for English as a Second or Foreign Language: Review, Construction and Application
by: Li, Wenxi, et al.
Published: (2026)

DA-Cramming: Enhancing Cost-Effective Language Model Pretraining with Dependency Agreement Integration
by: Kuo, Martin, et al.
Published: (2023)

DatedGPT: Preventing Lookahead Bias in Large Language Models with Time-Aware Pretraining
by: Yan, Yutong, et al.
Published: (2026)

A Comprehensive Evaluation of Semantic Relation Knowledge of Pretrained Language Models and Humans
by: Cao, Zhihan, et al.
Published: (2024)

Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
by: Ghosh, Poulami, et al.
Published: (2024)

Multilingual Knowledge Graph Completion from Pretrained Language Models with Knowledge Constraints
by: Song, Ran, et al.
Published: (2024)

Knowledge of Pretrained Language Models on Surface Information of Tokens
by: Hiraoka, Tatsuya, et al.
Published: (2024)

Do Language Models Encode Knowledge of Linguistic Constraint Violations?
by: Hardy, et al.
Published: (2026)

Large Language Models as Code Executors: An Exploratory Study
by: Lyu, Chenyang, et al.
Published: (2024)

Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model
by: Chen, Xiaolin, et al.
Published: (2022)

Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models
by: Nie, Ercong, et al.
Published: (2024)

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
by: Ruis, Laura, et al.
Published: (2024)

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
by: Gottesman, Daniela, et al.
Published: (2025)

Discovering Knowledge-Critical Subnetworks in Pretrained Language Models
by: Bayazit, Deniz, et al.
Published: (2023)

The American Sign Language Knowledge Graph: Infusing ASL Models with Linguistic Knowledge
by: Kezar, Lee, et al.
Published: (2024)

Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
by: Kim, Jiyeon, et al.
Published: (2024)

IKnow: Instruction-Knowledge-Aware Continual Pretraining for Effective Domain Adaptation
by: Zhang, Tianyi, et al.
Published: (2025)

Demographic and Linguistic Bias Evaluation in Omnimodal Language Models
by: Elobaid, Alaa
Published: (2026)

Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance
by: Ueda, Kentaro, et al.
Published: (2025)

Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models
by: Zhou, Xinyu, et al.
Published: (2024)

Linguistically Informed Evaluation of Multilingual ASR for African Languages
by: Chen, Fei-Yueh, et al.
Published: (2026)

Are Large Language Models Effective Knowledge Graph Constructors?
by: Chen, Ruirui, et al.
Published: (2025)

Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal Contamination
by: Salido, Eva Sánchez, et al.
Published: (2024)

CLEAR: A Comprehensive Linguistic Evaluation of Argument Rewriting by Large Language Models
by: Huber, Thomas, et al.
Published: (2025)

Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings
by: Gao, Lingyu
Published: (2024)

UrBLiMP: A Benchmark for Evaluating the Linguistic Competence of Large Language Models in Urdu
by: Adeeba, Farah, et al.
Published: (2025)

CxMP: A Linguistic Minimal-Pair Benchmark for Evaluating Constructional Understanding in Language Models
by: Oba, Miyu, et al.
Published: (2026)

A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives
by: Li, Zihao, et al.
Published: (2024)

Finding Challenging Metaphors that Confuse Pretrained Language Models
by: Li, Yucheng, et al.
Published: (2024)

Cite Pretrain: Retrieval-Free Knowledge Attribution for Large Language Models
by: Huang, Yukun, et al.
Published: (2025)

Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction
by: Naik, Atharva, et al.
Published: (2024)

Pretraining Language Models Using Translationese
by: Doshi, Meet, et al.
Published: (2024)

Geographic Adaptation of Pretrained Language Models
by: Hofmann, Valentin, et al.
Published: (2022)

Hire a Linguist!: Learning Endangered Languages with In-Context Linguistic Descriptions
by: Zhang, Kexun, et al.
Published: (2024)

How Do Large Language Models Acquire Factual Knowledge During Pretraining?
by: Chang, Hoyeon, et al.
Published: (2024)

No Universal Courtesy: A Cross-Linguistic, Multi-Model Study of Politeness Effects on LLMs Using the PLUM Corpus
by: Mehta, Hitesh, et al.
Published: (2026)

XPERT: Expert Knowledge Transfer for Effective Training of Language Models
by: Liu, Chang, et al.
Published: (2026)

Evaluating and Mitigating Linguistic Discrimination in Large Language Models
by: Dong, Guoliang, et al.
Published: (2024)

Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models
by: Linzbach, Stephan, et al.
Published: (2024)