:: Library Catalog

Bibliographic Details
Main Authors:	Zhang, Ying; Li, Dongyuan; Okumura, Manabu
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2408.01308

Similar Items

Embedding-based In-Context Prompt Training for Enhancing LLMs as Text Encoders
by: Lin, Ailiang, et al.
Published: (2026)

Causal2Vec: Improving Decoder-only LLMs as Embedding Models through a Contextual Token
by: Lin, Ailiang, et al.
Published: (2025)

Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition
by: Li, Dongyuan, et al.
Published: (2024)

Text to Band Gap: Pre-trained Language Models as Encoders for Semiconductor Band Gap Prediction
by: Yeh, Ying-Ting, et al.
Published: (2025)

Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
by: Xin, Yuan, et al.
Published: (2024)

On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation
by: Wu, Di, et al.
Published: (2024)

Can we obtain significant success in RST discourse parsing by using Large Language Models?
by: Maekawa, Aru, et al.
Published: (2024)

InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models
by: Juseon-Do, et al.
Published: (2024)

Length Representations in Large Language Models
by: Moon, Sangjun, et al.
Published: (2025)

Stable Language Model Pre-training by Reducing Embedding Variability
by: Chung, Woojin, et al.
Published: (2024)

DEPT: Decoupled Embeddings for Pre-training Language Models
by: Iacob, Alex, et al.
Published: (2024)

DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation
by: Maekawa, Aru, et al.
Published: (2024)

Understanding Fact Recall in Language Models: Why Two-Stage Training Encourages Memorization but Mixed Training Teaches Knowledge
by: Zhang, Ying, et al.
Published: (2025)

Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model
by: Geng, Yuxia, et al.
Published: (2023)

Automatic Answerability Evaluation for Question Generation
by: Wang, Zifan, et al.
Published: (2023)

Adaptive Pre-training Data Detection for Large Language Models via Surprising Tokens
by: Zhang, Anqi, et al.
Published: (2024)

NITP: Next Implicit Token Prediction for LLM Pre-training
by: Zhang, Xiangdong, et al.
Published: (2026)

Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models
by: Purason, Taido, et al.
Published: (2025)

A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification
by: Calbucura, Nicolas, et al.
Published: (2025)

MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
by: Zhang, Ying, et al.
Published: (2024)

Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation
by: Lyu, Boxuan, et al.
Published: (2024)

Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings
by: Sharma, Kartik, et al.
Published: (2025)

Concept Tokens: Learning Behavioral Embeddings Through Concept Definitions
by: Sastre, Ignacio, et al.
Published: (2026)

Fuzzy Fingerprinting Encoder Pre-trained Language Models for Emotion Recognition in Conversations: Human Assessment and Validity Study
by: Pereira, Patrícia, et al.
Published: (2026)

Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization
by: Zhang, Haode, et al.
Published: (2022)

Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction
by: Nagayama, Kotaro, et al.
Published: (2024)

QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction
by: Wang, Ziliang, et al.
Published: (2025)

On Initializing Transformers with Pre-trained Embeddings
by: Kim, Ha Young, et al.
Published: (2024)

Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study
by: An, Keyu, et al.
Published: (2024)

Cofca: A Step-Wise Counterfactual Multi-hop QA benchmark
by: Wu, Jian, et al.
Published: (2024)

AdParaphrase v2.0: Generating Attractive Ad Texts Using a Preference-Annotated Paraphrase Dataset
by: Murakami, Soichiro, et al.
Published: (2025)

Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
by: Jin, Yang, et al.
Published: (2024)

Masked Structural Growth for 2x Faster Language Model Pre-training
by: Yao, Yiqun, et al.
Published: (2023)

Diversity of Transformer Layers: One Aspect of Parameter Scaling Laws
by: Kamigaito, Hidetaka, et al.
Published: (2025)

Model Merging in Pre-training of Large Language Models
by: Li, Yunshui, et al.
Published: (2025)

Who Laughs with Whom? Disentangling Influential Factors in Humor Preferences across User Clusters and LLMs
by: Murakami, Soichiro, et al.
Published: (2026)

Oogiri-Master: Benchmarking Humor Understanding via Oogiri
by: Murakami, Soichiro, et al.
Published: (2025)

Examining Forgetting in Continual Pre-training of Aligned Large Language Models
by: Li, Chen-An, et al.
Published: (2024)

KoCo: Conditioning Language Model Pre-training on Knowledge Coordinates
by: Li, Yudong, et al.
Published: (2026)

Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models
by: Zhuang, Xinlin, et al.
Published: (2025)