| Main Authors: | Zhang, Ying, Li, Dongyuan, Okumura, Manabu |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.01308 |
Similar Items
Embedding-based In-Context Prompt Training for Enhancing LLMs as Text Encoders
by: Lin, Ailiang, et al.
Published: (2026)
Causal2Vec: Improving Decoder-only LLMs as Embedding Models through a Contextual Token
by: Lin, Ailiang, et al.
Published: (2025)
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition
by: Li, Dongyuan, et al.
Published: (2024)
Text to Band Gap: Pre-trained Language Models as Encoders for Semiconductor Band Gap Prediction
by: Yeh, Ying-Ting, et al.
Published: (2025)
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
by: Xin, Yuan, et al.
Published: (2024)
On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation
by: Wu, Di, et al.
Published: (2024)
Can we obtain significant success in RST discourse parsing by using Large Language Models?
by: Maekawa, Aru, et al.
Published: (2024)
InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models
by: Juseon-Do, et al.
Published: (2024)
Length Representations in Large Language Models
by: Moon, Sangjun, et al.
Published: (2025)
Stable Language Model Pre-training by Reducing Embedding Variability
by: Chung, Woojin, et al.
Published: (2024)
DEPT: Decoupled Embeddings for Pre-training Language Models
by: Iacob, Alex, et al.
Published: (2024)
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation
by: Maekawa, Aru, et al.
Published: (2024)
Understanding Fact Recall in Language Models: Why Two-Stage Training Encourages Memorization but Mixed Training Teaches Knowledge
by: Zhang, Ying, et al.
Published: (2025)
Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model
by: Geng, Yuxia, et al.
Published: (2023)
Automatic Answerability Evaluation for Question Generation
by: Wang, Zifan, et al.
Published: (2023)
Adaptive Pre-training Data Detection for Large Language Models via Surprising Tokens
by: Zhang, Anqi, et al.
Published: (2024)
NITP: Next Implicit Token Prediction for LLM Pre-training
by: Zhang, Xiangdong, et al.
Published: (2026)
Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models
by: Purason, Taido, et al.
Published: (2025)
A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification
by: Calbucura, Nicolas, et al.
Published: (2025)
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
by: Zhang, Ying, et al.
Published: (2024)
Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation
by: Lyu, Boxuan, et al.
Published: (2024)
Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings
by: Sharma, Kartik, et al.
Published: (2025)
Concept Tokens: Learning Behavioral Embeddings Through Concept Definitions
by: Sastre, Ignacio, et al.
Published: (2026)
Fuzzy Fingerprinting Encoder Pre-trained Language Models for Emotion Recognition in Conversations: Human Assessment and Validity Study
by: Pereira, Patrícia, et al.
Published: (2026)
Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization
by: Zhang, Haode, et al.
Published: (2022)
Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction
by: Nagayama, Kotaro, et al.
Published: (2024)
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction
by: Wang, Ziliang, et al.
Published: (2025)
On Initializing Transformers with Pre-trained Embeddings
by: Kim, Ha Young, et al.
Published: (2024)
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study
by: An, Keyu, et al.
Published: (2024)
Cofca: A Step-Wise Counterfactual Multi-hop QA benchmark
by: Wu, Jian, et al.
Published: (2024)
AdParaphrase v2.0: Generating Attractive Ad Texts Using a Preference-Annotated Paraphrase Dataset
by: Murakami, Soichiro, et al.
Published: (2025)
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
by: Jin, Yang, et al.
Published: (2024)
Masked Structural Growth for 2x Faster Language Model Pre-training
by: Yao, Yiqun, et al.
Published: (2023)
Diversity of Transformer Layers: One Aspect of Parameter Scaling Laws
by: Kamigaito, Hidetaka, et al.
Published: (2025)
Model Merging in Pre-training of Large Language Models
by: Li, Yunshui, et al.
Published: (2025)
Who Laughs with Whom? Disentangling Influential Factors in Humor Preferences across User Clusters and LLMs
by: Murakami, Soichiro, et al.
Published: (2026)
Oogiri-Master: Benchmarking Humor Understanding via Oogiri
by: Murakami, Soichiro, et al.
Published: (2025)
Examining Forgetting in Continual Pre-training of Aligned Large Language Models
by: Li, Chen-An, et al.
Published: (2024)
KoCo: Conditioning Language Model Pre-training on Knowledge Coordinates
by: Li, Yudong, et al.
Published: (2026)
Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models
by: Zhuang, Xinlin, et al.
Published: (2025)