| Main Authors: | Nishida, Yuto; Isonuma, Masaru; Oda, Yusuke |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2510.04848 |
Similar Items
Unlearning Traces the Influential Training Data of Language Models
by: Isonuma, Masaru, et al.
Published: (2024)
What's New in My Data? Novelty Exploration via Contrastive Generation
by: Isonuma, Masaru, et al.
Published: (2024)
Investigating Training and Generalization in Faithful Self-Explanations of Large Language Models
by: Doi, Tomoki, et al.
Published: (2025)
Comprehensive Evaluation of Large Language Models for Topic Modeling
by: Doi, Tomoki, et al.
Published: (2024)
Exclusive Unlearning
by: Sasaki, Mutsumi, et al.
Published: (2026)
Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
by: Ozaki, Shintaro, et al.
Published: (2025)
Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality
by: Harada, Yuto, et al.
Published: (2025)
UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation
by: Lu, Huimin, et al.
Published: (2025)
Towards Transfer Unlearning: Empirical Evidence of Cross-Domain Bias Mitigation
by: Lu, Huimin, et al.
Published: (2024)
How a Bilingual LM Becomes Bilingual: Tracing Internal Representations with Sparse Autoencoders
by: Inaba, Tatsuro, et al.
Published: (2025)
How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments
by: Ide, Yusuke, et al.
Published: (2024)
Scaling Laws for Downstream Task Performance of Large Language Models
by: Isik, Berivan, et al.
Published: (2024)
llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length
by: Sugiura, Issa, et al.
Published: (2025)
Generating Diverse Translation with Perturbed kNN-MT
by: Nishida, Yuto, et al.
Published: (2024)
Exploring the Impact of a Transformer's Latent Space Geometry on Downstream Task Performance
by: Marbut, Anna C., et al.
Published: (2024)
Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text
by: Jarca, Andrei, et al.
Published: (2025)
Contrastive Learning for Task-Independent SpeechLLM-Pretraining
by: Züfle, Maike, et al.
Published: (2024)
Understanding and Mitigating Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks
by: Li, Miaomiao, et al.
Published: (2025)
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?
by: Uchiyama, Fumiya, et al.
Published: (2024)
Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks
by: Shapira, Ori, et al.
Published: (2025)
An Evaluation of Sindhi Word Embedding in Semantic Analogies and Downstream Tasks
by: Ali, Wazir, et al.
Published: (2024)
Edit Distances and Their Applications to Downstream Tasks in Research and Commercial Contexts
by: Carmo, Félix do, et al.
Published: (2024)
Long-Tail Crisis in Nearest Neighbor Language Models
by: Nishida, Yuto, et al.
Published: (2025)
Llama-Mimi: Exploring the Limits of Flattened Speech Language Modeling
by: Sugiura, Issa, et al.
Published: (2025)
Vaporetto: Efficient Japanese Tokenization Based on Improved Pointwise Linear Classification
by: Akabe, Koichi, et al.
Published: (2024)
Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation
by: Iluz, Bar, et al.
Published: (2024)
FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom
by: He, Yuanqin, et al.
Published: (2024)
Forecasting Downstream Performance of LLMs With Proxy Metrics
by: Patel, Arkil, et al.
Published: (2026)
Quantifying the Importance of Data Alignment in Downstream Model Performance
by: Chawla, Krrish, et al.
Published: (2025)
Scaling Laws Are Unreliable for Downstream Tasks: A Reality Check
by: Lourie, Nicholas, et al.
Published: (2025)
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
by: Riabi, Arij, et al.
Published: (2021)
How Does Code Pretraining Affect Language Model Task Performance?
by: Petty, Jackson, et al.
Published: (2024)
Adapting Decoder-Based Language Models for Diverse Encoder Downstream Tasks
by: Suganthan, Paul, et al.
Published: (2025)
Optimising Language Models for Downstream Tasks: A Post-Training Perspective
by: Shi, Zhengyan
Published: (2025)
Scaling Laws for Predicting Downstream Performance in LLMs
by: Chen, Yangyi, et al.
Published: (2024)
CoLA: Cross-Modal Low-rank Adaptation for Multimodal Downstream Tasks
by: Suharitdamrong, Wish, et al.
Published: (2026)
Personality as a Probe for LLM Evaluation: Method Trade-offs and Downstream Effects
by: Handa, Gunmay, et al.
Published: (2025)
Refactoring Programs Using Large Language Models with Few-Shot Examples
by: Shirafuji, Atsushi, et al.
Published: (2023)
Protoknowledge Shapes Behaviour of LLMs in Downstream Tasks: Memorization and Generalization with Knowledge Graphs
by: Ranaldi, Federico, et al.
Published: (2025)
Routing-Aligned Fine-Tuning for Multilingual Downstream Tasks in Mixture-of-Experts Models
by: Deng, Guanzhi, et al.
Published: (2026)